Recording long conversation, trancipting, then summarize, has became much simpler and affordable with new LLMs.
This is a small project to utilize OpenAI and Speech recognizer (supports various models) APIs.
Using the user primary microphone and a streamlit web UI, the user could choose one of three languages [English, German, Arabic], total duration of the recording time in minutes, and how many seconds to save as a file as the time goes.
- Clone Repository
git clone https://github.com/ahmadhatahet/transcript_and_summarize.git
cd transcript_and_summarize
- Create virtual environment
python -m venv transcript_and_summarize
# windows
source transcript_and_summarize\Scripts\activate
# linux
# source transcript_and_summarize/bin/activate
- Install packages
pip install -r requirements.txt
- Start Streamlit app
streamlit run app.py
- ✔ Multi language support (EN, DE, AR).
- ✔ Add options to the UI for faster modifications.
Arabic language is one of the Semitic language , thus, it is not well recognized by the models used yet. Transcripting works fairly well, but summarizing is less than average. Maybe a better prompt could help increase the quality of the summarized text.