I faced the task of enabling search within meeting transcripts, spanning discussions among 2 to 4 participants, whether conducted online or offline. The aim was to seamlessly reference past discussions during live conversations within the same group.
The objective was to develop a foundational solution that could later evolve to address this challenge comprehensively. It served as a checkpoint to identify all essential considerations for tackling the aforementioned problem, including:
- Real-time transcription of live conversations, regardless of their online or offline nature.
- Retrieval of past meeting conversations within the same group.
- Ensuring the accuracy and efficiency of reference searches.
- Exploring various scenarios to be accounted for in the solution design process.
Live-Microphone.mp4
Offline-Upload.Mode.mp4
Update environment variable for hugging face token in environment.yml
conda env create -f environment.yml
conda activate live_speech_reference_search
docker compose up
gradio app.py
UI Accessible at http://localhost:7860/
python -m spacy download en_core_web_sm