A powerful, flexible Python module for audio transcription leveraging OpenAI's Whisper model, designed to transform audio content into accurate, multilingual text.
-
🔊 Advanced Audio Transcription
- Utilizes state-of-the-art Whisper AI technology
- Supports multiple languages and dialects
-
🌐 Multilingual Support
- Transcribe and translate audio across 99 languages
- Automatic language detection
-
📄 Flexible Output Formats
- TXT, JSON, SRT, VTT
- Customizable transcription settings
-
📂 Versatile Processing
- Single file and batch processing
- Configurable model sizes
- GPU and CPU support
🇺🇸 English | 🇹🇷 Türkçe |
---|---|
Installation Guide | Installation Guide |
CLI Usage Guide | Komut Satırı Kullanım Kılavuzu |
Module Usage Guide | Modül Kullanım Kılavuzu |
Feature Specifications | Özellik Spesifikasyonları |
The demo_scripts
directory offers comprehensive scenarios demonstrating the module's capabilities:
Scenario | Description | Key Features |
---|---|---|
1: Basic Transcription | Simple audio transcription | Default 'base' model, quick processing |
2: Multilingual Translation | Translate audio to English | Multi-language support, configurable logging |
3: Batch Processing | Process multiple audio files | Directory-wide transcription, format flexibility |
4: Advanced Configuration | Detailed transcription control | Quality filtering, segment management |
5: Error Handling | Robust error management | Fallback strategies, comprehensive logging |
6: Advanced Batch Processing | Large-scale transcription | Parallel processing, detailed reporting |
- Python: 3.8+
- CPU: All models supported
- GPU: Optional acceleration
- Use
--device cuda
for GPU transcription - Automatic CPU fallback
- Use
- openai-whisper
- torch
- numpy
- soundfile
- ffmpeg-python
- Fork the repository
- Create a virtual environment
- Install development dependencies:
pip install -e .[dev]
- Run tests:
pytest
- Submit a pull request
- Open an Issue
- Consult Troubleshooting Guide
MIT License - see the LICENSE file for details.
- OpenAI for the Whisper model
- Python open-source community