A scalable Python module for robust audio transcription using OpenAI's Whisper model. Supports multiple languages, batch processing, and output formats like JSON and SRT.

🎙️ Whisper Transcription Module

🌟 Overview

A powerful, flexible Python module for audio transcription leveraging OpenAI's Whisper model, designed to transform audio content into accurate, multilingual text.

✨ Key Features

  • 🔊 Advanced Audio Transcription

    • Utilizes state-of-the-art Whisper AI technology
    • Supports multiple languages and dialects
  • 🌐 Multilingual Support

    • Transcribe and translate audio across 99 languages
    • Automatic language detection
  • 📄 Flexible Output Formats

    • TXT, JSON, SRT, VTT
    • Customizable transcription settings
  • 📂 Versatile Processing

    • Single file and batch processing
    • Configurable model sizes
    • GPU and CPU support
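
As a sketch of the output-format support, the snippet below renders Whisper-style segments (dicts with `start`, `end`, and `text` keys, the shape `model.transcribe` returns in its `"segments"` list) as SRT. The segment data here is illustrative, and the module's own SRT writer may differ in detail:

```python
def srt_timestamp(seconds):
    """Format seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    ms = int(round(seconds * 1000))
    h, ms = divmod(ms, 3_600_000)
    m, ms = divmod(ms, 60_000)
    s, ms = divmod(ms, 1_000)
    return f"{h:02d}:{m:02d}:{s:02d},{ms:03d}"

def segments_to_srt(segments):
    """Render Whisper-style segments as an SRT document."""
    blocks = []
    for i, seg in enumerate(segments, start=1):
        blocks.append(
            f"{i}\n"
            f"{srt_timestamp(seg['start'])} --> {srt_timestamp(seg['end'])}\n"
            f"{seg['text'].strip()}\n"
        )
    return "\n".join(blocks)

# Illustrative segments in the shape whisper's transcribe() produces:
segments = [
    {"start": 0.0, "end": 2.5, "text": " Hello world."},
    {"start": 2.5, "end": 5.0, "text": " This is a test."},
]
print(segments_to_srt(segments))
```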

📚 Documentation

| 🇺🇸 English | 🇹🇷 Türkçe |
| --- | --- |
| Installation Guide | Installation Guide |
| CLI Usage Guide | Komut Satırı Kullanım Kılavuzu |
| Module Usage Guide | Modül Kullanım Kılavuzu |
| Feature Specifications | Özellik Spesifikasyonları |

🚀 Demo Scripts

The `demo_scripts` directory contains example scripts covering common usage scenarios:

| Scenario | Description | Key Features |
| --- | --- | --- |
| 1: Basic Transcription | Simple audio transcription | Default `base` model, quick processing |
| 2: Multilingual Translation | Translate audio to English | Multi-language support, configurable logging |
| 3: Batch Processing | Process multiple audio files | Directory-wide transcription, format flexibility |
| 4: Advanced Configuration | Detailed transcription control | Quality filtering, segment management |
| 5: Error Handling | Robust error management | Fallback strategies, comprehensive logging |
| 6: Advanced Batch Processing | Large-scale transcription | Parallel processing, detailed reporting |
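
The batch-processing scenarios above share a common pattern: collect the audio files in a directory, transcribe each one, and keep going when a file fails. A minimal sketch of that pattern, where `transcribe_file` is a hypothetical callable standing in for the module's actual transcription call:

```python
from pathlib import Path

AUDIO_EXTENSIONS = {".mp3", ".wav", ".m4a", ".flac", ".ogg"}

def find_audio_files(directory):
    """Collect supported audio files from a directory, sorted by name."""
    return sorted(
        p for p in Path(directory).iterdir()
        if p.suffix.lower() in AUDIO_EXTENSIONS
    )

def batch_transcribe(directory, transcribe_file):
    """Run `transcribe_file` over every audio file in `directory`.

    `transcribe_file` is a hypothetical callable (e.g. wrapping a
    Whisper model's transcribe call) returning the transcript text.
    Failures are recorded instead of aborting the whole batch.
    """
    results = {}
    for path in find_audio_files(directory):
        try:
            results[path.name] = transcribe_file(path)
        except Exception as exc:  # record the error, continue the batch
            results[path.name] = f"ERROR: {exc}"
    return results
```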

📋 System Requirements

💻 Computational Resources

  • Python: 3.8+
  • CPU: All models supported
  • GPU: Optional acceleration
    • Use `--device cuda` for GPU transcription
    • Automatic CPU fallback
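
The automatic CPU fallback can be sketched as a small helper. `pick_device` is hypothetical, not part of the module's API; it consults `torch.cuda.is_available()` only when torch imports cleanly, and otherwise falls back to the CPU:

```python
def pick_device(preferred="cuda"):
    """Return "cuda" when requested and available, else fall back to "cpu"."""
    if preferred == "cuda":
        try:
            import torch  # optional here; fall back if missing
            if torch.cuda.is_available():
                return "cuda"
        except ImportError:
            pass
    return "cpu"

# Possible usage with the whisper package:
#   model = whisper.load_model("base", device=pick_device())
```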

📦 Dependencies

  • openai-whisper
  • torch
  • numpy
  • soundfile
  • ffmpeg-python

🤝 Contributing

  1. Fork the repository
  2. Create a virtual environment
  3. Install development dependencies: `pip install -e .[dev]`
  4. Run tests: `pytest`
  5. Submit a pull request

🐛 Support

📄 License

MIT License - see the LICENSE file for details.

🙏 Acknowledgements

  • OpenAI for the Whisper model
  • Python open-source community
