Transcribe-Transliterate-Timestamp-Audiofile

A versatile tool for multi-language audio transcription, transliteration, and word timestamp extraction. Supporting English, Telugu, Hindi, Tamil, Malayalam, and Kannada, it offers a Gradio-powered interface for easy audio analysis, script conversion, and precise word tracking. Perfect for linguists, researchers, and developers.

Features

  • Multi-Language Transcription: Supports transcription in English, Telugu, Hindi, Tamil, Malayalam, and Kannada.
  • Script Transliteration: Transliterates the transcribed text into a different script for enhanced compatibility.
  • Word Timestamp Extraction: Extracts timestamps for specific words within the audio file.
  • User-Friendly Interface: A Gradio-based web interface for easy interaction and instant results.
  • Audio Format Compatibility: Converts uploaded audio files to WAV format for seamless processing.
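The README does not say how the WAV conversion is done. As a minimal sketch, assuming the `ffmpeg` command-line tool is installed and on the PATH (an assumption, not something the repository confirms), an uploaded file could be converted to mono 16 kHz WAV like this. The function names and the 16 kHz mono defaults are illustrative choices, not the project's actual code.

```python
import subprocess
from pathlib import Path


def build_wav_command(src, dst, sample_rate=16000, channels=1):
    """Return the ffmpeg argument list that converts src to a WAV file at dst."""
    return [
        "ffmpeg",
        "-y",                     # overwrite dst if it already exists
        "-i", str(src),           # input file in any format ffmpeg can decode
        "-ar", str(sample_rate),  # output sample rate in Hz
        "-ac", str(channels),     # number of output channels
        str(dst),
    ]


def convert_to_wav(src):
    """Convert src to a WAV file next to the original and return its path.

    Hypothetical helper: requires ffmpeg on the PATH for non-WAV input.
    """
    src = Path(src)
    if src.suffix.lower() == ".wav":
        return src  # already WAV, nothing to convert
    dst = src.with_suffix(".wav")
    subprocess.run(build_wav_command(src, dst), check=True, capture_output=True)
    return dst
```

Keeping the command builder separate from the `subprocess.run` call makes the conversion settings easy to inspect and test without invoking ffmpeg.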

Installation

To get started, follow these steps:

  1. Clone the Repository:

    git clone https://github.com/LSShrivathsan/Transcribe-Transliterate-Timestamp-Audiofile.git
    cd Transcribe-Transliterate-Timestamp-Audiofile
  2. Install the Required Dependencies:

    pip install -r requirements.txt
  3. Run the Gradio Interface:

    python app.py

Usage

  1. Upload Audio: Use the interface to upload an audio file.
  2. Select Language: Choose the transcription language from the options.
  3. Search Word: Enter a word to extract its timestamps from the audio.
  4. View Results: See the transcribed text, transliterated script, and word timestamps.

In more detail:

  1. Upload an Audio File:

    • The app supports various audio formats. Upload your file to start processing.
  2. Choose a Language:

    • Select the desired language for transcription.
  3. Enter a Word for Timestamp Extraction:

    • Input the word you want to search within the audio. The app will provide timestamps where this word occurs.
  4. View and Download Results:

    • Get the transcription, transliteration, and timestamps. You can copy or download the results.
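The word search in step 3 can be sketched as a scan over word-level timings. The example below assumes the transcription step yields a list of `(word, start_seconds, end_seconds)` tuples; that data shape and all function names are hypothetical and not taken from the repository.

```python
import string

# ASCII punctuation plus the Devanagari danda and double danda used in Hindi text.
PUNCTUATION = string.punctuation + "\u0964\u0965"


def normalize(word):
    """Strip surrounding punctuation and fold case so matches are lenient."""
    return word.strip(PUNCTUATION).casefold()


def find_word_timestamps(words, query):
    """Return (start, end) pairs for every occurrence of query in words."""
    target = normalize(query)
    return [(start, end) for word, start, end in words if normalize(word) == target]


def format_timestamp(seconds):
    """Format a time in seconds as MM:SS.mmm for display."""
    total_ms = round(seconds * 1000)
    minutes, rest_ms = divmod(total_ms, 60_000)
    secs, ms = divmod(rest_ms, 1000)
    return f"{minutes:02d}:{secs:02d}.{ms:03d}"


if __name__ == "__main__":
    # Made-up timings for illustration only.
    sample = [("Hello", 0.0, 0.4), ("world,", 0.5, 0.9), ("hello", 1.2, 1.6)]
    for start, end in find_word_timestamps(sample, "hello"):
        print(format_timestamp(start), "-", format_timestamp(end))
```

Matching on normalized words means a search for "hello" also finds "Hello" and "hello," in the transcript. Whether the actual app matches this leniently is not stated in the README.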

Contributing

Contributions are welcome. To contribute, fork the repository, make your changes on a feature branch, and open a pull request.

License

This project is licensed under the MIT License.

Contact

For any questions or inquiries, please contact [email protected]
