ASR

Description

A collection of Kaldi-based scripts for speech recognition, meant to simplify the training process as much as possible.

Speech Recognition

1. Data preparation
2. Lexicon generation
3. Grammar generation (pocolm & SRILM)
4. Feature extraction
5. HMM-GMM training
6. Data augmentation (speed, volume, reverb, music, noise, babble)
7. Embedding (i-vector, x-vector)
8. DNN training
9. RNNLM training
10. Rescoring
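
Kaldi recipes conventionally gate each step behind a numeric stage, so an interrupted run can be resumed without repeating earlier work. The sketch below shows one way the steps above could be selected by stage number. The stage names and the `stages_to_run` helper are hypothetical, chosen for illustration only; they are not the names of the scripts in this repository.

```python
# Hypothetical stage names mirroring the list above.
# They are illustrative and do not correspond to actual script names.
STAGES = [
    "data_prep",
    "lexicon",
    "grammar",
    "features",
    "hmm_gmm",
    "augmentation",
    "embedding",
    "dnn",
    "rnnlm",
    "rescoring",
]


def stages_to_run(start=1, stop=len(STAGES)):
    """Return (number, name) pairs for stages start..stop, 1-indexed and inclusive."""
    if not 1 <= start <= stop <= len(STAGES):
        raise ValueError(f"expected 1 <= start <= stop <= {len(STAGES)}")
    return [(i, STAGES[i - 1]) for i in range(start, stop + 1)]


if __name__ == "__main__":
    # Resume from feature extraction and stop after HMM-GMM training.
    for number, name in stages_to_run(start=4, stop=5):
        print(f"stage {number}: {name}")
```

Keeping the stage numbers 1-indexed matches the numbering of the list, so "resume from stage 6" refers to data augmentation in both places.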

Datasets

| English      | Spanish      |
|--------------|--------------|
| Common Voice | Common Voice |
|              | Heroico      |
|              | DIMEx        |

License

GPL

Authors

(c) 2020 Sylvain Le Groux