Provides PyTorch code for streaming attention mechanisms for end-to-end automatic speech recognition
Provides algorithm descriptions of how these streaming attention mechanisms are implemented, for better understanding
Compatible with ESPnet
sMoChA is described in "Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition" by H Miao, G Gao, P Zhang, T Li, Y Yan, in Proceedings of Interspeech 2019.
MTA is described in a journal paper that has been submitted but is not yet published.