GitHub - alphacep/vosk-api at 0.3.5

3 Branches 30 Tags

Name	Name	Last commit message	Last commit date
Latest commit nshmyrev Fix threading bug Apr 22, 2020 ffd810f · Apr 22, 2020 History 93 Commits
android	android	Fix pocketsphinx references in Android sources	Apr 9, 2020
csharp	csharp	C-only wrapper	Apr 4, 2020
doc	doc	How to ask for the accuracy updates	Apr 20, 2020
ios	ios	UI updates	Apr 20, 2020
java	java	Introduce pure C api to deal with Windows runtime issues and make it …	Apr 3, 2020
nodejs	nodejs	Clarify licences	Jan 19, 2020
python	python	Fix threading bug	Apr 22, 2020
src	src	Fix threading bug	Apr 22, 2020
travis	travis	Version 0.3.4 and arm wheels for python 3.6	Apr 21, 2020
.gitignore	.gitignore	Repair travis build	Apr 4, 2020
.travis.yml	.travis.yml	Added basic travis	Jan 10, 2020
COPYING	COPYING	Imported Python bindings and Node bindings	Jan 2, 2020
README.md	README.md	Added per-language readme	Apr 22, 2020
README.ru.md	README.ru.md	Added per-language readme	Apr 22, 2020
README.zh.md	README.zh.md	Update README.zh.md	Apr 22, 2020

Repository files navigation

РУС

中文

Vosk is a speech recognition toolkit. The best things in Vosk are:

Supports 8 languages - English, German, French, Spanish, Portuguese, Chinese, Russian, Vietnamese. More is coming.
Works offline even on lightweight devices - Raspberry Pi, Android, iOS
Installs with simple pip3 install vosk
Portable models per language is just only 50Mb but there are much bigger server models available.
Provides streaming API for the best user experience (unlike popular speech-recognition python package)
There bindings for different prograbmming languages too - java/csharp/javascript etc.
Allows quick reconfiguration of vocabulary for best accuracy.
Supports speaker identification beside simple speech recognition

Android build

cd android
gradle build

Please note that medium blog post about 64-bit is not relevant anymore, the script builds x86, arm64 and armv7 libraries automatically without any modifications.

For example of Android application using Vosk-API check https://github.com/alphacep/kaldi-android-demo project

iOS build

Available on request. Drop as a mail at [email protected].

Python installation from Pypi

The easiest way to install vosk api is with pip. You do not have to compile anything. We currently support only Linux on x86_64 and Raspberry Pi. Other systems (windows, mac) will come soon.

Make sure you have newer pip and python:

Python version >= 3.4
pip version >= 19.0

Uprade python and pip if needed. Then install vosk on Linux with a simple command

pip3 install vosk

Websocket Server and GRPC server

We also provide a websocket server and grpc server which can be used in telephony and other applications. With bigger models adapted for 8khz audio it provides more accuracy.

The server is installed with docker and can run with a single command:

docker run -d -p 2700:2700 alphacep/kaldi-en:latest

For details see https://github.com/alphacep/vosk-server

Compilation from source

If you still want to build from scratch, you can compile Kaldi and Vosk yourself. The compilation is straightforward but might be a little confusing for newbie. In case you want to follow this, please watch the errors.

Kaldi compilation for local python, node and java modules

git clone -b lookahead --single-branch https://github.com/alphacep/kaldi
cd kaldi/tools
make

install all dependencies and repeat make if needed

extras/install_openblas.sh
cd ../src
./configure --mathlib=OPENBLAS --shared --use-cuda=no
make -j 10

Python module build

Then build the python module

export KALDI_ROOT=<KALDI_ROOT>
cd python
python3 setup.py install

Running the example code with python

Run like this:

cd vosk-api/python/example
wget https://github.com/alphacep/kaldi-android-demo/releases/download/2020-01/alphacep-model-android-en-us-0.3.tar.gz
tar xf alphacep-model-android-en-us-0.3.tar.gz 
mv alphacep-model-android-en-us-0.3 model-en
python3 ./test_simple.py test.wav

To run with your audio file make sure it has proper format - PCM 16khz 16bit mono, otherwise decoding will not work.

You can find other examples of using a microphone, decoding with a fixed small vocabulary or speaker identification setup in python/example subfolder

Java example API build

Or Java

cd java && KALDI_ROOT=<KALDI_ROOT> make
wget https://github.com/alphacep/kaldi-android-demo/releases/download/2020-01/alphacep-model-android-en-us-0.3.tar.gz
tar xf alphacep-model-android-en-us-0.3.tar.gz 
mv alphacep-model-android-en-us-0.3 model
make run

C# build

Or C#

cd csharp && KALDI_ROOT=<KALDI_ROOT> make
wget https://github.com/alphacep/kaldi-android-demo/releases/download/2020-01/alphacep-model-android-en-us-0.3.tar.gz
tar xf alphacep-model-android-en-us-0.3.tar.gz 
mv alphacep-model-android-en-us-0.3 model
mono test.exe

Models for different languages

For information about models see the documentation on available models.

Contact Us

If you have any questions, feel free to

Post an issue here on github
Send us an e-mail at [email protected]
Join our group dedicated to speech recognition on Telegram @speech_recognition

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Android build

iOS build

Python installation from Pypi

Websocket Server and GRPC server

Compilation from source

Kaldi compilation for local python, node and java modules

Python module build

Running the example code with python

Java example API build

C# build

Models for different languages

Contact Us

About

Releases 20

Contributors 42

Languages

License

alphacep/vosk-api

Folders and files

Latest commit

History

Repository files navigation

Android build

iOS build

Python installation from Pypi

Websocket Server and GRPC server

Compilation from source

Kaldi compilation for local python, node and java modules

Python module build

Running the example code with python

Java example API build

C# build

Models for different languages

Contact Us

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 20

Contributors 42

Languages