TensorVox

NOTE: In very early alpha stage.

TensorVox is an application designed to enable user-friendly and lightweight neural speech synthesis in the desktop, aimed at increasing accessibility to such technology.

Powered by TensorflowTTS, it is written in pure C++/Qt, using the Tensorflow C API for interacting with the models. This way, we can perform inference without having to install gigabytes worth of pip libraries, just a 100MB DLL.

Try it out!

Note: The download version is a very old iteration

Download the program, and LJSpeech model
Run.

Supported architectures

Currently, only FastSpeech2 (phoneme-based) and Multi-Band MelGAN from TensorflowTTS are supported.

Build instructions

Currently, only Windows x64 is supported.

Requirements:

Qt Creator
MSVC 2017 (v141) compiler

Primed build (with all provided libraries):

Download precompiled binary dependencies and includes
Unzip it so that the deps folder is in the same place as the .pro and main source files.
Open the project with Qt Creator, add your compiler and compile

Note that to try your shiny new executable you'll need to download the program as described above and insert the models folder where your new build is output.

TODO: Add instructions for compile from scratch.

Externals (and thanks)

Tensorflow C API: https://www.tensorflow.org/install/lang_c
CppFlow (TF C API -> C++ wrapper): https://github.com/serizba/cppflow
AudioFile (for WAV export): https://github.com/adamstark/AudioFile
Frameless Dark Style Window: https://github.com/Jorgen-VikingGod/Qt-Frameless-Window-DarkStyle
JSON for modern C++: https://github.com/nlohmann/json
r8brain-free-src (Resampling): https://github.com/avaneev/r8brain-free-src
rnnoise (CMake version, denoising output): https://github.com/almogh52/rnnoise-cmake

Contact

You can open an issue here or join the Discord server and discuss/ask anything there

Note about licensing

This project is MIT licensed almost everywhere except for Vietnam, where, due to using TensorflowTTS models as backend, it cannot be used without permission from the TensorflowTTS authors. See here for details

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
ext		ext
g2p_train		g2p_train
res		res
.gitignore		.gitignore
EnglishPhoneticProcessor.cpp		EnglishPhoneticProcessor.cpp
EnglishPhoneticProcessor.h		EnglishPhoneticProcessor.h
FastSpeech2.cpp		FastSpeech2.cpp
FastSpeech2.h		FastSpeech2.h
LICENSE.md		LICENSE.md
MultiBandMelGAN.cpp		MultiBandMelGAN.cpp
MultiBandMelGAN.h		MultiBandMelGAN.h
README.md		README.md
TensorVox.pro		TensorVox.pro
TextTokenizer.cpp		TextTokenizer.cpp
TextTokenizer.h		TextTokenizer.h
Voice.cpp		Voice.cpp
Voice.h		Voice.h
VoxCommon.cpp		VoxCommon.cpp
VoxCommon.hpp		VoxCommon.hpp
main.cpp		main.cpp
mainwindow.cpp		mainwindow.cpp
mainwindow.h		mainwindow.h
mainwindow.ui		mainwindow.ui
modelinfodlg.cpp		modelinfodlg.cpp
modelinfodlg.h		modelinfodlg.h
modelinfodlg.ui		modelinfodlg.ui
phddialog.cpp		phddialog.cpp
phddialog.h		phddialog.h
phddialog.ui		phddialog.ui
phonemizer.cpp		phonemizer.cpp
phonemizer.h		phonemizer.h
phoneticdict.cpp		phoneticdict.cpp
phoneticdict.h		phoneticdict.h
phonetichighlighter.cpp		phonetichighlighter.cpp
phonetichighlighter.h		phonetichighlighter.h
stdres.qrc		stdres.qrc
tfg2p.cpp		tfg2p.cpp
tfg2p.h		tfg2p.h
voicemanager.cpp		voicemanager.cpp
voicemanager.h		voicemanager.h
voxer.cpp		voxer.cpp
voxer.h		voxer.h
winicon.ico		winicon.ico

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TensorVox

Try it out!

Supported architectures

Build instructions

Externals (and thanks)

Contact

Note about licensing

About

Releases

Packages

Languages

License

brainbpe/TensorVox

Folders and files

Latest commit

History

Repository files navigation

TensorVox

Try it out!

Supported architectures

Build instructions

Externals (and thanks)

Contact

Note about licensing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages