SAPI4

Web interface for Microsoft Sam & friends written in C & D (vibe-d), runnable on headless linux.

Setup

SAPI4 server compilation on local Windows machine

Install Microsoft Speech SDK 4.0 (SAPI4SDK.exe)
Run build.bat. If you have Visual Studio older than 2022, change vcvars32 path
Install ldc2 and dub
Go to SAPI4_web and compile the web server: dub --compiler=ldc2 --arch=x86 --build=release

Web server compilation & SAPI4 setup on remote Linux machine

Install wine (sudo apt install wine), 1.8.7 is fine. If wine doesn't work on your system, you must stop here
In a VNC/RDP session or X11-forwarded SSH connection:
- Install Microsoft Speech 4.0 API in the wine environment: wine spchapi.exe
- Install Lernout & Hauspie TruVoice Amer. Eng. TTS Engine in the wine environment: wine tv_enua.exe
Move:
- public (static web assets)
- sapi4.exe (web server)
- sapi4.dll (SAPI4 voice audio generation library)
- sapi4limits.exe (SAPI4 voice enumerator)
- sapi4out.exe (SAPI4 voice audio generation program) to a new empty folder
Install xvfb (apt install xvfb)
Run web server: while true; do; xvfb-run -a wine sapi4.exe; sleep 1; done;
Pass the web server through nginx - add this to nginx config: location ^~ /SAPI4/ { proxy_pass http://127.0.0.1:23451/; }. Note that the web server will work only on /SAPI4/ location, if you want to change that, change references to scripts and other assets in SAPI4_web/views/layout.dt, SAPI4_web/public/scripts/tts.js.
Go to http(s)://localhost/SAPI4/, put soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi soi as text, set speed to 450 and enjoy.

You might be familiar with Speakonia. As CFS-Technologies have released an unlimited license (http://www.cfs-technologies.com/home/) for Speakonia, you can get .wavs Microsoft Sam & other voices genereated text with Speakonia too, however web interface is more convenient and generates text much faster. Speakonia is set to generate text at real-time of speaking speed and SAPI4 server is set to generate text at x16777215 of real-time speaking speed. You can download .wavs from web interface too (right click the player and press Save audio as..., at least on Chrome).

You can generate text from an API too, endpoints are /SAPI4/VoiceLimitations?voice=(voice) and /SAPI4/SAPI4?text=(text)[&voice=(voice)][&pitch=(pitch)][&speed=(speed)]. () - required parameters, [] - optional parameters.

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
SAPI4_web		SAPI4_web
LICENSE		LICENSE
README.md		README.md
SAPI4SDK.exe		SAPI4SDK.exe
build.bat		build.bat
sapi4.cpp		sapi4.cpp
sapi4.hpp		sapi4.hpp
sapi4limits.cpp		sapi4limits.cpp
sapi4out.cpp		sapi4out.cpp
spchapi.exe		spchapi.exe
tv_enua.exe		tv_enua.exe

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SAPI4

Setup

SAPI4 server compilation on local Windows machine

Web server compilation & SAPI4 setup on remote Linux machine

About

Releases

Packages

Contributors 3

Languages

License

TETYYS/SAPI4

Folders and files

Latest commit

History

Repository files navigation

SAPI4

Setup

SAPI4 server compilation on local Windows machine

Web server compilation & SAPI4 setup on remote Linux machine

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages