Speaker Diarization

pyannote-audio is an open-source toolkit written in Python for speaker diarization.

pyannote-onnx is used to convert the pretrained model defined in PyTorch into the ONNX format and then run it with ONNX Runtime (in C++ or Python).

Only Python 3.8+ is supported.

Usage

Download the pretrained model from Hugging Face pyannote/segmentation-3.0.
Export the pretrained model to ONNX model.
Run the ONNX model with ONNX Runtime in C++ or Python.

$ pip install torch onnx https://github.com/pyannote/pyannote-audio/archive/refs/heads/develop.zip
$ python export_onnx.py pytorch_model.bin segmentation-3.0.onnx

$ pip install pyannote-onnx
$ diarize data/test_16k.wav --plot

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Speaker Diarization

Usage

Files

README.md

Latest commit

History

README.md

File metadata and controls

Speaker Diarization

Usage