ASP_Project

This project is for Audio signal class final project.

VAD part

The current VAD folder contains a pipline for TDNN-LSTM frame work and a Pyannote framework using AMI dataset which is not the main problem we are solving in this project. The pipeline here is just for comparing the result with our result. In order to reproduce the result of these pipelines, please first install the OVAD package using the following commands:

git clone https://github.com/desh2608/ovad.git

python setup.py install

Goal of this project

In this project we are trying to work on the Overlapped Voice Activity Detection problem as a Multiclass classification problem with three different classes: {Silence, Speech, Overlapped speech}. We are trying to work on this problem in a signal processing aspect. We will try to solve this problem using Energy Based, Format detector, and Pitch detector. We will also try to work on different speech augmentation methods to provide a better result.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
VAD		VAD
data		data
dataprocess		dataprocess
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ASP_Project

VAD part

Goal of this project

About

Releases

Packages

Contributors 2

Languages

License

Gnosil/ASP_Project

Folders and files

Latest commit

History

Repository files navigation

ASP_Project

VAD part

Goal of this project

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages