egorsmkv / rVADfast

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

Home Page:https://www.sciencedirect.com/science/article/pii/S0885230819300920

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rVADfast

The Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as presented in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method, Computer Speech and Language, 2020. More info on the rVAD GitHub page.

Fast noise-robust voice activity detection algorithm (rVAD-fast).
Version 2.0

02 Dec 2017, Achintya Kumar Sarkar and Zheng-Hua Tan

Usage: python rVAD_fast_2.0.py inWaveFile  outputVadLabel

Refs:
[1] Z.-H. Tan, A.k. Sarkara and N. Dehak, "rVAD: an unsupervised segment-based robust voice activity detection method," Computer Speech and Language, vol. 59, pp. 1-21, 2020. 
[2] Z.-H. Tan and B. Lindberg, "Low-complexity variable frame rate analysis for speech recognition and voice activity detection,” IEEE Journal of Selected Topics in Signal Processing, vol. 4, no. 5, pp. 798-807, 2010.

Contact:
  Prof Zheng-Hua Tan
  Aalborg University, Denmark
  zt@es.aau.dk
  https://vbn.aau.dk/en/persons/107665  

About

This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised Segment-Based Robust Voice Activity Detection Method.

https://www.sciencedirect.com/science/article/pii/S0885230819300920

License:MIT License


Languages

Language:Python 100.0%