sebastianruder / NLP-progress

Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.

Home Page:https://nlpprogress.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Voice Activity Detection

unhammer opened this issue · comments

I didn't see anything on VAD, so maybe that should be a new category? I don't know enough about it to say if it could be considered a language independent task, nor what the current state of the art is (which is why I'm opening this issue ;-))

It does seem like webrtc-vad is used a lot, so that might be the de-facto baseline, while
https://ieeexplore.ieee.org/document/8309294 / https://github.com/jtkim-kaist/VAD seems like a contender for state-of-the-art (has a freely available dataset).

Thanks for the mention. We could potentially add this to a speech-related section if there's interest.