maxbaluev / vad

Voice activity detector (VAD) for the browser with a simple API

Home Page: https://www.vad.ricky0123.com

Voice Activity Detection for JavaScript

⚠️ This project no longer publishes to the @ricky0123/vad npm package. Please use the new platform-specific packages: @ricky0123/vad-web, @ricky0123/vad-node, etc.

This package aims to provide an accurate, user-friendly voice activity detector (VAD) that runs in the browser. It also has limited support for Node.js. Currently, it runs Silero VAD [1] using ONNX Runtime Web in the browser and ONNX Runtime Node.js in Node.js.

For documentation and a demo, visit vad.ricky0123.com.

Quick Start

To use the VAD in the browser without a bundler, include the following script tags:

<script src="https://cdn.jsdelivr.net/npm/onnxruntime-web/dist/ort.js"></script>
<script src="https://cdn.jsdelivr.net/npm/@ricky0123/vad-web@0.0.7/dist/bundle.min.js"></script>
<script>
  async function main() {
    const myvad = await vad.MicVAD.new({
      onSpeechStart: () => {
        console.log("Speech start detected")
      },
      onSpeechEnd: (audio) => {
        // do something with `audio` (Float32Array of audio samples at sample rate 16000)...
      }
    })
    myvad.start()
  }
  main()
</script>
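
The `onSpeechEnd` callback above receives a `Float32Array` of samples at 16000 Hz. As a minimal sketch of "doing something" with that audio, the helpers below compute the segment duration and encode the samples as a 16-bit mono PCM WAV file. Both `encodeWav` and `speechDurationSeconds` are hypothetical names written for this example and are not part of `@ricky0123/vad-web`; the sketch assumes the samples are mono and lie roughly in the range [-1, 1].

```javascript
// Hypothetical helpers for illustration only; not part of @ricky0123/vad-web.

// Duration of a speech segment in seconds, given its samples and sample rate.
function speechDurationSeconds(samples, sampleRate = 16000) {
  return samples.length / sampleRate
}

// Encode mono Float32 samples as a 16-bit PCM WAV file (returns an ArrayBuffer).
function encodeWav(samples, sampleRate = 16000) {
  const bytesPerSample = 2
  const dataSize = samples.length * bytesPerSample
  const buffer = new ArrayBuffer(44 + dataSize) // 44-byte header + sample data
  const view = new DataView(buffer)

  const writeAscii = (offset, text) => {
    for (let i = 0; i < text.length; i++) {
      view.setUint8(offset + i, text.charCodeAt(i))
    }
  }

  writeAscii(0, "RIFF")
  view.setUint32(4, 36 + dataSize, true) // remaining file size after this field
  writeAscii(8, "WAVE")
  writeAscii(12, "fmt ")
  view.setUint32(16, 16, true) // size of the fmt chunk
  view.setUint16(20, 1, true) // audio format: 1 = PCM
  view.setUint16(22, 1, true) // channels: mono
  view.setUint32(24, sampleRate, true)
  view.setUint32(28, sampleRate * bytesPerSample, true) // byte rate
  view.setUint16(32, bytesPerSample, true) // block align
  view.setUint16(34, 16, true) // bits per sample
  writeAscii(36, "data")
  view.setUint32(40, dataSize, true)

  for (let i = 0; i < samples.length; i++) {
    // Clamp to [-1, 1], then scale to the signed 16-bit range.
    const s = Math.max(-1, Math.min(1, samples[i]))
    view.setInt16(44 + i * 2, s < 0 ? s * 0x8000 : s * 0x7fff, true)
  }
  return buffer
}
```

Inside `onSpeechEnd`, the result could then be wrapped for playback or upload, for example with `new Blob([encodeWav(audio)], { type: "audio/wav" })`.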

Documentation for bundling the voice activity detector for the browser, or for using it in Node.js or React projects, can be found at vad.ricky0123.com.

References

[1] Silero Team. (2021). Silero VAD: pre-trained enterprise-grade Voice Activity Detector (VAD), Number Detector and Language Classifier. GitHub repository, https://github.com/snakers4/silero-vad. Contact: hello@silero.ai.
