deep-learning esp32 wake-word-detection wakeword esp32-s3 voice-activity-detection hotword-detection

libnyumaya_esp32

Experimental support for nyumaya audio recognition on ESP32

Currently, only the ESP32-S3 is supported.

Dependencies:

ESP-NN
ESP-DSP

Steps to use your custom model:

The model is provided as an unsigned char array. Pass this array to the function addModelFromBuffer. In this example, the array is included via marvin_model.h and marvin_model.c, which are listed as source files in CMakeLists.txt.

Limitations:

Currently, only one keyword can be listened for at a time.
The library needs to allocate a 115 kB memory block in IRAM.
PSRAM is required.

The wake words are compatible with the corresponding version from our other repository. However, to make them loadable without a file system, they are converted to a C array using xxd:

xxd -i model.premium > model.cc
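
As a rough illustration of what the xxd conversion produces: xxd -i emits an unsigned char array plus an unsigned int length, with both identifiers derived from the input file name (non-alphanumeric characters become underscores). The sketch below shows that shape with placeholder bytes, not a real model. The helper model_buffer_is_usable is hypothetical and only illustrates the pointer-and-length pair you would hand on to addModelFromBuffer, whose exact signature is not documented here.

```cpp
#include <cstddef>

// Shape of the output of `xxd -i model.premium`.
// The bytes are placeholders for illustration only, not a real model.
unsigned char model_premium[] = {
  0x00, 0x01, 0x02, 0x03, 0x04, 0x05, 0x06, 0x07
};
unsigned int model_premium_len = 8;

// Hypothetical helper (not part of the library): basic sanity check on the
// buffer and length before passing them to the model loader.
bool model_buffer_is_usable(const unsigned char* data, unsigned int len) {
    return data != nullptr && len > 0;
}
```

Because the array lives in a source file, a matching header only needs extern declarations of the array and its length, which is presumably what marvin_model.h provides for the example model.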

VAD:

Voice activity detection is supported for commercial customers. VAD is defined here as follows: activity should be detected if speech is present in the last 1.2 seconds of audio. Music, singing, breathing, and coughing should not count as speech. The output consists of raw probabilities and should be filtered in an appropriate way.
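
One possible way to filter the raw probabilities is exponential smoothing followed by hysteresis thresholds, so that a single spike does not toggle the decision. The following is a minimal sketch under that assumption. The class VadSmoother and its parameters (alpha, on_threshold, off_threshold) are hypothetical and not part of the library, and suitable values depend on your application.

```cpp
// Hypothetical post-filter for raw VAD probabilities (not part of the library).
class VadSmoother {
public:
    // alpha: smoothing weight of the newest sample, in (0, 1].
    // on_threshold / off_threshold: hysteresis levels, on_threshold > off_threshold.
    VadSmoother(float alpha, float on_threshold, float off_threshold)
        : alpha_(alpha), on_(on_threshold), off_(off_threshold) {}

    // Feed one raw probability in [0, 1]; returns the filtered decision.
    bool update(float raw_probability) {
        // Exponential moving average of the raw output.
        smoothed_ = alpha_ * raw_probability + (1.0f - alpha_) * smoothed_;
        // Hysteresis: switch on above on_, switch off below off_,
        // and keep the previous state in between.
        if (!active_ && smoothed_ >= on_) {
            active_ = true;
        } else if (active_ && smoothed_ <= off_) {
            active_ = false;
        }
        return active_;
    }

    float smoothed() const { return smoothed_; }

private:
    float alpha_;
    float on_;
    float off_;
    float smoothed_ = 0.0f;  // starts at "no speech"
    bool active_ = false;
};
```

Call update once per probability value returned by the detector and act on the returned boolean rather than on the raw value.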


https://nyumaya.com
