nyumaya / libnyumaya_esp32

Experimental support for nyumaya audio recognition on ESP32

Home Page:https://nyumaya.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

libnyumaya_esp32

Experimental support for nyumaya audio recognition on ESP32

Currently only ESP32S3 is supported

Dependencies:

  • ESP-NN
  • ESP-DSP

Steps to use your custom model the model:

The model is provided as unsigned char array. You have to provide this array to the function addModelFromBuffer. In this example this array is just included via marvin_model.h and marvin_model.c as source files in the CMakeLists.txt

Limitations:

  • Currently there is only support for listening to one keyword
  • The library needs to allocate a 115kB memory block on the IRAM
  • You need a PSRAM

The wake words are compatible with the corresponding version from our other repository However to make them loadable without a file system they are converted using xxd. xxd -i model.premium > model.cc

VAD:

Voice activity detection is supported for commercial customers. VAD is here defined as follows: Activity shoud be detected if Speech is present in the last 1.2 seconds of audio. Music/Singing/Breathing/Coughing should not count as speech. The output is RAW probabilities and should be filtered in an appropiate way.

About

Experimental support for nyumaya audio recognition on ESP32

https://nyumaya.com


Languages

Language:C++ 76.5%Language:CMake 16.0%Language:Makefile 7.5%