Our vision is to offer an out-of-box engineering implementation for ASR.
A cpp implementation of recognize-onnx.py in Wenet-asr in which it implements the inference with ONNXRuntime. For a version of pure CPP code, we need to do a bit of work to rewrite some components.
Just offline mode, not support stream mode, aka separate files can be recognized.
- CTC_GREEDY_SEARCH
- CTC_RPEFIX_BEAM_SEARCH
- ATTENSION_RESCORING
- Python
- Linux
- Mac
- Android
- Windows
The model is original from https://github.com/wenet-e2e/wenet/tree/main/examples/wenetspeech/s0 and tested with recognize-onnx.py.
Bidirectional model: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/wenetspeech/20211025_conformer_bidecoder_exp.tar.gz
Download:
URL:https://pan.baidu.com/s/1BTR-uR_8WWBFpvOisNR_PA
CODE:9xjz
- Sample Rate:
16000Hz
- sample Depth:
16bits
- channel:
single
- Linux
TBD
- Windows
Visual studio 2019 & cmake 3.20
cd thirdpart
build_win.cmd x86|x64
The project is under the protection of GPL V2, Apache license and commercial license.
For so/dll/c++ interface, it complies with GPL V2.
For python interface, it belongs to Apache license.
For a commercial license, please contact us: znsoft@163.com (commercial license only).
For a commercial user, we offer a library to resample input data including mp3, mp4, mkv and so on.
Please visit: https://github.com/RapidAI/RapidAudioKit