wantt / RapidASR

A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RapidASR: a new member of RapidAI family.

Our vision is to offer an out-of-box engineering implementation for ASR.

A cpp implementation of recognize-onnx.py in Wenet-asr in which it implements the inference with ONNXRuntime. For a version of pure CPP code, we need to do a bit of work to rewrite some components.

Special thanks to its original author SlyneD.

Less is more. Less dependency, more usability.

Just offline mode, not support stream mode, aka separate files can be recognized.

Supported modes:

  • CTC_GREEDY_SEARCH
  • CTC_RPEFIX_BEAM_SEARCH
  • ATTENSION_RESCORING

Progress:

  • Python
  • Linux
  • Mac
  • Android
  • Windows

Models

The model is original from https://github.com/wenet-e2e/wenet/tree/main/examples/wenetspeech/s0 and tested with recognize-onnx.py.

Bidirectional model: http://mobvoi-speech-public.ufile.ucloud.cn/public/wenet/wenetspeech/20211025_conformer_bidecoder_exp.tar.gz

Download:

URL:https://pan.baidu.com/s/1BTR-uR_8WWBFpvOisNR_PA 
CODE:9xjz 

  • Sample Rate:

16000Hz

  • sample Depth:

16bits

  • channel:

single

Build

  • Linux

TBD

  • Windows
Visual studio 2019 & cmake 3.20



cd thirdpart
build_win.cmd x86|x64

Notice:

The project is under the protection of GPL V2, Apache license and commercial license.

For so/dll/c++ interface, it complies with GPL V2.

For python interface, it belongs to Apache license.

For a commercial license, please contact us: znsoft@163.com (commercial license only).

Commercial support

For a commercial user, we offer a library to resample input data including mp3, mp4, mkv and so on.

Please visit: https://github.com/RapidAI/RapidAudioKit

About

A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.

License:Other


Languages

Language:C++ 79.7%Language:Makefile 7.6%Language:Shell 5.1%Language:Python 3.7%Language:Cython 2.4%Language:CMake 0.6%Language:Starlark 0.3%Language:C 0.2%Language:M4 0.2%Language:Batchfile 0.0%