ForestLee / RapidASR

A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

RapidASR: a new member of RapidAI family.

  • Our vision is to offer an out-of-box engineering implementation for ASR.

  • A cpp implementation of recognize-onnx.py in Wenet-asr in which it implements the inference with ONNXRuntime.

  • For a version of pure CPP code, we need to do a bit of work to rewrite some components.

  • Special thanks to its original author SlyneD.

  • Less is more. Less dependency, more usability.

  • Just offline mode, not support stream mode, aka separate files can be recognized.

Supported modes:

  • CTC_GREEDY_SEARCH
  • CTC_RPEFIX_BEAM_SEARCH
  • ATTENSION_RESCORING

Progress:

  • Python
  • Linux
  • Mac
  • Android
  • Windows

Models

Sample Rate: 16000Hz

Sample Depth: 16bits

Channel: single

Build

  • Linux
Visual studio 2019 & cmake 3.20

cd thirdpart
build_win.cmd x86|x64

TBD

  • Windows

Notice:

The project is under the protection of GPL V2, Apache license and commercial license.

For so/dll/c++ interface, it complies with GPL V2.

For python interface, it belongs to Apache license.

For a commercial license, please contact us: znsoft@163.com (commercial license only).

Commercial support

For a commercial user, we offer a library to resample input data including mp3, mp4, mkv and so on.

Please visit: https://github.com/RapidAI/RapidAudioKit

About

A Cross platform implementation of Wenet ASR inference. It's based on ONNXRuntime and Wenet. We provide a set of easier APIs to call wenet models.

License:Other


Languages

Language:C++ 79.7%Language:Makefile 7.6%Language:Shell 5.1%Language:Python 3.7%Language:Cython 2.4%Language:CMake 0.6%Language:Starlark 0.3%Language:C 0.2%Language:M4 0.2%Language:Batchfile 0.0%