RapidAI

:zap: A newly designed, ultra-lightweight, anchor-free object detection algorithm with only 250K parameters. It reduces inference time by 10% compared with Yolo-Fastest, and its post-processing is simpler.

Language: Python · License: BSD-3-Clause · 100

PaddleOCR2Pytorch

PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)

License: Apache-2.0 · 100

QRCode-NCNN

QRCode (from WeChat) implemented in ncnn ⚡ QR code detection & decoding ⚡ ncnn ⚡

100

RapidPix2Pix

pix2pix inference code based on onnxruntime

Language: Python · License: Apache-2.0 · 100

TensorflowASR

Integrates end-to-end speech recognition models for TensorFlow 2, with an RTF (real-time factor) of around 0.1 / Mandarin State-of-the-art Automatic Speech Recognition in TensorFlow 2

Language: C++ · License: Apache-2.0 · 100

3m-asr

License: Apache-2.0 · 000

asv-subtools

Open-source tools for speaker recognition

License: Apache-2.0 · 000

kenlm

KenLM: Faster and Smaller Language Model Queries

Language: C++ · License: NOASSERTION · 000

nanodet-plus-opencv

Deploys NanoDet-Plus with OpenCV; includes both C++ and Python versions of the program

Language: C++ · 000

ncnn_paddleocr

Android PaddleOCR demo with inference by ncnn

000

NER-CPP

Named Entity Recognition Automatic Annotation Algorithms

Language: C++ · License: GPL-3.0 · 000

OCRmyPDF

OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched

License: MPL-2.0 · 000

OpenSpeaker

OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.

License: Apache-2.0 · 000

ParlAI

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

License: MIT · 000

RapidOcrOnnxLibTest

RapidOCR ONNX C++ library test

Language: CMake · License: Apache-2.0 · 02 1

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector

License: MIT · 000

stb

stb single-file public domain libraries for C/C++

License: NOASSERTION · 000

tr

Free Offline OCR: an offline SDK for Chinese text detection and recognition

License: Apache-2.0 · 000

ultimatevocalremovergui

GUI for a Vocal Remover that uses Deep Neural Networks.

License: MIT · 000

vgpu_unlock

Unlock vGPU functionality for consumer grade GPUs.

License: MIT · 000

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language: C++ · License: Apache-2.0 · 000

Yolo-FastestV2

:zap: A low-power, ultra-lightweight, general-purpose object detection algorithm based on Yolo, with only 250k parameters; it can reach ~300fps+ on smartphones.

000

RapidAI's repositories

Paddle2OnnxConvertor

RapidOcrAndroidOnnxCompose

keyframe_extractor

RapidAudioKit

RapidOcrNcnnLibTest

SealOcr

YOLOX

bytetrack-opencv-onnxruntime

FastestDet
