RapidAI's repositories
Paddle2OnnxConvertor
Convert paddle model to onnx model
RapidOcrAndroidOnnxCompose
opencv onnxruntime ocr android demo, jetpack compose + kotlin
keyframe_extractor
To extract key frames from a video.
RapidAudioKit
It's for the repository of audio resampling tools
RapidOcrNcnnLibTest
rapid ocr ncnn lib test
bytetrack-opencv-onnxruntime
分别使用OpenCV、ONNXRuntime部署ByteTrack目标跟踪,包含C++和Python两个版本的程序
FastestDet
:zap: A newly designed ultra lightweight anchor free target detection algorithm, weight only 250K parameters, reduces the time consumption by 10% compared with yolo-fastest, and the post-processing is simpler
PaddleOCR2Pytorch
PaddleOCR inference in PyTorch. Converted from [PaddleOCR](https://github.com/PaddlePaddle/PaddleOCR)
QRCode-NCNN
QRCode(from WeChat) implement in ncnn⚡二维码检测&解码⚡ncnn⚡
RapidPix2Pix
Inference code based on the onnxruntime about pix2pix
TensorflowASR
集成了Tensorflow 2版本的端到端语音识别模型,并且RTF(实时率)在0.1左右/Mandarin State-of-the-art Automatic Speech Recognition in Tensorflow 2
asv-subtools
An Open Source Tools for Speaker Recognition
kenlm
KenLM: Faster and Smaller Language Model Queries
nanodet-plus-opencv
使用OpenCV部署NanoDet-Plus,包含C++和Python两个版本的程序
ncnn_paddleocr
Android paddleocr demo infer by ncnn
NER-CPP
Named Entity Recognition Automatic Annotation Algorithms
OCRmyPDF
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognition including multi-platform deployment and model optimization.
ParlAI
A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
RapidOcrOnnxLibTest
rapidocr onnx cpp lib test
silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector, Language Classifier and Spoken Number Detector
stb
stb single-file public domain libraries for C/C++
tr
Free Offline OCR 离线的中文文本检测+识别SDK
ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
vgpu_unlock
Unlock vGPU functionality for consumer grade GPUs.
wenet
Production First and Production Ready End-to-End Speech Recognition Toolkit
Yolo-FastestV2
:zap: Based on Yolo's low-power, ultra-lightweight universal target detection algorithm, the parameter is only 250k, and the speed of the smart phone mobile terminal can reach ~300fps+