There are 19 repositories under the recognition topic.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
✨ Recognize all contributors, not just the ones who push code ✨
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Convolutional recurrent network in PyTorch
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Open-Source Large Vocabulary Continuous Speech Recognition Engine
(CRNN) Chinese Characters Recognition.
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.
A voice control, voice commands, speech recognition and speech synthesis JavaScript library. Create your own Siri, Google Now or Cortana with Google Chrome within your website.
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.
OCR software for recognition of handwritten text
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
TensorFlow Android demo: recognition and detection of lane lines, vehicles, faces, actions and skeletons; smoking, phone calls, eyes closed, eyes open
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
🤖 A GitHub App to automate acknowledging contributors to your open source projects
food image to recipe with deep convolutional neural networks.
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
Full-python LiDAR SLAM using ICP and Scan Context
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
Android speech recognition and text to speech made easy
✨ First-ever CJK (Chinese, Japanese, Korean) font recognition and style extraction model: YuzuMarker's font recognition model and implementation, a side project of YuzuMarker
Tutorial for computer vision and machine learning in PHP 7/8 with OpenCV (installation + examples + documentation)
Geometric Augmentation for Text Image
"Google Now" style animation for Speech Recognizer.
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
OpenCV 4.5+ with DNN module for PHP 7/8