There are 18 repositories under recognition topic.
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
✨ Recognize all contributors, not just the ones who push code ✨
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
Convolutional recurrent network in pytorch
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
Open-Source Large Vocabulary Continuous Speech Recognition Engine
(CRNN) Chinese Characters Recognition.
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
TensorFlow android demo 车道线 车辆 人脸 动作 骨架 识别 检测 抽烟 打电话 闭眼 睁眼
OCR software for recognition of handwritten text
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
food image to recipe with deep convolutional neural networks.
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
🤖 A GitHub App to automate acknowledging contributors to your open source projects
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
"Google Now" style animation for Speech Recognizer.
Geometric Augmentation for Text Image
Android speech recognition and text to speech made easy
Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)
Full-python LiDAR SLAM using ICP and Scan Context
Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
✨ 首个CJK(中日韩)字体识别以及样式提取模型 YuzuMarker的字体识别模型与实现 / First-ever CJK (Chinese Japanese Korean) Font Recognition and Style Extractor, side project of YuzuMarker
opencv 4.5+ with dnn module for php 7/8
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
[NeurIPS 2022] Implementation of "AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition"
AFPlan is an architectural floor plan analysis and recognition system to create extended plans for building services.