recognition

There are 19 repositories under the recognition topic.

HumanSignal / labelImg
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source data labeling tool for images, text, hypertext, audio, video and time-series data.
annotations deep-learning detection image-classification imagenet python2 python3 recognition tools
Language:Python 24447
jofpin / trape
People tracker on the Internet: OSINT analysis and research tool by Jose Pino
flask footprint hacking hacking-tool jose-pino osint phising python recognition security social-engineering tracking
Language:Python 8492
all-contributors / all-contributors.github.io
✨ The all-contributors bot website and documentation. Recognize all contributors, not just the ones who push code ✨
acknowledgements all-contributors contributors open-source-tooling opensource opensource-management recognition
Language:MDX 7951
clovaai / deep-text-recognition-benchmark
Text recognition (optical character recognition) with deep learning methods, ICCV 2019
iccv2019 ocr ocr-recognition text-recognition deep-learning scene-text-recognition recognition scene-text crnn rare r2am grcnn rosetta star-net
Language:Jupyter Notebook 3901
meijieru / crnn.pytorch
Convolutional recurrent network in pytorch
neural-network recognition scene-texts
Language:Python 2469
detectRecog / CCPD
[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition
ccpd dataset detection large-scale plate-detection recognition
Language:Python 2466
julius-speech / julius
Open-Source Large Vocabulary Continuous Speech Recognition Engine
speech recognition audio-processing speech-recognition
Language:C 1913
Sierkinhane / CRNN_Chinese_Characters_Rec
(CRNN) Chinese Characters Recognition.
deep-learning ocr pytorch recognition
Language:Python 1870
chrismattmann / tika-python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
buffer covid-19 detection extraction memex mime nlp nlp-library nlp-machine-learning parse parser-interface python recognition text-extraction text-recognition tika-python tika-server tika-server-jar translation-interface usc
Language:Python 1628
jasmcaus / opencv-course
Learn OpenCV in 4 Hours - Code used in my Python and OpenCV course on freeCodeCamp.
opencv opencv-course opencv-python python faces face-recognition face-detection concepts caer recognition video freecodecamp
Language:Python 1291
sdkcarlos / artyom.js
A JavaScript library for voice control, voice commands, speech recognition and speech synthesis. Create your own Siri, Google Now or Cortana with Google Chrome within your website.
recognition speech-recognition speech-synthesis speech-to-text voice-commands
Language:JavaScript 1266
jenly1314 / MLKit
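🌝 MLKit is a powerful, easy-to-use toolkit. With ML Kit you can easily implement text recognition, barcode recognition, image labeling, face detection, object detection and more.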
mlkit barcode-scanning face-detection camerax image-labeling object-detection pose-detection text-recognition segmentation-selfie ocr machine-learning qrcode android object-recognition vision recognition machine-learning-library barcode
Language:Java 1104
sooftware / conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
conformer transformer cnn transformer-xl asr speech-recognition pytorch conv convolution augmented speech recognition
Language:Python 1084
AddictedCS / soundfingerprinting
Open source audio fingerprinting in .NET. An efficient algorithm for acoustic fingerprinting written purely in C#.
audio fingerprints algorithm acoustic-fingerprints recognition locality-sensitive-hashing nearest-neighbor-search shazam c-sharp audio-processing
Language:C# 1011
xinntao / facexlib
FaceXlib aims at providing ready-to-use face-related functions based on current SOTA open-source methods.
alignment assessment deep-learning detection face headpose matting parsing pytorch recognition tracking
Language:Python 951
nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
asr audio detection filler recognition speech speech-recognition timestamps transcription verbatim whisper speech-processing
Language:Python 859
Breta01 / handwriting-ocr
OCR software for recognition of handwritten text
handwriting-ocr machine-learning ocr opencv python recognition tensorflow
Language:Jupyter Notebook 817
opendataloader-project / opendataloader-pdf
Safe, Open, High-Performance — PDF for AI
json markdown pdf ai document-parser document-parsing documents html ocr-recognition pdf-converter pdf-to-json pdf-to-markdown recognition tables dataloader sdk pdf-to-html
Language:Java 748
yuxitong / TensorFlowAndroidDemo
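TensorFlow Android demo: recognition and detection of lane lines, vehicles, faces, actions and skeletons, as well as smoking, phone calls, and closed or open eyes.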
ai android android-tensorflow app artificial-intelligence demo detection java phone recognition smoking tensordemo tensorflow tensorflow-android-demo tensorflow-demo
Language:Java 739
bgshih / aster
Recognizing cropped text in natural images.
ocr computer-vision scene-text recognition
Language:Python 736
openspeech-team / openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
asr speech recognition speech-recognition open end-to-end e2e
Language:Python 711
leondgarse / keras_cv_attention_models
Keras beit,caformer,CMT,CoAtNet,convnext,davit,dino,efficientdet,edgenext,efficientformer,efficientnet,eva,fasternet,fastervit,fastvit,flexivit,gcvit,ghostnet,gpvit,hornet,hiera,iformer,inceptionnext,lcnet,levit,maxvit,mobilevit,moganet,nat,nfnets,pvt,swin,tinynet,tinyvit,uniformer,volo,vanillanet,yolor,yolov7,yolov8,yolox,gpt2,llama2, alias kecam
attention clip coco ddpm detection imagenet keras model recognition segment-anything stable-diffusion tensorflow tf tf2 visualizing
Language:Python 622
all-contributors / app
🤖 A GitHub App to automate acknowledging contributors to your open source projects
acknowledgements all-contributors contributors github-apps open-source-tooling opensource opensource-management probot-app recognition
Language:JavaScript 605
Murgio / Food-Recipe-CNN
food image to recipe with deep convolutional neural networks.
convolutional-neural-networks keras chef deep-learning data-science python3 recipes cooking-dishes machine-learning inceptionv3 food cnn recognition dish jupyter-notebook classification vgg16 vgg tsne food-classification
Language:Jupyter Notebook 584
gisbi-kim / PyICP-SLAM
Full-python LiDAR SLAM using ICP and Scan Context
icp lidar loop odometry place pointcloud recognition robot scan slam
Language:Python 567
clovaai / synthtiger
Official Implementation of SynthTIGER (Synthetic Text Image Generator), ICDAR 2021
ocr recognition synthetic generation dataset deep-learning ocr-recognition text-recognition icdar2021 scene-text-recognition scene-text
Language:Python 553
taosir / cnn_handwritten_chinese_recognition
Online recognition of handwritten Chinese with a CNN.
cnn handwrite recognition chinese python flask
Language:Python 544
lkuza2 / java-speech-api
The J.A.R.V.I.S. Speech API is designed to be simple and efficient, using the speech engines created by Google to provide functionality for parts of the API. Essentially, it is an API written in Java, including a recognizer, synthesizer, and a microphone capture utility. The project uses Google services for the synthesizer and recognizer. While this requires an Internet connection, it provides a complete, modern, and fully functional speech API in Java.
api google jarvis java recognition speech speech-recognition speech-synthesis speech-to-text
Language:Java 542
gotev / android-speech
Android speech recognition and text to speech made easy
android recognition speech tts
Language:Java 529
JeffersonQin / YuzuMarker.FontDetection
✨ First-ever CJK (Chinese, Japanese, Korean) Font Recognition and Style Extractor; the font recognition model and implementation for YuzuMarker, developed as a side project of YuzuMarker
cnn computer-vision cv font font-recognition fonts pytorch pytorch-cnn pytorch-lightning recognition chinese cjk-characters cjk-font japanese korean
Language:Python 517
Canjie-Luo / Text-Image-Augmentation
Geometric Augmentation for Text Image
opencv recognition scene-text detection image-transformations
Language:C++ 490
php-opencv / php-opencv-examples
Tutorial for computer vision and machine learning in PHP 7/8 by opencv (installation + examples + documentation)
php opencv face detection recognition dnn caffe torch lbf lbph facemark facial-landmarks waifu2x docker mobilenet imagenet tensorflow ml onnx darknet
Language:PHP 486
zagum / SpeechRecognitionView
"Google Now" style animation for Speech Recognizer.
android style-animation speech-recognizer material-ui recognition
Language:Java 484
haoranD / Awesome-Embodied-AI
A curated list of awesome papers on Embodied AI and related research/industry-driven resources.
awesome awesome-list classification detection embodied embodied-agent embodied-ai embodied-artificial-intelligence embodied-cognition languange learning navigation perception perceptron recognition segmentation understanding visual visual-language vlm
480
all-contributors / cli
Tool to help automate adding contributor acknowledgements according to the all-contributors specification ✨
contributors all-contributors opensource opensource-management open-source-tooling acknowledgements recognition command-line-tool
Language:JavaScript 419
LBH1024 / CAN
When Counting Meets HMER: Counting-Aware Network for Handwritten Mathematical Expression Recognition (ECCV’2022 Poster).
counting hmer ocr recognition
Language:Python 379
