image-to-text

There are 17 repositories under the image-to-text topic.

thiagoalessio / tesseract-ocr-for-php
A wrapper to work with Tesseract OCR inside PHP.
image-to-text ocr php tesseract text-recognition
Language:PHP 2814
lucidrains / CoCa-pytorch
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
artificial-intelligence attention-mechanism contrastive-learning deep-learning multimodal transformers image-to-text
Language:Python 1012
killkimno / MORT
MORT translator project - Real-time game translator with OCR
ocr auto-translation translation translate game game-translation tesseract-ocr image-to-text
Language:C# 497
zapolnoch / node-tesseract-ocr
A Node.js wrapper for the Tesseract OCR API
image-to-text ocr tesseract text-recognition
Language:JavaScript 296
PaddlePaddle / PaddleMIX
Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
aigc stable-diffusion blip2 clip minigpt4 image-to-text text-to-image ppdiffusers controlnet multimodal eva-clip sd-xl text-to-video dit llava qwen-vl sora stablevideodiffusion
Language:Python 253
google / imageinwords
Data release for the ImageInWords (IIW) paper.
dataset dataset-generation detailed-annotations detailed-descriptions evaluation human-annotation i2t image-captioning image-descriptions image-text image-to-text t2i
Language:JavaScript 184
yardstick17 / image_text_reader
The module extracts text from images using the Tesseract OCR engine. Text in images is often blurred or of uneven size, so the image is pre-processed to make it easier for the OCR engine to read. The module first draws bounding boxes around the text in the image and then normalizes it to 300 dpi, which is suitable for the OCR engine to read.
ocr image-reader image-to-text tesseract-ocr read-image ocr-text-reader
Language:Python 146
Yushi-Hu / tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
image-to-text large-language-models text-to-image visual-question-answering
Language:Python 121
LGAI-Research / L-Verse
L-Verse: Bidirectional Generation Between Image and Text
deep-learning pytorch pytorch-lightning l-verse vq-vae transformer image-to-text text-to-image image-captioning
Language:Python 108
NormXU / nougat-latex-ocr
Codebase for fine-tuning / evaluating nougat-based image2latex generation models
image-to-text
Language:Python 103
nateshmbhat / card-scanner-flutter
A Flutter package for fast, accurate, and secure credit card and debit card scanning
ai card-scanner card-scanner-library card-scanning credit-card credit-card-scaning dart debit-card flutter image-processing image-re image-to-text ml
Language:Swift 101
MuhametSmaili / note-it
OCR functionality in a feature-rich note-taking extension.
chrome chrome-extension image-to-text note-taking ocr ocr-recognition react tiptap
Language:TypeScript 94
untrix / im2latex
Solution to OpenAI's im2latex Request for Research
neural-network deep-learning computer-vision machine-learning tensorflow generative-model sequence-to-sequence encoder-decoder ocr-recognition im2latex image-to-text image-to-markup
Language:Jupyter Notebook 87
farhanchoudhary / PAN_Card_OCR_Project
To extract details from Indian National Identification Cards such as PAN (completed) & Aadhar, Passport, Driving License (WIP) in a structured format
tesseract pan-card pan pytesseract ocr optical-character-recognition image-processing image-to-text
Language:Python 76
NanoNets / ocr-python
OCR library to extract text & tables from PDF files and images. Convert any image or PDF to CSV / TXT / JSON / Searchable PDF.
ocr pdf-to-csv searchable-pdf tesseract extract-text-from-image extract-text-from-pdf image-to-text-converter pdf pytesseract-ocr python table-extract textract pdf-to-json pdf-to-text extract-table image-to-text
Language:Jupyter Notebook 71
Carleslc / ImageToText
OCR with Google's AI technology (Cloud Vision API)
ocr optical-character-recognition image-to-text google-cloud-vision artificial-intelligence google-cloud img2txt
Language:Python 67
glami / glami-1m
The largest multilingual image-text classification dataset. It contains fashion products.
computer-vision dataset deep-learning fashion image-text image-to-text multilingual multimodal natural-language-processing classification image-text-classification multilingual-image-text-classification image-classification text-classification text-to-image-generation multi-modal-deep-learning
Language:Jupyter Notebook 64
BEPb / image_to_ascii
It is very simple: you either download a picture file or specify its link when running the Python script, and you get a text file as output. You can also immediately preview the result of the conversion on the command line.
cmd gif-to-ascii image-to image-to-text video-to-text cmdline conversion convert converter py python python3 script
Language:Python 61
bensonruan / Tesseract-OCR
Tesseract.js OCR
ocr image-to-text tesseract javascript machine-learning artificial-intelligence computer-vision
Language:HTML 59
amit-y11 / the_ocr_bot
Telegram bot to convert image to text using python
ocr-bot telegram-bot python image-recognition image-to-text python-telegram-bot
Language:Python 54
zhangming8 / Dango-ocr
DangoOCR: screenshot OCR with text recognition, support for multiple languages, translation after recognition, and audio playback
ocr screenshot machine-translation ocr-recognition text-recognition image-to-text
Language:Python 49
DS2BRAIN / ds2
Easiest way to use AI models without coding (Web UI & API support)
ai automl data-science machine-learning ml mlops annotation-tool auto-labeling deep-learning feature-engineering neural-network python pytorch tensorflow image-annotation-tool text-annotation dalle huggingface image-to-text stable-diffusion
Language:Python 47
mshdabiola / NotePad
Notepad is a multi-module Jetpack Compose note-taking app with a sketch pad, voice recorder, and image capture
android-app github-actions jetpack-compose kotin-coroutines kotlin multiplayer room-database hilt-dependency-injection modulization image-to-text room-persistence-library
Language:Kotlin 41
affjljoo3581 / Inverse-DALL-E-for-Optical-Character-Recognition
Inverse DALL-E for Optical Character Recognition
dalle nlp gpt2 huggingface image-captioning image-generation image-to-text multimodal ocr optical-character-recognition pytorch text-to-image transformers vqvae
Language:Python 38
torresflo / Tag-Machine
A little Python application to auto tag your photos with the power of machine learning.
python pytorch pytorch-transformers pretrained-models machine-learning computer-vision image-classification image-tagging image-tagger auto-tagging image-to-text photo-tag photo-tagging
Language:Python 37
geoffsmith82 / Symposium2023
Demonstrates Voice Recognition, Text to Speech, Language Translation, OAuth2, Image Generation, Face Detection and Voice Chatbot. Source code and Documentation for my 2023 ADUG Symposium Talk.
ai artificial-intelligence gpt gpt-4 text-to-speech translation voice-recognition oauth2 palm palm2 speech-to-text text-to-image websockets gpt-4o claude-3-opus claude-3-haiku claude-3-sonnet computer-vision image-to-text
Language:Pascal 35
mddunlap924 / StableDiffusion2-Image-to-Text
Stable Diffusion with Text-to-Image and Image-to-Text
generative-art multi-modal prompt-engineering stable-diffusion vision image-to-text kaggle-competition text-to-image
Language:Jupyter Notebook 33
visinf / lnfmm
Latent Normalizing Flows for Many-to-Many Cross Domain Mappings (ICLR 2020)
multimodal-deep-learning conditional-vae generative-models image-to-text latent-variable-models normalizing-flows text-to-image vision-and-language
Language:Python 33
Akascape / TEXTEMAGE
A simple image to text converter with GUI!
image-to-text image-to-text-converter image-to-text-extracter image-to-text-application image-to-text-software gui textemage textemage-software converter picture-to-text photo-to-text photo-to-text-converter scan-to-text-converter
Language:Python 31
zsdonghao / im2txt2im
I2T2I: Text-to-Image Synthesis with textual data augmentation
image-to-text text-to-image tensorflow tensorlayer
Language:Python 30
zeeshanali-k / Classy
Text to image generation and Image Captioning Android, iOS, Desktop and Web app using Compose Multiplatform with Clean Architecture
android anroid-studio compose-multiplatform jetpack-compose kotlin kotlin-multiplatform ios ai stable-diffusion text-to-image texttoimage image-captioning image-processing image-to-text
Language:Kotlin 26
AliShazly / ascii-py
Convert images or videos to ASCII in the terminal
python ascii-art ascii-graphics image-to-text video-to-text image-processing video-processing pillow terminal-graphics terminal-based
Language:Python 25
ITE-5th / image-captioning-gan
image-captioning image-to-text pytorch deep-learning coco-dataset cgan reinforcement-learning policy-gradient
Language:Python 23
N0iire / Image-to-text-Translate
Image to text translator using Open AI API & Tesseract
image-to-text image-translation openai-api opencv-python python tesseract-ocr
Language:Python 19
Viresh-R / ml-CCA
Implementation of Fast ml-CCA from the ICCV-2015 work "Multi-Label Cross-Modal Retrieval"
cross-modal image-to-text text-to-image canonical-correlation-analysis iccv
Language:MATLAB 19
Spidy20 / Optical_Character_Reccognition
In this system, you enter an image (such as a government document) and it converts the image data into a string
ocr-recognition image-to-text optical-character-recognition
Language:Python 18