There are 1 repository under image2text topic.
pix2tex: Using a ViT to convert images of equations into LaTeX code.
:clipboard: Python wrapper to grab text from images and save as text files using Tesseract Engine
读过的CV方向的一些论文,图像生成文字、弱监督分割等
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述
Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.
Various nodes for ComfyUI
A collection of scripts to "help" you with your programming exams and assignments.
An AutoIT 3 wrapper around the OCRSpace API to convert images and PDFs to text.
Python tool, which takes 1..n images, tries to rotate them based on the text, extract the text and store 1..n images to a pdf.
Civitai Stable Diffusion 337k Dataset; dataset of ai generated image
This repository contains code for paper IMAD: IMage Augmented multi-modal Dialogue
Run im2txt trained model in inference mode
🎞 Video editor with description generation for MTS TrueTech Hack
[INLG2023] The High-Level (HL) dataset is a Vision and Language (V&L) resource aligning object-centric descriptions from COCO with high-level descriptions crowdsourced along 3 axes: scene, action, rationale.
A Large Language Model (LLM) Based App to Generate Stories from Pictures
Text-to-Image and Image-to-Text model retrieval
A CRUD application; my third project for GA Software Engineering Immersive.
An android app that will use on device ml to recognize text in a image