There are 32 repositories under the image-caption topic.
VisualGPT (CVPR 2022): GPT as a decoder for vision-language models
A neural network that generates captions for an image using a CNN and an RNN with beam search.
CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.
[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades the conventional diffusion model with an additional semantic prior.
Paper notes on deep learning, machine learning, and computer vision
An image captioning web application that combines React.js on the front end with Flask and Node.js on the back end, built around the MERN stack. Users can upload images and instantly receive automatic captions. Authenticated users get extra features such as caption translation and text-to-speech.
A Python-based CLI tool for captioning images with WD-series, Joy-caption-pre-alpha, Meta Llama 3.2 Vision Instruct, and Qwen2 VL Instruct models.
TensorFlow implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention". Supports Python 3.6/3.7, TensorFlow 1.8/1.12/1.13/1.14, and NumPy 1.12 or newer.
Pre-trained model and source code for generating descriptions of images.
[IGARSS 2022] CapFormer: a pure transformer for remote sensing image captioning
Image Caption
Image captioning project.
A Python 3 library for NLP and image-caption metrics: BLEU, METEOR, CIDEr, ROUGE, SPICE, and WMD (see the BLEU usage sketch after this list)
A simple but comprehensive PyTorch implementation of image captioning models.
This repository reimplements the "Show, Attend and Tell" model and adds extra deep learning techniques.
Transformer block in tf.keras similar to PyTorch's nn.Transformer block.
Using image caption models to extract prompts in ComfyUI
End-to-end deep learning model that generates image captions
Karpathy split JSON files for image captioning
Say goodbye to jQuery plugins: a similar image caption effect can now be created with CSS3 alone. This demo shows the effect in action.
Generates image caption information from annotations
PyTorch implementation of image captioning based on an attention mechanism
[ECCV24] Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning
A project aimed at enhancing the visual experience of individuals with visual impairments. Leveraging machine learning and natural language processing, this repository houses the codebase for generating coherent natural-language descriptions of captured images. The project integrates with image recognition,
BLIP-2 captioning, mass captioning, question answering, and other tools.
Image Descriptor with Visual Attention Mechanism Using Long Short-term Memory
A MindSpore implementation of the paper "Show and Tell: A Neural Image Caption Generator"
A subset of Google's Conceptual Captions (3M) dataset containing 940k samples.
Image captioning using VGG16 + LSTM
Major Project Repository
A Mindspore Implementation of "Show, Attend and Tell: Neural Image Caption Generation with Visual Attention".
PyTorch image-caption retrieval model
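Several of the repositories above evaluate generated captions with standard n-gram metrics such as BLEU. As a rough, self-contained illustration (not the API of any particular repository listed here), the sketch below computes corpus-level BLEU-4 with NLTK over made-up reference and candidate captions.

```python
# Minimal sketch of caption evaluation with corpus-level BLEU (NLTK).
# The captions below are illustrative placeholders, not real data.
from nltk.translate.bleu_score import corpus_bleu, SmoothingFunction

# One list of tokenized reference captions per image.
references = [
    [["a", "dog", "runs", "on", "the", "beach"],
     ["a", "dog", "running", "along", "the", "shore"]],
]
# One tokenized generated caption per image.
candidates = [["a", "dog", "runs", "along", "the", "beach"]]

# Smoothing avoids zero scores when higher-order n-grams have no matches.
smooth = SmoothingFunction().method1
bleu4 = corpus_bleu(references, candidates,
                    weights=(0.25, 0.25, 0.25, 0.25),
                    smoothing_function=smooth)
print(f"BLEU-4: {bleu4:.3f}")
```

Libraries dedicated to caption metrics typically expose METEOR, CIDEr, ROUGE, and SPICE behind a similar references-vs-candidates interface.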