There are 40 repositories under image-captioning topic.
Show, Attend, and Tell | a PyTorch Tutorial to Image Captioning
Simple Swift class to provide all the configurations you need to create custom camera view in your app
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
TensorFlow Implementation of "Show, Attend and Tell"
Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.
X-modaler is a versatile and high-performance codebase for cross-modal analytics(e.g., image captioning, video captioning, vision-language pre-training, visual question answering, visual commonsense reasoning, and cross-modal retrieval).
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
PyTorch source code for "Stacked Cross Attention for Image-Text Matching" (ECCV 2018)
An open-source tool for sequence learning in NLP built on TensorFlow.
Meshed-Memory Transformer for Image Captioning. CVPR 2020
Complete Assignments for CS231n: Convolutional Neural Networks for Visual Recognition
Implementation of "Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning"
Image Captioning using InceptionV3 and beam search
Transformer-based image captioning extension for pytorch/fairseq
A reverse image search engine powered by elastic search and tensorflow
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions. CVPR 2019
A modular library built on top of Keras and TensorFlow to generate a caption in natural language for any input image.
ML data annotations made super easy for teams. Just upload data, add your team and build training/evaluation dataset in hours.
Implementation of 'X-Linear Attention Networks for Image Captioning' [CVPR 2020]
Automatic image captioning model based on Caffe, using features from bottom-up attention.
Image Captions Generation with Spatial and Channel-wise Attention
A neural network to generate captions for an image using CNN and RNN with BEAM Search.
Video to Text: Natural language description generator for some given video. [Video Captioning]
Code for "Show, Adapt and Tell: Adversarial Training of Cross-domain Image Captioner" in ICCV 2017
This repository explores the variety of techniques and algorithms commonly used in deep learning and the implementation in MATLAB and PYTHON
Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
[DEPRECATED] A Neural Network based generative model for captioning images using Tensorflow
A pytorch implementation of On the Automatic Generation of Medical Imaging Reports.
Computer vision tools for fairseq, containing PyTorch implementation of text recognition and object detection