There are 8 repositories under flickr8k-dataset topic.
Image Captioning with Keras
CaptionBot: sequence-to-sequence modelling where the encoder is a CNN (ResNet-50) and the decoder is an LSTMCell with a soft attention mechanism
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
An image captioning web application that combines React.js for the front end with Flask and Node.js for the back end, utilizing the MERN stack. Users can upload images and instantly receive automatic captions. Authenticated users have access to extra features such as caption translation and text-to-speech.
Implementation of the 'merge' architecture for generating image captions from the paper "What is the Role of Recurrent Neural Networks (RNNs) in an Image Caption Generator?" using Keras. The dataset used is Flickr8k, available on Kaggle.
Image captioning model with a ResNet50 encoder and an LSTM decoder
An attention-based sequential deep learning model implemented in PyTorch to generate a single-line caption for an input image
Yet another im2txt (show and tell: A Neural Image Caption Generator)
A Python application that generates a caption for a selected image. It uses deep learning and NLP frameworks (TensorFlow, Keras, and NLTK) for data processing and for building and evaluating deep learning models.
Generate captions from images
Generating captions for images using CNN & LSTM on the Flickr8K dataset. Generating captions from images has various practical benefits, such as aiding the visually impaired.
In this project, we use a deep recurrent architecture, which uses a CNN (VGG-16) pretrained on ImageNet to extract a 4096-dimensional image feature vector and an LSTM that generates a caption from these feature vectors.
Image Captioning using Deep learning models in Keras.
🚀 Image Caption Generator Project 🚀 🧠 Building a customized LSTM neural network encoder model with Dropout, Dense, RepeatVector, and Bidirectional LSTM layers. Sequence feature layers with Embedding, Dropout, and Bidirectional LSTM layers. Attention mechanism using Dot product, Softmax attention scores,...
Implementation of Image Captioning Model using CNNs and LSTMs
Image Captioning is the task of describing the content of an image in words. This task lies at the intersection of computer vision and natural language processing.
Image Captioning With MobileNet-LLaMA 3
Comparative analysis of image captioning models using RNN, BiLSTM and Transformer architectures on the Flickr8K dataset, with InceptionV3 for image feature extraction.
Automatically generating the captions for an image.
Caption Generation using Flickr8k dataset by @jbrownlee and image generation from caption prompt using pretrained models
Karpathy splits JSON files for image captioning
This project generates Arabic captions from the Arabic Flickr8K dataset using a pre-trained CNN (MobileNet-V2) and an LSTM model, together with a set of NLP steps. The aim is to lay the initial groundwork for helping children with learning difficulties.
Image Caption Generation
Automatic image captioning with PyTorch
In this capstone project, we create a deep learning model that explains the contents of an image in the form of speech, through caption generation with an attention mechanism on the Flickr8K dataset.
Text-Image-Text is a bidirectional system that enables seamless retrieval of images based on text descriptions, and vice versa. It leverages state-of-the-art language and vision models to bridge the gap between textual and visual representations.
Image Caption Generator using Python | Flickr Dataset | Deep Learning(CNN & RNN)
This notebook shows a neural image captioning model using the merge architecture in Keras, which generates captions for a given image.
Image Caption Generator, a project that aims to generate descriptive captions for input images using advanced predictive techniques.
Image captioning of Flickr 8k dataset using Attention and Merge model
An Image Captioning implementation of a CNN Encoder and an RNN Decoder in PyTorch.
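Several of the projects listed above mention soft or dot-product attention over image features. As a rough illustration only, and not the code of any listed repository, the idea can be sketched in plain Python: score each image region against the decoder state with a dot product, normalize the scores with softmax, and take the weighted sum of region features as the context vector. The names `softmax`, `dot_product_attention`, `regions`, and `state` are hypothetical, chosen for this sketch.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that are positive and sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot_product_attention(query, features):
    """Hypothetical helper: soft attention over a list of region features.

    query    -- decoder state, a list of d floats
    features -- n image-region vectors, each a list of d floats
    Returns (weights, context): n attention weights and a d-dim context vector.
    """
    # One dot-product score per image region.
    scores = [sum(q * f for q, f in zip(query, feat)) for feat in features]
    weights = softmax(scores)
    d = len(query)
    # Context vector: attention-weighted sum of the region features.
    context = [
        sum(w * feat[i] for w, feat in zip(weights, features))
        for i in range(d)
    ]
    return weights, context


if __name__ == "__main__":
    regions = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy region features
    state = [2.0, 0.0]                               # toy decoder state
    weights, context = dot_product_attention(state, regions)
    print(weights, context)
```

In a real captioning model the region features would come from a CNN encoder and the query from the LSTM decoder at each time step, usually with learned projections; this sketch omits all learned parameters.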