flickr8k

There are 8 repositories under the flickr8k topic.

kakshak07 / Image-Captioining
The objective is to generate a textual description of an image based on the objects and actions it contains, using generative models so that the captions are novel sentences. Pipeline-type models use two separate learning processes, one for language modelling and the other for image recognition. It first identifies objects in the image, passes the result to the Inception-v3 model to convert it into a word embedding vector, and then feeds that vector into a series of LSTM cells to produce the desired captions.
image-captioning flickr8k-text pyttsx flickr8k preprocess-data
Language:Python 26
Subangkar / Image-Captioning-Attention-PyTorch
An attention-based sequential deep learning model implemented in PyTorch that generates a single-line caption for an input image.
pytorch lstm attention-mechanism flickr8k flickr8k-dataset glove-embeddings
Language:Jupyter Notebook 11
KimRass / CLIP
PyTorch implementation of 'CLIP' (Radford et al., 2021) from scratch and training it on Flickr8k + Flickr30k
clip flickr30k flickr8k linear-classification multi-modal zero-shot-classification text-image-retrieval
Language:Python 5
awsaf49 / flickr-dataset
Download the Flickr8k and Flickr30k image caption datasets
captioning-images clip image image-text siglip dataset flickr flickr30k flickr8k
3
spokenlanguage / platalea
Library for training visually-grounded models of spoken language understanding.
visually-grounded-speech multi-tasking spoken-language-understanding deep-neural-networks speech-processing weakly-supervised-learning multimodal-learning pytorch flickr8k spokencoco
Language:Python 3
tojiboyevf / image_captioning
Deep Learning Final project 2022
coco-captions flickr8k image-captioning lstm pytorch transformers
Language:Python 3
GuyKabiri / Image-Caption
An exercise on captioning images from the Neural Networks for Computer Vision course, using the Flickr8K dataset and a simple encoder-decoder architecture. Evaluation is based on cross-entropy loss and the 4-gram BLEU score.
bleu-score cnn computer-vision cross-entropy flickr8k image-captioning
Language:Jupyter Notebook 0
roysti10 / Image_Captioning
Image captioning using an encoder-decoder network, with pretrained models provided
image-captioning tensorflow encoder-decoder-model checkpoints flickr8k
Language:Python
