hirazy / Image-Captor-Generator

Image-Captor-Generator

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Image Caption Generator - Flickr Dataset

Complete Video Tutorial: https://youtu.be/fUSTbGrL1tc

Project Information

The objective of the project is to predict the captions for the input image. The dataset consists of 8k images and 5 captions for each image. The features are extracted from both the image and the text captions for input. The features will be concatenated to predict the next word of the caption. CNN is used for image and LSTM is used for text. BLEU Score is used as a metric to evaluate the performance of the trained model.

Download link: https://www.kaggle.com/adityajn105/flickr8k

Environment: Kaggle

Libraries

  • numpy
  • matplotlib
  • keras
  • tensorflow
  • nltk

Neural Network

  • VGG16 Network
  • CNN-LSTM Network

BLEU-1 Score: 0.544 BLEU-2 Score: 0.319

About

Image-Captor-Generator


Languages

Language:Jupyter Notebook 100.0%