Predicting the center of MSCOCO images given the border and the captions

This code is a model written in tensorflow to predict the 32x32 center of a 64x64 image given the border and some captions.

Installation

pip install -r requirements.txt

I have used the MSCOCO dataset and preprocessed the data into tf.Examples files. You can get my preprocessed dataset here.

mkdir dataset
cd dataset
tar xvf dataset_ift6366_images_captions.tar.gz .

To run the model:

python3 main.py

You can start a tensorboard instance to monitor the training:

tensorboard --logdir=logs

The architecture of the generator is resumed in this schema: ![generator](https://raw.githubusercontent.com/ogrergo/ift6266/master/docs/static_files/Archi gen.jpg "archi generator")

Predicting center of MSCOCO images given the border and the captions -- Deeplearning project for a course at UDEM

Language:Jupyter Notebook 99.6%Language:Python 0.4%