ogrergo / ift6266

Predicting the center of MSCOCO images given the border and the captions -- deep learning project for a course at UDEM

Home Page: https://ogrergo.github.io/ift6266/

Predicting the center of MSCOCO images given the border and the captions

This repository contains a TensorFlow model that predicts the 32x32 center of a 64x64 image given its border and some captions.
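To make the task concrete, here is a minimal NumPy sketch of how a 64x64 image can be split into the border (the model input) and the 32x32 center (the prediction target). The function names `split_border_and_center` and `paste_center` are hypothetical, and zeroing out the center of the border is an assumption; the actual preprocessing in this repository may represent the missing region differently.

```python
import numpy as np


def split_border_and_center(image, center_size=32):
    """Return (border, center) for an HxWxC image.

    The border is a copy of the image with the central square set to zero
    (an assumption made for illustration); the center is the cropped square.
    """
    height, width = image.shape[:2]
    top = (height - center_size) // 2
    left = (width - center_size) // 2
    center = image[top:top + center_size, left:left + center_size].copy()
    border = image.copy()
    border[top:top + center_size, left:left + center_size] = 0
    return border, center


def paste_center(border, center):
    """Put a (predicted) center back into the middle of the border."""
    height, width = border.shape[:2]
    size = center.shape[0]
    top = (height - size) // 2
    left = (width - size) // 2
    out = border.copy()
    out[top:top + size, left:left + size] = center
    return out


# Random stand-in for a 64x64 RGB image from the dataset.
rng = np.random.default_rng(0)
image = rng.integers(0, 256, size=(64, 64, 3), dtype=np.uint8)

border, center = split_border_and_center(image)
reconstructed = paste_center(border, center)
```

With the true center pasted back, `reconstructed` equals the original image; at inference time the pasted center would be the generator's output instead.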

Installation

pip install -r requirements.txt

Dataset

I used the MSCOCO dataset and preprocessed it into tf.Example files. You can get my preprocessed dataset here.

mkdir dataset
cd dataset
tar xvf dataset_ift6366_images_captions.tar.gz .

To run the model:

python3 main.py

You can start a tensorboard instance to monitor the training:

tensorboard --logdir=logs

Architecture

The architecture of the generator is summarized in this diagram: ![generator](https://raw.githubusercontent.com/ogrergo/ift6266/master/docs/static_files/Archi%20gen.jpg "archi generator")
