hashbangCoder / Adaptive-Sentinel-Image-Captioning

Implementation of the 'Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning' by Lu et. al.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Adaptive-Sentinel-Image-Captioning

PyTorch implementation of the Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning by Lu et. al.

Work in Progress.

TO-DO :

  • Implement normal LSTM-Attention
  • Greedy + Sampling Decoder
  • Layer Normalization
  • [] Debug Sentinel-LSTM
  • [] Implement Beam Search
  • [] Tune hyperparameters

Train Loss curve without pre-trained embeddings

alt text

Train Loss curve with pretrained embeddings

alt text

About

Implementation of the 'Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning' by Lu et. al.


Languages

Language:Python 100.0%