attention caption-generation computer-vision deep-learning encoder-decoder image-captioning nlp tensorflow transformers

Image Captioning Overview

In recent years, neural networks have fueled dramatic advances in image captioning. Researchers are looking for more challenging applications for computer vision and Sequence to Sequence modeling systems. They seek to describe the world in human terms. I have implemented three different architectures from simple Encoder Decoders to Transformers with Multi-Head Attention.

Datasets :-

My Articles

About

Implemented 3 different architectures to tackle the Image Caption problem, i.e, Merged Encoder-Decoder - Bahdanau Attention - Transformers

attention caption-generation computer-vision deep-learning encoder-decoder image-captioning nlp tensorflow transformers

Languages

Language:Jupyter Notebook 100.0%