luogen1996 / LWTransformer

Lightweight Transformer for Multi-modal Tasks

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Lightweight Transformer for Vision-and-Language Tasks

We have implemented transformer and lightweight transformer for a set of Vision-and-Language tasks.

For Visual Question Answering, you can refer to here to reproduce results in our paper.

For Referring Expressiong Comprehension, you can refer to here to reproduce results in our paper.

For Image Captioning, you can refer to here to reproduce results in our paper.

About

Lightweight Transformer for Multi-modal Tasks


Languages

Language:Python 98.7%Language:JavaScript 0.5%Language:Dockerfile 0.4%Language:Batchfile 0.2%Language:Makefile 0.2%Language:CSS 0.0%