Young499

Young's repositories

Pytorch implementation of paper "Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning".

Language:PythonBSD-3-Clause600

Official pytorch implementation of paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).

Language:Jupyter NotebookBSD-3-Clause000

Meshed-Memory Transformer for Image Captioning. CVPR 2020

Language:PythonBSD-3-Clause000

OpenMMLab Computer Vision Foundation

Language:PythonApache-2.0000

A quickstart and benchmark for pytorch distributed training.

Language:PythonMIT000

RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)

Language:PythonBSD-3-Clause000

Unofficial pytorch implementation for Self-critical Sequence Training for Image Captioning. and others.

Language:PythonMIT000

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonMIT000

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonMIT000

X-modaler is a versatile and high-performance codebase for cross-modal analytics.

Language:PythonNOASSERTION000