Young's repositories
image-captioning-MDSANet
PyTorch implementation of the paper "Multi-Branch Distance-Sensitive Self-Attention Network for Image Captioning".
image-captioning-DLCT
Official PyTorch implementation of the paper "Dual-Level Collaborative Transformer for Image Captioning" (AAAI 2021).
meshed-memory-transformer
Meshed-Memory Transformer for Image Captioning (CVPR 2020)
mmcv
OpenMMLab Computer Vision Foundation
pytorch-distributed
A quickstart and benchmark for PyTorch distributed training.
RSTNet
RSTNet: Captioning with Adaptive Attention on Visual and Non-Visual Words (CVPR 2021)
self-critical.pytorch
Unofficial PyTorch implementation of Self-critical Sequence Training for Image Captioning, and others.
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
xmodaler
X-modaler is a versatile and high-performance codebase for cross-modal analytics.