shawn lin's repositories
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
CLIP
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
faster_rcnn
Faster R-CNN
grad-cam
[ICCV 2017] Torch code for Grad-CAM
leetcode
LeetCode Solutions: A Record of My Problem-Solving Journey.
leetcode-1
Provides all my solutions and explanations in Chinese for all the LeetCode coding problems.
MIA
Code for "Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations" (NeurIPS 2019)
models
Models built with TensorFlow
multimodal-prompt-learning
[CVPR 2023] Official repository of paper titled "MaPLe: Multi-modal Prompt Learning".
nlp-tutorial
Natural Language Processing Tutorial for Deep Learning Researchers
object_relation_transformer
Implementation of the Object Relation Transformer for Image Captioning
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Oscar
Oscar and VinVL
py-bottom-up-attention
PyTorch bottom-up attention with Detectron2
pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
pytorch-distributed
A quickstart and benchmark for PyTorch distributed training.
pytorch-faster-rcnn
Updated for PyTorch 1.0. Supports CPU test and demo. (Use detectron2, it's a masterpiece)
scene-graph-TF-release
"Scene Graph Generation by Iterative Message Passing" code repository
Semi-Supervised-Image-Captioning
Code for "Bootstrap, Review, Decode: Using Out-of-Domain Textual Data to Improve Image Captioning"
unsupervised_captioning
Code for Unsupervised Image Captioning
Up-Down-Captioner
Automatic image captioning model based on Caffe, using features from bottom-up attention.