Stephen Xi Chen's repositories
binary-image-selection
BISON: Binary Image SelectiON
bottom-up-attention
Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
detr
End-to-End Object Detection with Transformers
grid-feats-vqa
Grid features pre-training code for visual question answering
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
LightNet
Efficient, transparent deep learning in hundreds of lines of code.
maskrcnn-benchmark
Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.
models
Models and examples built with TensorFlow
ovr-cnn
A new framework for open-vocabulary object detection, based on maskrcnn-benchmark
py-faster-rcnn
Faster R-CNN (Python implementation) -- see https://github.com/ShaoqingRen/faster_rcnn for the official MATLAB version
SCAN
PyTorch source code for "Stacked Cross Attention for Image-Text Matching"