Zhe Gan (zhegan27)


Company: Microsoft

Zhe Gan's starred repositories

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language: Python · License: Apache-2.0 · Stars: 1266 · Issues: 0

GRiT

GRiT: A Generative Region-to-text Transformer for Object Understanding (https://arxiv.org/abs/2212.00280)

Language: Python · License: MIT · Stars: 281 · Issues: 0

FIBER

Coarse-to-Fine Vision-Language Pre-training with Fusion in the Backbone

Language: Python · License: MIT · Stars: 124 · Issues: 0

mindall-e

PyTorch implementation of a 1.3B text-to-image generation model trained on 14 million image-text pairs

Language: Python · License: NOASSERTION · Stars: 630 · Issues: 0

GenerativeImage2Text

GIT: A Generative Image-to-text Transformer for Vision and Language

Language: Python · License: MIT · Stars: 528 · Issues: 0

UniTAB

UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)

Language: Python · License: MIT · Stars: 82 · Issues: 0

BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python · License: Apache-2.0 · Stars: 151 · Issues: 0

ViT-Adapter

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Language: Python · License: Apache-2.0 · Stars: 1155 · Issues: 0

SwinBERT

Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"

Language: Python · License: MIT · Stars: 232 · Issues: 0

SLIP

Code release for SLIP: Self-supervision meets Language-Image Pre-training

Language: Python · License: MIT · Stars: 731 · Issues: 0

DallEval

DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)

Language: Jupyter Notebook · License: MIT · Stars: 135 · Issues: 0

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language: Python · License: Apache-2.0 · Stars: 2357 · Issues: 0

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Language: Python · License: Apache-2.0 · Stars: 1801 · Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook · License: BSD-3-Clause · Stars: 4383 · Issues: 0

pytorch_violet

A PyTorch implementation of VIOLET

Language: Python · Stars: 136 · Issues: 0

ibot

iBOT: Image BERT Pre-Training with Online Tokenizer (ICLR 2022)

Language: Jupyter Notebook · License: Apache-2.0 · Stars: 632 · Issues: 0

GLIP

Grounded Language-Image Pre-training

Language: Python · License: MIT · Stars: 2025 · Issues: 0

mlp-vil

MLPs for Vision and Language Modeling (Coming Soon)

License: MIT · Stars: 27 · Issues: 0

METER

METER: A Multimodal End-to-end TransformER Framework

Language: Python · License: MIT · Stars: 355 · Issues: 0

Stable-Pix2Seq

A full-fledged version of Pix2Seq

Language: Python · License: Apache-2.0 · Stars: 234 · Issues: 0

CV_A-FAN

[TMLR] "Adversarial Feature Augmentation and Normalization for Visual Recognition", Tianlong Chen, Yu Cheng, Zhe Gan, Jianfeng Wang, Lijuan Wang, Zhangyang Wang, Jingjing Liu

Language: Python · License: MIT · Stars: 20 · Issues: 0

VL-T5

PyTorch code for "Unifying Vision-and-Language Tasks via Text Generation" (ICML 2021)

Language: Python · License: MIT · Stars: 354 · Issues: 0

VidLanKD

PyTorch version of VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer (NeurIPS 2021)

Language: Python · Stars: 56 · Issues: 0

P-tuning

A novel method to tune language models. Code and datasets for the paper "GPT Understands, Too".

Language: Python · License: MIT · Stars: 901 · Issues: 0

ALBEF

Code for ALBEF: a new vision-language pre-training method

Language: Python · License: BSD-3-Clause · Stars: 1427 · Issues: 0

CoOp

Prompt Learning for Vision-Language Models (IJCV'22, CVPR'22)

Language: Python · License: MIT · Stars: 1538 · Issues: 0

Focal-Transformer

[NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

Language: Python · License: MIT · Stars: 543 · Issues: 0