Jerry Jiarui XU's repositories
mmdetection
Open MMLab Detection Toolbox with PyTorch 1.0
OFA-fairseq
fairseq from OFA
prismatic-vlms
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
CenterNet2
Two-stage CenterNet
davis2017-evaluation
Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges
DeepSegmentor
A Pytorch implementation of DeepCrack and RoadNet projects.
detectron2
Detectron2 is FAIR's next-generation platform for object detection and segmentation.
diffusers
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch
fvcore
Collection of common code that's shared among different research projects in FAIR computer vision team.
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
lm-evaluation-harness
A framework for few-shot evaluation of language models.
Mask2Former
Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"
mmsegmentation
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
OFA
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
panopticapi
COCO 2018 Panoptic Segmentation Task API (Beta version)
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more
transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
visual_prompting
Official implementation and data release of the paper "Visual Prompting via Image Inpainting".