xvjiarui

Jerry Jiarui XU's repositories

VFS

Rethinking Self-Supervised Correspondence Learning: A Video Frame-level Similarity Perspective, in ICCV 2021 (Oral)

Language: Python | License: Apache-2.0 | 144 9 18

IMProv

IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks

Language: Python | 56 5 3

GroupViT

GroupViT: Semantic Segmentation Emerges from Text Supervision

Language: Python | License: NOASSERTION | 24 10

mmdetection

Open MMLab Detection Toolbox with PyTorch 1.0

Language: Python | License: Apache-2.0 | 8 30

ODISE

ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models

Language: Python | License: NOASSERTION | 200

OFA-fairseq

fairseq from OFA

Language: Python | License: MIT | 1 10

prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

License: MIT | 100

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language: Python | License: MIT | 1 10

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | 000

CenterNet2

Two-stage CenterNet

Language: Python | License: Apache-2.0 | 000

davis2017-evaluation

Evaluation Framework for DAVIS 2017 Semi-supervised and Unsupervised used in the DAVIS Challenges

Language: Python | 010

DeepSegmentor

A Pytorch implementation of DeepCrack and RoadNet projects.

Language: Python | License: NOASSERTION | 000

detectron2

Detectron2 is FAIR's next-generation platform for object detection and segmentation.

Language: Python | License: Apache-2.0 | 010

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | 000

fvcore

Collection of common code that's shared among different research projects in FAIR computer vision team.

Language: Python | License: Apache-2.0 | 000

litgpt

Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

License: Apache-2.0 | 000

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | 000

lm-evaluation-harness

A framework for few-shot evaluation of language models.

Language: Python | License: MIT | 000

LWM

Language: Python | License: Apache-2.0 | 000

Mask2Former

Code release for "Masked-attention Mask Transformer for Universal Image Segmentation"

Language: Python | License: MIT | 000

mmaction2

OpenMMLab's Next Generation Action Understanding Toolbox and Benchmark

Language: Python | License: Apache-2.0 | 010

mmcv

Open MMLab Computer Vision Foundation

Language: Python | License: Apache-2.0 | 010

mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Language: Python | License: Apache-2.0 | 010

OFA

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

Language: Python | License: Apache-2.0 | 000

panopticapi

COCO 2018 Panoptic Segmentation Task API (Beta version)

Language: Python | License: NOASSERTION | 000

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

Language: Python | License: Apache-2.0 | 010

stable-diffusion

Language: Jupyter Notebook | License: NOASSERTION | 000

torchtune

A Native-PyTorch Library for LLM Fine-tuning

License: BSD-3-Clause | 000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language: Python | License: Apache-2.0 | 000

visual_prompting

Official implementation and data release of the paper "Visual Prompting via Image Inpainting".

Language: Jupyter Notebook | 000