Wangbo Zhao(明先生)'s repositories
OpenMMLab-BoxInst
The code for OpenmmLab challenge.
2022CVPR-MMMMTBVS
This is the code for CVPR2022 paper "Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation"
2021TIP-SCG
The code for SCG: Saliency and Contour Guided Salient Instance Segmentation
AOT
Associating Objects with Transformers for Video Object Segmentation
aot-benchmark
An efficient modular implementation of Associating Objects with Transformers for Video Object Segmentation in PyTorch
binsformer
Implementation of Binsformer code
ColossalAI
Making big AI models cheaper, easier, and scalable
DiffRate
[ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging techniques, while incorporating a differentiable compression rate.
DiST
ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning
DRL
Deep Reinforcement Learning
EfficientDM
[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"
EVA
Exploring the Limits of Masked Visual Representation Learning at Scale (https://arxiv.org/abs/2211.07636)
GenVIS
A Generalized Framework for Video Instance Segmentation
langchain
⚡ Building applications with LLMs through composability ⚡
MiniGPT-4
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
mmpretrain
OpenMMLab Pre-training Toolbox and Benchmark
mmselfsup
OpenMMLab Self-Supervised Learning Toolbox and Benchmark
opencompass
OpenCompass is an LLM evaluation platform, supporting evaluation of 20+ HuggingFace & API models (LLaMA, ChatGPT, Claude, etc) over 50+ datasets. It enables fast, comprehensive benchmarking of large models using efficient distributed evaluation techniques.
Papers-Literature-ML-DL-RL-AI
Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning
PixArt-alpha
Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
T-Stitch
Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"
tuning_playbook
A playbook for systematically maximizing the performance of deep learning models.
U-ViT
A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
VITA
VITA: Video Instance Segmentation via Object Token Association (NeurIPS 2022)
wangbo-zhao
Config files for my GitHub profile.
X-Decoder
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language