wangbo-zhao

[ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging techniques, while incorporating a differentiable compression rate.

Language:Jupyter Notebook000

DiST

ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning

Language:Python000

DRL

Deep Reinforcement Learning

NOASSERTION000

EfficientDM

[ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models"

Language:Jupyter NotebookMIT000

EVA

Exploring the Limits of Masked Visual Representation Learning at Scale (https://arxiv.org/abs/2211.07636)

Language:PythonMIT000

GenVIS

A Generalized Framework for Video Instance Segmentation

Language:Python000

langchain

⚡ Building applications with LLMs through composability ⚡

Language:PythonMIT000

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

Language:PythonBSD-3-Clause000

mmpretrain

OpenMMLab Pre-training Toolbox and Benchmark

Language:PythonApache-2.0000

mmselfsup

OpenMMLab Self-Supervised Learning Toolbox and Benchmark

Language:PythonApache-2.0000

OpenCompass is an LLM evaluation platform, supporting evaluation of 20+ HuggingFace & API models (LLaMA, ChatGPT, Claude, etc) over 50+ datasets. It enables fast, comprehensive benchmarking of large models using efficient distributed evaluation techniques.

Language:PythonApache-2.0000

Papers-Literature-ML-DL-RL-AI

Highly cited and useful papers related to machine learning, deep learning, AI, game theory, reinforcement learning

MIT000

PixArt-alpha

Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonAGPL-3.0000

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0000

T-Stitch

Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stitching"

Language:Jupyter NotebookNOASSERTION000

tuning_playbook

A playbook for systematically maximizing the performance of deep learning models.

NOASSERTION000

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookMIT000

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT000

wangbo-zhao

Wangbo Zhao(明先生)'s repositories

OpenMMLab-BoxInst

2022CVPR-MMMMTBVS

2021TIP-SCG

Latte

AOT

aot-benchmark

binsformer

ColossalAI

DiffRate

DiST

DRL

DualPath

EfficientDM

EVA

GenVIS

langchain

MiniGPT-4

mmpretrain

mmselfsup

opencompass

Papers-Literature-ML-DL-RL-AI

PixArt-alpha

segment-anything

T-Stitch

tuning_playbook

U-ViT

VAR

VITA

wangbo-zhao

X-Decoder