yxchng's starred repositories

Patch-DM

Code Release for Patch-DM (ICLR 2024)

Language: Python · Stargazers: 30 · Issues: 0

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language: Python · License: Apache-2.0 · Stargazers: 6321 · Issues: 0

sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Language: Python · License: Apache-2.0 · Stargazers: 3625 · Issues: 0

X-Decoder

[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language

Language: Python · License: Apache-2.0 · Stargazers: 1280 · Issues: 0

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

Stargazers: 1124 · Issues: 0

MasQCLIP

(ICCV 2023) MasQCLIP for Open-Vocabulary Universal Image Segmentation

Language: Python · License: NOASSERTION · Stargazers: 30 · Issues: 0

FreestyleNet

[CVPR 2023 Highlight] Freestyle Layout-to-Image Synthesis

Language: Python · License: MIT · Stargazers: 142 · Issues: 0

FreeMask

[NeurIPS 2023] FreeMask: Synthetic Images with Dense Annotations Make Stronger Segmentation Models

Language: Python · License: MIT · Stargazers: 128 · Issues: 0

StableLLAVA

Official repo for StableLLAVA

Language: Python · License: Apache-2.0 · Stargazers: 90 · Issues: 0

daam

Diffusion attentive attribution maps for interpreting Stable Diffusion.

Language: Jupyter Notebook · License: MIT · Stargazers: 659 · Issues: 0

End-to-end-Autonomous-Driving

[IEEE T-PAMI] All you need for End-to-end Autonomous Driving

License: MIT · Stargazers: 1969 · Issues: 0

Language: Python · License: Apache-2.0 · Stargazers: 122 · Issues: 0

fc-clip

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Language: Python · License: Apache-2.0 · Stargazers: 271 · Issues: 0

deeplab2

DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.

Language: Python · License: Apache-2.0 · Stargazers: 995 · Issues: 0

kmax-deeplab

A PyTorch re-implementation of the ECCV 2022 paper "k-means Mask Transformer", based on Detectron2.

Language: Python · License: Apache-2.0 · Stargazers: 66 · Issues: 0

enhancing-transformers

An unofficial implementation of both ViT-VQGAN and RQ-VAE in PyTorch

Language: Python · License: MIT · Stargazers: 276 · Issues: 0

vit-vqgan

JAX implementation of ViT-VQGAN

Language: Python · License: MIT · Stargazers: 77 · Issues: 0

BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language: Python · License: Apache-2.0 · Stargazers: 156 · Issues: 0

maskgit

Official JAX Implementation of MaskGIT

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 424 · Issues: 0

Language: Python · Stargazers: 52 · Issues: 0

Language: Python · License: Apache-2.0 · Stargazers: 122 · Issues: 0

Language: Jupyter Notebook · License: MIT · Stargazers: 220 · Issues: 0

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language: Python · License: Apache-2.0 · Stargazers: 497 · Issues: 0

Language: Python · License: Apache-2.0 · Stargazers: 67 · Issues: 0

MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Language: Python · License: MIT · Stargazers: 344 · Issues: 0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python · License: NOASSERTION · Stargazers: 5929 · Issues: 0

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language: Jupyter Notebook · License: MIT · Stargazers: 878 · Issues: 0

PCT

This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)

Language: Python · License: MIT · Stargazers: 310 · Issues: 0

Awesome-Occupancy-Prediction-Autonomous-Driving

Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy

Stargazers: 208 · Issues: 0