yxchng

DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.

Language:PythonApache-2.099500

kmax-deeplab

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Language:PythonApache-2.06600

enhancing-transformers

An unofficial implementation of both ViT-VQGAN and RQ-VAE in Pytorch

Language:PythonMIT27600

vit-vqgan

JAX implementation ViT-VQGAN

Language:PythonMIT7700

BEVT

PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529

Language:PythonApache-2.015600

maskgit

Official Jax Implementation of MaskGIT

Language:Jupyter NotebookApache-2.042400

DeepMIM

Language:Python5200

polygon-transformer

Language:PythonApache-2.012200

Asymmetric_VQGAN

Language:Jupyter NotebookMIT22000

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language:PythonApache-2.049700

Patch-Diffusion

Language:PythonApache-2.06700

MaskDiT

Code for Fast Training of Diffusion Models with Masked Transformers

Language:PythonMIT34400

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION592900

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookMIT87800

PCT

This is an official implementation of our CVPR 2023 paper "Human Pose as Compositional Tokens" (https://arxiv.org/pdf/2303.11638.pdf)

Language:PythonMIT31000

Awesome-Occupancy-Prediction-Autonomous-Driving

Awesome papers about Multi-Camera Semantic Occupancy Prediction, such as TPVFormer, OccFormer, Occ3D, OpenOccupancy

20800

Awesome-occupancy-perception

25500