Hengkai Guo's starred repositories
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
ml-fastvit
This repository contains the official implementation of the research paper "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" (ICCV 2023)
rq-vae-transformer
The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)
LooseControl
Lifting ControlNet for Generalized Depth Conditioning
Awesome-AIGC-3D
A curated list of awesome AIGC 3D papers
BakedAvatar
PyTorch code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"
DriveDreamer
DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving
MVDiffusion_plusplus
MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction
compose-and-conquer
[ICLR 2024] Official repository for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis
Context-PIPs
Source code for the paper "Context-PIPs: Persistent Independent Particles Demands Spatial Context Features" (NeurIPS 2023).