guohengkai

Hengkai Guo's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language:PythonNOASSERTION21898 175 170

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonMIT3648 111 65

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonApache-2.03342 175 92

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language:PythonApache-2.02417 35 97

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language:PythonMIT788 17 35

Paint3D

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model

Language:PythonApache-2.0558 61 10

SpaTracker

[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space

Language:PythonNOASSERTION522 65 19

gaussian-opacity-fields

Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes

Language:PythonNOASSERTION504 29 50

Arc2Face

Arc2Face: A Foundation Model of Human Faces

Language:PythonMIT468 15 18

mickey

[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

Language:PythonNOASSERTION338 11 10

GaussianAvatar

[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"

Language:PythonMIT307 17 30

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

268 49 2

CosmicMan

221 36 6

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Language:PythonApache-2.020700

dot

Dense Optical Tracking: Connecting the Dots

Language:PythonMIT203 12 15

DG-Mesh

Dynamic Gaussian Mesh: Consistent Mesh Reconstruction from Monocular Videos

20000

dmesh

Official implementation for "DMesh: A Differentiable Representation for General Meshes".

Language:PythonMIT195 7 5

acezero

ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

17100

MVEdit

[WIP] Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Language:JavaScriptMIT167 8 8

Paint-it

[CVPR'24] Official PyTorch Implementation of "Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering"

Language:PythonMIT163 17 14

realmdreamer

Code for RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion [Arxiv 2024]

16000

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language:PythonApache-2.0132 5 8

affordance_diffusion

Codes for "Affordance Diffusion: Synthesizing Hand-Object Interactions"

Language:Python94 5 16

DreamReward

DreamReward: Text-to-3D Generation with Human Preference

MIT93 8 3

mvdfusion

[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation

Language:PythonMIT89 4 8

CricaVPR

Official repository for the CVPR 2024 paper "CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition".

Language:PythonMIT86 1 9

DTC123

The official PyTorch implementation of Diffusion Time-step Curriculum for One Image to 3D Generation (CVPR 2024)

72 6 1

svd-mv

Unofficial Implementation of "Stable Video Diffusion Multi-View"

Language:PythonMIT66 4 2

MemFlow

[CVPR 2024] MemFlow: Optical Flow Estimation and Prediction with Memory

Language:PythonApache-2.061 9 2

pram

official implementation of PRAM: Place Recognition Anywhere Model for Efficient Visual Localization

Language:PythonNOASSERTION4400