Hengkai Guo (guohengkai)

Company: ByteDance Inc

Home Page: http://guohengkai.github.io/

Hengkai Guo's starred repositories

llama3

The official Meta Llama 3 GitHub site

Language: Python | License: NOASSERTION | Stargazers: 21898 | Issues: 175 | Issues: 170

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language: Python | License: MIT | Stargazers: 3648 | Issues: 111 | Issues: 65

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | License: Apache-2.0 | Stargazers: 3342 | Issues: 175 | Issues: 92

InstantMesh

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

Language: Python | License: Apache-2.0 | Stargazers: 2417 | Issues: 35 | Issues: 97

flowmap

Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann

Language: Python | License: MIT | Stargazers: 788 | Issues: 17 | Issues: 35

Paint3D

Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models, a no lighting baked texture generative model

Language: Python | License: Apache-2.0 | Stargazers: 558 | Issues: 61 | Issues: 10

SpaTracker

[CVPR 2024 Highlight] Official PyTorch implementation of SpatialTracker: Tracking Any 2D Pixels in 3D Space

Language: Python | License: NOASSERTION | Stargazers: 522 | Issues: 65 | Issues: 19

gaussian-opacity-fields

Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes

Language: Python | License: NOASSERTION | Stargazers: 504 | Issues: 29 | Issues: 50

Arc2Face

Arc2Face: A Foundation Model of Human Faces

Language: Python | License: MIT | Stargazers: 468 | Issues: 15 | Issues: 18

mickey

[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

Language: Python | License: NOASSERTION | Stargazers: 338 | Issues: 11 | Issues: 10

GaussianAvatar

[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"

Language: Python | License: MIT | Stargazers: 307 | Issues: 17 | Issues: 30

OpenCLAY

CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D Assets

flowsam

Official Implementation of "Moving Object Segmentation: All You Need Is SAM (and Flow)" Junyu Xie, Charig Yang, Weidi Xie, Andrew Zisserman

Language: Python | License: Apache-2.0 | Stargazers: 207 | Issues: 0 | Issues: 0

dot

Dense Optical Tracking: Connecting the Dots

Language: Python | License: MIT | Stargazers: 203 | Issues: 12 | Issues: 15

DG-Mesh

Dynamic Gaussian Mesh: Consistent Mesh Reconstruction from Monocular Videos

Stargazers: 200 | Issues: 0 | Issues: 0

dmesh

Official implementation for "DMesh: A Differentiable Representation for General Meshes".

Language: Python | License: MIT | Stargazers: 195 | Issues: 7 | Issues: 5

acezero

ACE0 is a learning-based structure-from-motion approach that estimates camera parameters of sets of images by learning a multi-view consistent, implicit scene representation.

Stargazers: 171 | Issues: 0 | Issues: 0

MVEdit

[WIP] Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Language: JavaScript | License: MIT | Stargazers: 167 | Issues: 8 | Issues: 8

Paint-it

[CVPR'24] Official PyTorch Implementation of "Paint-it: Text-to-Texture Synthesis via Deep Convolutional Texture Map Optimization and Physically-Based Rendering"

Language: Python | License: MIT | Stargazers: 163 | Issues: 17 | Issues: 14

realmdreamer

Code for RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion [Arxiv 2024]

Stargazers: 160 | Issues: 0 | Issues: 0

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language: Python | License: Apache-2.0 | Stargazers: 132 | Issues: 5 | Issues: 8

affordance_diffusion

Codes for "Affordance Diffusion: Synthesizing Hand-Object Interactions"

DreamReward

DreamReward: Text-to-3D Generation with Human Preference

mvdfusion

[CVPR 2024] MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation

Language: Python | License: MIT | Stargazers: 89 | Issues: 4 | Issues: 8

CricaVPR

Official repository for the CVPR 2024 paper "CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition".

Language: Python | License: MIT | Stargazers: 86 | Issues: 1 | Issues: 9

DTC123

The official PyTorch implementation of Diffusion Time-step Curriculum for One Image to 3D Generation (CVPR 2024)

svd-mv

Unofficial Implementation of "Stable Video Diffusion Multi-View"

Language: Python | License: MIT | Stargazers: 66 | Issues: 4 | Issues: 2

MemFlow

[CVPR 2024] MemFlow: Optical Flow Estimation and Prediction with Memory

Language: Python | License: Apache-2.0 | Stargazers: 61 | Issues: 9 | Issues: 2

pram

Official implementation of PRAM: Place Recognition Anywhere Model for Efficient Visual Localization

Language: Python | License: NOASSERTION | Stargazers: 44 | Issues: 0 | Issues: 0