Hengkai Guo (guohengkai)

guohengkai

Geek Repo

Company:ByteDance Inc

Home Page:http://guohengkai.github.io/

Github PK Tool:Github PK Tool

Hengkai Guo's starred repositories

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonLicense:Apache-2.0Stargazers:29750Issues:304Issues:864

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:17974Issues:141Issues:251

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:7926Issues:78Issues:27
Language:PythonLicense:Apache-2.0Stargazers:6812Issues:66Issues:61

gemma_pytorch

The official PyTorch implementation of Google's Gemma models

Language:PythonLicense:Apache-2.0Stargazers:5025Issues:39Issues:33

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

ml-fastvit

This repository contains the official implementation of the research paper, "FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization" ICCV 2023

Language:PythonLicense:NOASSERTIONStargazers:1736Issues:32Issues:0

LGM

LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation.

Language:PythonLicense:MITStargazers:1156Issues:26Issues:46

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:846Issues:72Issues:20

rq-vae-transformer

The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:687Issues:16Issues:21

LooseControl

Lifting ControlNet for Generalized Depth Conditioning

Language:PythonLicense:MITStargazers:399Issues:15Issues:16

xrslam

OpenXRLab Visual-inertial SLAM Toolbox and Benchmark

Language:C++License:Apache-2.0Stargazers:367Issues:15Issues:26

Awesome-AIGC-3D

A curated list of awesome AIGC 3D papers

License:MITStargazers:358Issues:11Issues:0

BakedAvatar

Pytorch Code for "BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis"

Language:PythonLicense:MITStargazers:263Issues:14Issues:12

DriveDreamer

DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving

GALA3D

GALA3D: Towards Text-to-3D Complex Scene Generation via Layout-guided Generative Gaussian Splatting

OASim

OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving

Language:PythonLicense:Apache-2.0Stargazers:164Issues:9Issues:5

EscherNet

[CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis

Language:PythonLicense:NOASSERTIONStargazers:152Issues:8Issues:6

MVDiffusion_plusplus

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

porf

(ICLR 2024) PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction

Language:PythonLicense:MITStargazers:104Issues:5Issues:3

compose-and-conquer

[ICLR 2024] Official repo. for Compose and Conquer: Diffusion-Based 3D Depth Aware Composable Image Synthesis

Language:PythonLicense:MITStargazers:95Issues:4Issues:0

AToM

Official implementation of `AToM: Amortized Text-to-Mesh using 2D Diffusion`

gigapose

[CVPR 2024] PyTorch implementation of GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence

Language:PythonLicense:MITStargazers:71Issues:5Issues:8

M3DBench

M3DBench introduces a comprehensive 3D instruction-following dataset with support for interleaved multi-modal prompts. Furthermore, M3DBench provides a new benchmark to assess large models across 3D vision-centric tasks.

Language:PythonLicense:Apache-2.0Stargazers:37Issues:5Issues:3

Context-PIPs

Source code for paper Context-PIPs: Persistent Independent Particles Demands Spatial Context Features, NeurIPS 2023.

Language:PythonLicense:Apache-2.0Stargazers:8Issues:2Issues:2