Hongwei Han's repositories
4D-Humans
4DHumans: Reconstructing and Tracking Humans with Transformers
AnimateDiff
Official implementation of AnimateDiff.
Artemis
[SIGGRAPH 2022] ARTEMIS, a novel neural modeling and rendering pipeline for generating ARTiculated neural pets with appEarance and Motion synthesIS.
AudioLDM2
Text-to-Audio/Music Generation
CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023)
comfyui-inpaint-nodes
Nodes for better inpainting with ComfyUI: Fooocus inpaint model for SDXL, LaMa, MAT, and various other tools for pre-filling inpaint & outpaint areas.
EDGE
Official PyTorch Implementation of EDGE (CVPR 2023)
GlaDOS
This is the Personality Core for GLaDOS, the first steps towards a real-life implementation of the AI from the Portal series by Valve.
Grounded-Segment-Anything
Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Audio Inputs
HumanML3D
HumanML3D: A large and diverse 3D human motion-language dataset.
ListenDenoiseAction
Code to reproduce the results for our SIGGRAPH 2023 paper "Listen Denoise Action"
momask-codes
Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions"
motion-diffusion-model
The official PyTorch implementation of the paper "Human Motion Diffusion Model"
motion-latent-diffusion
(CVPR 2023) Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
MotionGPT
The official PyTorch implementation of the paper "MotionGPT: Finetuned LLMs are General-Purpose Motion Generators"
MotionGPT-TX
MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
PG-Video-LLaVA
PG-Video-LLaVA: Pixel Grounding in Large Multimodal Video Models
PHALP
Code repository for the paper "Tracking People by Predicting 3D Appearance, Location & Pose". (CVPR 2022 Oral)
priorMDM
The official implementation of the paper "Human Motion Diffusion as a Generative Prior"
ReMoDiffuse
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model
SMPLer-X
Official Code for "SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation"
stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
svox_t
A differentiable dynamic feature-level octree and renderer implementation as a PyTorch CUDA extension for ARTEMIS.
T2M-GPT
(CVPR 2023) PyTorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”
text-to-motion-retrieval
Official code for reproducing results obtained in the short paper "Text-to-Motion Retrieval: Towards Joint Understanding of Human Motion Data and Natural Language", accepted at SIGIR 2023.
zero123
Zero-1-to-3: Zero-shot One Image to 3D Object (ICCV 2023)