Ge Yongtao's starred repositories
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
habitat-lab
A modular high-level library to train embodied AI agents across a variety of tasks and environments.
T2I-Adapter
T2I-Adapter
DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
PantoMatrix
PantoMatrix: Co-Speech Talking Head and Gestures Generation
LivelySpeaker
[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".
xrfeitoria
OpenXRLab Synthetic Data Rendering Toolbox
InstructCV
[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"
DiffuseStyleGesture
DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)
Awesome-Open-Vocabulary-Semantic-Segmentation
A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..
moyo_toolkit
This is a repository for download, preprocessing, visualizing, running evaluations on the MOYO dataset.
SMPL-Anthropometry
Measure the SMPL body model
lm-listener
Implementation for the paper "Can Language Models Learn to Listen?"