YongtaoGe

Ge Yongtao's starred repositories

codellama

Inference code for CodeLlama models

Language:PythonNOASSERTION1567300

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookApache-2.0468300

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonApache-2.091700

PonderV2

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Language:PythonMIT31000

habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Language:PythonMIT184200

tabilize

Simple code for generating a color-coded latex table from raw data

Language:Jupyter Notebook14700

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

Language:Python7100

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language:PythonNOASSERTION38400

t2motion

Official implementation of Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions. (ICCV2023)

Language:PythonMIT2000

CLAP

Contrastive Language-Audio Pretraining

Language:PythonCC0-1.0126700

DiffPoseTalk

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models

11400

PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Language:PythonNOASSERTION88100

LivelySpeaker

[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".

Language:Python6900

FineDance

FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation. (ICCV2023)

Language:PythonNOASSERTION10700

xrfeitoria

OpenXRLab Synthetic Data Rendering Toolbox

Language:PythonApache-2.021500

InstructCV

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language:PythonNOASSERTION51600

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)

Language:PythonMIT14100

Imitator

Language:Python14900

Awesome-Open-Vocabulary-Semantic-Segmentation

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

32400

UniHSI

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Language:Python14400

moyo_toolkit

This is a repository for download, preprocessing, visualizing, running evaluations on the MOYO dataset.

Language:PythonNOASSERTION6100

AnthroNet

Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements

Language:Rich Text FormatNOASSERTION2900

SMPL-Anthropometry

Measure the SMPL body model

Language:PythonMIT14400

metrabs

Estimate absolute 3D human poses from RGB images.

Language:PythonMIT44500

T2M-GPT

(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”

Language:PythonApache-2.055900

lm-listener

Implementation for the paper "Can Language Models Learn to Listen?"

Language:Python5900