Ge Yongtao (YongtaoGe)

YongtaoGe

Geek Repo

Company:The University of Adelaide

Location:Hangzhou

Home Page:yongtaoge.github.io

Github PK Tool:Github PK Tool

Ge Yongtao's starred repositories

codellama

Inference code for CodeLlama models

Language:PythonLicense:NOASSERTIONStargazers:15673Issues:0Issues:0

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4683Issues:0Issues:0

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:917Issues:0Issues:0
Language:PythonStargazers:33Issues:0Issues:0

PonderV2

PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm

Language:PythonLicense:MITStargazers:310Issues:0Issues:0

habitat-lab

A modular high-level library to train embodied AI agents across a variety of tasks and environments.

Language:PythonLicense:MITStargazers:1842Issues:0Issues:0

T2I-Adapter

T2I-Adapter

Language:PythonStargazers:3329Issues:0Issues:0

tabilize

Simple code for generating a color-coded latex table from raw data

Language:Jupyter NotebookStargazers:147Issues:0Issues:0

OIR

[ICLR 2024] Official PyTorch/Diffusers implementation of "Object-aware Inversion and Reassembly for Image Editing"

Language:PythonStargazers:71Issues:0Issues:0
License:MITStargazers:67Issues:0Issues:0

PHC

Official Implementation of the ICCV 2023 paper: Perpetual Humanoid Control for Real-time Simulated Avatars

Language:PythonLicense:NOASSERTIONStargazers:384Issues:0Issues:0

t2motion

Official implementation of Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions. (ICCV2023)

Language:PythonLicense:MITStargazers:20Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:117Issues:0Issues:0

CLAP

Contrastive Language-Audio Pretraining

Language:PythonLicense:CC0-1.0Stargazers:1267Issues:0Issues:0

DiffPoseTalk

DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models

Stargazers:114Issues:0Issues:0

PantoMatrix

PantoMatrix: Co-Speech Talking Head and Gestures Generation

Language:PythonLicense:NOASSERTIONStargazers:881Issues:0Issues:0

LivelySpeaker

[ICCV-2023] The official repo for the paper "LivelySpeaker: Towards Semantic-aware Co-Speech Gesture Generation".

Language:PythonStargazers:69Issues:0Issues:0

FineDance

FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation. (ICCV2023)

Language:PythonLicense:NOASSERTIONStargazers:107Issues:0Issues:0

xrfeitoria

OpenXRLab Synthetic Data Rendering Toolbox

Language:PythonLicense:Apache-2.0Stargazers:215Issues:0Issues:0

InstructCV

[ ICLR 2024 ] Official Codebase for "InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists"

Language:PythonLicense:NOASSERTIONStargazers:516Issues:0Issues:0

DiffuseStyleGesture

DiffuseStyleGesture: Stylized Audio-Driven Co-Speech Gesture Generation with Diffusion Models (IJCAI 2023) | The DiffuseStyleGesture+ entry to the GENEA Challenge 2023 (ICMI 2023, Reproducibility Award)

Language:PythonLicense:MITStargazers:141Issues:0Issues:0
Language:PythonStargazers:149Issues:0Issues:0

Awesome-Open-Vocabulary-Semantic-Segmentation

A curated publication list on open vocabulary semantic segmentation and related area (e.g. zero-shot semantic segmentation) resources..

Stargazers:324Issues:0Issues:0

UniHSI

[ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts

Language:PythonStargazers:144Issues:0Issues:0

moyo_toolkit

This is a repository for download, preprocessing, visualizing, running evaluations on the MOYO dataset.

Language:PythonLicense:NOASSERTIONStargazers:61Issues:0Issues:0

AnthroNet

Unity's Privacy-Preserving Novel Human Body Model Trained Solely on Synthetic Data and Corresponding Dense Anthropometric Measurements

Language:Rich Text FormatLicense:NOASSERTIONStargazers:29Issues:0Issues:0

SMPL-Anthropometry

Measure the SMPL body model

Language:PythonLicense:MITStargazers:144Issues:0Issues:0

metrabs

Estimate absolute 3D human poses from RGB images.

Language:PythonLicense:MITStargazers:445Issues:0Issues:0

T2M-GPT

(CVPR 2023) Pytorch implementation of “T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Representations”

Language:PythonLicense:Apache-2.0Stargazers:559Issues:0Issues:0

lm-listener

Implementation for the paper "Can Language Models Learn to Listen?"

Language:PythonStargazers:59Issues:0Issues:0