LeslieZhoa

LeslieZhao's starred repositories

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language: Jupyter Notebook | MIT | 61394 525 108

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | NOASSERTION | 34834 310 875

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | Apache-2.0 | 29828 195 4687

llama3

The official Meta Llama 3 GitHub site

Language: Python | NOASSERTION | 25822 212 230

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language: Python | NOASSERTION | 5931 46 78

ComfyUI-Workflows-ZHO

My ComfyUI workflows collection

GPL-3.0 | 4607 37 11

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | MIT | 3568 177 112

IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language: Python | 3479 53 131

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language: Python | MIT | 3038 69 203

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | Apache-2.0 | 2817 28 178

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language: Jupyter Notebook | MIT | 2702 33 97

stable-diffusion-tutorial

The most comprehensive Stable Diffusion tutorial series on the web, from beginner to advanced, three months in the making

1276 11 2

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language: Python | NOASSERTION | 1192 63 214

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language: Python | Apache-2.0 | 952 21 56

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language: Python | MIT | 862 24 74

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

826 55 12

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language: Jupyter Notebook | NOASSERTION | 702 10 41

Arc2Face

[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces

Language: Python | MIT | 543 16 25

parrots

Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engine. Chinese and English speech recognition, multi-speaker speech synthesis, multilingual support, high accuracy.

Language: Python | Apache-2.0 | 457 12 27

Make-Your-Anchor

[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.

Language: Python | 299 30 12

TalkSHOW

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

Language: Python | 286 12 31

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language: Jupyter Notebook | MIT | 264 8 19

EDTalk

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

Language: Python | Apache-2.0 | 250 15 23

variational-inference-with-normalizing-flows

Reimplementation of Variational Inference with Normalizing Flows (https://arxiv.org/abs/1505.05770)

Language: Python | 223 7 4

SHOW

This is the codebase for SHOW in Generating Holistic 3D Human Motion from Speech [CVPR2023].

Language: Python | NOASSERTION | 209 4 35

Co-Speech-Motion-Generation

Freeform Body Motion Generation from Speech

Language: Python | 193 8 25

LN3Diff

[ECCV-2024] LN3Diff creates high-quality 3D object mesh from text within 8 V100-SECONDS.

Language: Python | NOASSERTION | 128 11 2

MCGaze

[IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

Language: Python | MIT | 37 2 8

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language: Python | Apache-2.0 | 1800

C2G2

Official implementation for C2G2: Controllable Co-speech Gesture generation.

Language: Python | 9 2 3