LeslieZhao (LeslieZhoa)

LeslieZhoa

Geek Repo

Location:China

Github PK Tool:Github PK Tool

LeslieZhao's starred repositories

generative-ai-for-beginners

18 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Language:Jupyter NotebookLicense:MITStargazers:61394Issues:525Issues:108

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34834Issues:310Issues:875

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonLicense:Apache-2.0Stargazers:29828Issues:195Issues:4687

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:25822Issues:212Issues:230

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5931Issues:46Issues:78

ComfyUI-Workflows-ZHO

我的 ComfyUI 工作流合集 | My ComfyUI workflows collection

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:MITStargazers:3568Issues:177Issues:112

IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

motion-diffusion-model

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Language:PythonLicense:MITStargazers:3038Issues:69Issues:203

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language:PythonLicense:Apache-2.0Stargazers:2817Issues:28Issues:178

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

Language:Jupyter NotebookLicense:MITStargazers:2702Issues:33Issues:97

stable-diffusion-tutorial

全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

Language:PythonLicense:NOASSERTIONStargazers:1192Issues:63Issues:214

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonLicense:Apache-2.0Stargazers:952Issues:21Issues:56

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Language:PythonLicense:MITStargazers:862Issues:24Issues:74

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

FRESCO

[CVPR 2024] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:702Issues:10Issues:41

Arc2Face

[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces

Language:PythonLicense:MITStargazers:543Issues:16Issues:25

parrots

Automatic Speech Recognition(ASR), Text-To-Speech(TTS) engine. 中英语音识别、多角色语音合成,支持多语言,准确率高

Language:PythonLicense:Apache-2.0Stargazers:457Issues:12Issues:27

Make-Your-Anchor

[CVPR 2024] Make-Your-Anchor: A Diffusion-based 2D Avatar Generation Framework.

TalkSHOW

This is the official repository for TalkSHOW: Generating Holistic 3D Human Motion from Speech [CVPR2023].

B-LoRA

Implicit Style-Content Separation using B-LoRA

Language:Jupyter NotebookLicense:MITStargazers:264Issues:8Issues:19

EDTalk

[ECCV 2024 Oral] EDTalk - Official PyTorch Implementation

Language:PythonLicense:Apache-2.0Stargazers:250Issues:15Issues:23

variational-inference-with-normalizing-flows

Reimplementation of Variational Inference with Normalizing Flows (https://arxiv.org/abs/1505.05770)

SHOW

This is the codebase for SHOW in Generating Holistic 3D Human Motion from Speech [CVPR2023],

Language:PythonLicense:NOASSERTIONStargazers:209Issues:4Issues:35

Co-Speech-Motion-Generation

Freeform Body Motion Generation from Speech

LN3Diff

[ECCV-2024] LN3Diff creates high-quality 3D object mesh from text within 8 V100-SECONDS.

Language:PythonLicense:NOASSERTIONStargazers:128Issues:11Issues:2

MCGaze

[IEEE SPL] End-to-end Video Gaze Estimation via Capturing Head-face-eye Spatial-temporal Interaction Context

Language:PythonLicense:MITStargazers:37Issues:2Issues:8

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Language:PythonLicense:Apache-2.0Stargazers:18Issues:0Issues:0

C2G2

Official implementation for C2G2: Controllable Co-speech Gesture generation.