Levon Dang (Droliven)

Droliven

Geek Repo

Company:South China University of Technology, @shuopensourcecommunity

Location:Guangzhou, Guangdong, China

Github PK Tool:Github PK Tool


Organizations
shuosc

Levon Dang's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:64980Issues:542Issues:0

GFPGAN

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Language:PythonLicense:NOASSERTIONStargazers:35166Issues:504Issues:465

insightface

State-of-the-art 2D and 3D Face Analysis Project

Language:PythonLicense:MITStargazers:22123Issues:503Issues:2439

stable-diffusion-webui-colab

stable diffusion webui colab

Language:Jupyter NotebookLicense:UnlicenseStargazers:15491Issues:186Issues:348

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Language:PythonLicense:Apache-2.0Stargazers:10678Issues:184Issues:1892

magic-animate

[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonLicense:BSD-3-ClauseStargazers:10206Issues:102Issues:143

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:9909Issues:102Issues:334

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

awesome-ai-agents

A list of AI autonomous agents

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6151Issues:71Issues:230

Realtime_Multi-Person_Pose_Estimation

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:5085Issues:259Issues:236

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4229Issues:61Issues:92

sd-webui-animatediff

AnimateDiff for AUTOMATIC1111 Stable Diffusion WebUI

Language:PythonLicense:NOASSERTIONStargazers:2963Issues:22Issues:362

ComfyUI-AnimateDiff-Evolved

Improved AnimateDiff for ComfyUI and Advanced Sampling Support

Language:PythonLicense:Apache-2.0Stargazers:2378Issues:24Issues:284

sd-webui-mov2mov

This is the Mov2mov plugin for Automatic1111/stable-diffusion-webui.

Language:PythonLicense:MITStargazers:2110Issues:28Issues:137

comfyui-portrait-master-zh-cn

肖像大师 中文版 comfyui-portrait-master

Language:PythonLicense:GPL-3.0Stargazers:1520Issues:17Issues:0

AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

FollowYourPose

[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"

Language:PythonLicense:MITStargazers:1172Issues:25Issues:49

408Bester

这里有着计算机考研408的详细路线,每个月的学习规划和所有视频书籍资源,计算机考研必看仓库

infinite-zoom-automatic1111-webui

infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion

Language:PythonLicense:MITStargazers:655Issues:9Issues:62

MagicDance

[ICML 2024] MagicPose(also known as MagicDance): Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion

Language:PythonLicense:NOASSERTIONStargazers:631Issues:32Issues:37

CLIP-Chinese

中文CLIP预训练模型

Iridescent

Solid data structure and algorithms

Language:Jupyter NotebookLicense:GPL-3.0Stargazers:356Issues:11Issues:4
Language:PythonLicense:NOASSERTIONStargazers:262Issues:33Issues:31

HyperLips

Pytorch official implementation for our paper "HyperLips: Hyper Control Lips with High Resolution Decoder for Talking Face Generation".