xiankgx

followers

following

stars

University of Malaya

Malaysia

GX Kok's repositories

inswapper

One-click Face Swapper and Restoration powered by insightface 🔥

Language:Python2 10

Music-Demixing-with-Band-Split-RNN

An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)

Language:Python100

SadTalker

[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

Language:PythonNOASSERTION1 10

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.01 10

AIT

Language:Python000

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookMIT000

clarity-upscaler

Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative

AGPL-3.0000

cog-musicgen-fine-tuner

This is a cog implementation of the fine-tuner for Meta's MusicGen

Language:PythonApache-2.0000

DemoFusion

Let us democratise high-resolution generation! (CVPR 2024)

Language:Jupyter Notebook000

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

MIT000

DragGAN

Official Code for DragGAN (SIGGRAPH 2023)

Language:PythonNOASSERTION000

EchoMimic

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Apache-2.0000

generative-models

Generative Models by Stability AI

Language:PythonNOASSERTION000

HDTF

the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"

Language:PythonGPL-3.0000

IF

Language:PythonNOASSERTION000

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION000

LipSick

🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮

Language:PythonUnlicense000

Moore-AnimateAnyone

Apache-2.0000

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonNOASSERTION000

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Apache-2.0000

Panda-70M

[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers

000

riffusion

Stable diffusion for real-time music generation

Language:PythonMIT000

SEINE

SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction

Language:PythonApache-2.0000

Simple-Magic-Animate

A simple magic animate pipeline including densepose inference.

Language:Python000

TalkingHead-1KH

NOASSERTION000

TranSalNet

TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)

Language:PythonMIT000

Unconditional-MusicGen-Trainer

fine-tuning MusicGen without prompts to generate music with a specific style

Language:PythonMIT000

VideoCrafter

VideoCrafter1: Open Diffusion Models for High-Quality Video Generation

NOASSERTION000

Wav2Lip-GFPGAN

High quality Lip sync

000

wunjo.wladradchenko.ru

Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.

MIT000