GX Kok's repositories
Music-Demixing-with-Band-Split-RNN
An unofficial PyTorch implementation of Music Source Separation with Band-split RNN for MDX-23 ("Label Noise" Track)
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
bark
🔊 Text-Prompted Generative Audio Model
clarity-upscaler
Clarity AI | AI Image Upscaler & Enhancer - free and open-source Magnific Alternative
cog-musicgen-fine-tuner
This is a cog implementation of the fine-tuner for Meta's MusicGen
DemoFusion
Let us democratise high-resolution generation! (CVPR 2024)
demucs
Code for the paper Hybrid Spectrogram and Waveform Source Separation
DragGAN
Official Code for DragGAN (SIGGRAPH 2023)
EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
generative-models
Generative Models by Stability AI
HDTF
the dataset and code for "Flow-guided One-shot Talking Face Generation with a High-resolution Audio-visual Dataset"
ImageBind
ImageBind One Embedding Space to Bind Them All
LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
MuseTalk
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
Panda-70M
[CVPR 2024] Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers
riffusion
Stable diffusion for real-time music generation
SEINE
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction
Simple-Magic-Animate
A simple magic animate pipeline including densepose inference.
TranSalNet
TranSalNet: Towards perceptually relevant visual saliency prediction. Neurocomputing (2022)
Unconditional-MusicGen-Trainer
fine-tuning MusicGen without prompts to generate music with a specific style
VideoCrafter
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Wav2Lip-GFPGAN
High quality Lip sync
wunjo.wladradchenko.ru
Wunjo AI: Synthesize & clone voices in English, Russian & Chinese, real-time speech recognition, deepfake face & lips animation, face swap with one photo, change video by text prompts, segmentation, and retouching. Open-source, local & free.