Sylvain Filoni's repositories
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
daclip-uir
PyTorch implementation of the paper "Controlling Vision-Language Models for Universal Image Restoration"
OOTDiffusion
Official implementation of OOTDiffusion
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
AniPortrait
AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation
BasicPBC
Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"
cog-autocaption
Add caption to any video
diffusion-motion-transfer
Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""
metavoice-src
AI for human-level speech intelligence
Open-Sora-Plan-v1-0-0
This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
StoryDiffusion
Create Magic Story!
TAO-Amodal
Official Code for Tracking Any Object Amodally
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
zest_code
This is the official implementation of ZeST