fffiloni

Sylvain Filoni's repositories

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonApache-2.01200

pyChatGPT

An unofficial Python wrapper for OpenAI's ChatGPT API

Language:PythonGPL-3.0600

Hotshot-XL

✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL

Language:PythonApache-2.0500

dreamtalk

Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Language:PythonMIT300

daclip-uir

PyTorch implementation of the paper "Controlling Vision-Language Models for Universal Image Restoration"

Language:PythonMIT200

OOTDiffusion

Official implementation of OOTDiffusion

NOASSERTION200

style-aligned

Official code for "Style Aligned Image Generation via Shared Attention"

Language:PythonApache-2.0200

AnyV2V

A Plug-and-Play Framework For Any Video-to-Video Editing Tasks. Now with gradio demo

Language:Jupyter NotebookMIT100

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT100

champ

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Apache-2.0100

DemoFusion

Let us democratise high-resolution generation! (arXiv 2023)

Language:Jupyter Notebook100

PASD

Language:Python100

Real3DPortrait

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

100

AniPortrait

AniPortrait with Gradio: Audio-Driven Synthesis of Photorealistic Portrait Animation

Language:PythonApache-2.0000

BasicPBC

Official Implementation of "Learning Inclusion Matching for Animation Paint Bucket Colorization"

Language:PythonNOASSERTION000

CameraCtrl

Language:PythonApache-2.0000

cog-autocaption

Add caption to any video

000

DiffBIR

Language:PythonApache-2.0000

diffusion-motion-transfer

Official Pytorch Implementation for "Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer""

000

DragNUWA

Language:PythonMIT000

metavoice-src

AI for human-level speech intelligence

Language:PythonApache-2.0000

Open-Sora-Plan-v1-0-0

This project aim to reproduce Sora (Open AI T2V model), but we only have limited resource. We deeply wish the all open source community can contribute to this project.

MIT000

ProPainter

[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting

Language:PythonNOASSERTION000

pytorch-image-models

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more

Language:PythonApache-2.0000

radio-olympiades

Language:JavaScript020

StoryDiffusion

Create Magic Story!

000

TAO-Amodal

Official Code for Tracking Any Object Amodally

Language:Python000

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.0000

zest_code

This is the official implementation of ZeST

Language:Jupyter Notebook000

fffiloni

Sylvain Filoni's repositories

video-retalking

pyChatGPT

Hotshot-XL

dreamtalk

daclip-uir

MiniGPT4-video

OOTDiffusion

style-aligned

AnyV2V

audiocraft

champ

DemoFusion

PASD

Real3DPortrait

AniPortrait

BasicPBC

CameraCtrl

cog-autocaption

DiffBIR

diffusion-motion-transfer

DragNUWA

metavoice-src

Open-Sora-Plan-v1-0-0

ProPainter

pytorch-image-models

radio-olympiades

StoryDiffusion

TAO-Amodal

TTS

zest_code