mvasil's repositories
fashion-compatibility
Learning Type-Aware Embeddings for Fashion Compatibility
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
ai-audio-startups
Community list of startups working with AI in audio and music technology
animatable_nerf
Code for "Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies" ICCV 2021
animatediff-cli-prompt-travel
animatediff prompt travel
animatediff-kaiber
Improved AnimateDiff with a number of improvements
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
BiFormer
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation, CVPR2023
CoDeF
Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing
Gen-L-Video
The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".
Grounded-Segment-Anything
Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
inanimate
Generate images from an initial frame and text
MRL
Code repository for the paper - "Matryoshka Representation Learning"
roop
one-click deepfake (face swap)
BrushNet
[ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"
co-tracker
CoTracker is a model for tracking any point (pixel) on a video.
CVPR23_LFDM
The pytorch implementation of our CVPR 2023 paper "Conditional Image-to-Video Generation with Latent Flow Diffusion Models"
cycle-diffusion
[ICCV 2023] Zero-shot image editing with stochastic diffusion models
DreamPose
Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"
FateZero
[ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
Inpaint-Anything
Inpaint anything using Segment Anything and inpainting models.
KGI
This is the code repo for ICCV23 paper Virtual Try-On with Garment-Pose Keypoints Guided Inpainting
lora
Using Low-rank adaptation to quickly fine-tune diffusion models.
stable-audio-tools
Generative models for conditional audio generation
text2cinemagraph
Official Pytorch implementation of Artistic Cinemagraph: Synthesizing Artistic Cinemagraphs from Text
videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.