mvasil

mvasil's repositories

fashion-compatibility

Learning Type-Aware Embeddings for Fashion Compatibility

Language:PythonBSD-3-Clause152 9 32

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT100

StableVideo

[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Language:PythonApache-2.0100

ai-audio-startups

Community list of startups working with AI in audio and music technology

Apache-2.0000

animatable_nerf

Code for "Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies" ICCV 2021

Language:PythonNOASSERTION000

animatediff-cli-prompt-travel

animatediff prompt travel

Language:PythonApache-2.0000

animatediff-kaiber

Improved AnimateDiff with a number of improvements

Language:PythonApache-2.0000

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonMIT000

BiFormer

BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation, CVPR2023

Language:PythonApache-2.0000

CoDeF

Official PyTorch implementation of CoDeF: Content Deformation Fields for Temporally Consistent Video Processing

Language:PythonNOASSERTION000

Gen-L-Video

The official implementation for "Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising".

Language:Jupyter NotebookApache-2.0000

Grounded-Segment-Anything

Grounded-SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Language:Jupyter NotebookApache-2.0000