nazmul-karim170 / SAVE-Text2Video-Diffusion

Implementation of "SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing" Paper

Home Page:https://save-textguidedvideoediting.github.io/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

If you like our project, please give us a star โญ on GitHub for the latest update.

webpage arXiv License: MIT

๐Ÿ˜ฎ Highlights

SAVE allows you to edit your video in a matter of 3 minutes! instead of 30 minutes! in SOTA.

๐Ÿ’ก Efficient, High-quality, and Fast-speed

  • Stable Diffusion (SD) for image generation --> high-quality
  • Only fine-tune the Singular Values of the Query Matrices --> Efficient Adaptation
  • Regularize the singular value updates

๐Ÿšฉ Updates

Welcome to watch ๐Ÿ‘€ this repository for the latest updates.

โœ… [2023.06.07] : We have released our code

โœ… [2023.12.01] : We have released our paper, SAVE on arXiv.

โœ… [2023.12.01] : Release project page.

๐Ÿ› ๏ธ Methodology

Implementation of SAVE Algorithm.

First, create a conda environment using this

conda create -n save

First Install the following packages-

pip install -r requirements.txt

Run the following command to edit a given video.

python Edit_Video_SAVE.py

Change the "--config" option in arguments to provide a new video.

๐Ÿš€ Video-Editing Results

Qualitative comparison

Quantitative comparison

๐Ÿ‘ Acknowledgement

This work is built on many amazing research works and open-source projects, thanks a lot to all the authors for sharing!

โœ๏ธ Citation

If you find our paper and code useful in your research, please consider giving a star โญ and a citation ๐Ÿ“.

@misc{karim2023save,
      title={SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing}, 
      author={Nazmul Karim and Umar Khalid and Mohsen Joneidi and Chen Chen and Nazanin Rahnavard},
      year={2023},
      eprint={2305.18670},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

About

Implementation of "SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing" Paper

https://save-textguidedvideoediting.github.io/

License:MIT License


Languages

Language:Python 100.0%