nazmul-karim170/SAVE-Text2Video-Diffusion

diffusion-models generative-ai text-to-image-generation text-to-video-generation video-editing

SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing

If you like our project, please give us a star ⭐ on GitHub for the latest update.

Project page | Paper

😮 Highlights

SAVE allows you to edit your video in a matter of 3 minutes! instead of 30 minutes! in SOTA.

💡 Efficient, High-quality, and Fast-speed

Stable Diffusion (SD) for image generation --> high-quality
Only fine-tune the Singular Values of the Query Matrices --> Efficient Adaptation
Regularize the singular value updates

🚩 Updates

Welcome to watch 👀 this repository for the latest updates.

✅ [2023.06.07] : We have released our code

✅ [2023.12.01] : We have released our paper, SAVE on arXiv.

✅ [2023.12.01] : Release project page.

🛠️ Methodology

Implementation of SAVE Algorithm.

First, create a conda environment using this

conda create -n save

First Install the following packages-

pip install -r requirements.txt

Run the following command to edit a given video.

python Edit_Video_SAVE.py

Change the "--config" option in arguments to provide a new video.

🚀 Video-Editing Results

Qualitative comparison

Quantitative comparison

👍 Acknowledgement

This work is built on many amazing research works and open-source projects, thanks a lot to all the authors for sharing!

✏️ Citation

If you find our paper and code useful in your research, please consider giving a star ⭐ and a citation 📝.

@misc{karim2023save,
      title={SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-driven Video Editing}, 
      author={Nazmul Karim and Umar Khalid and Mohsen Joneidi and Chen Chen and Nazanin Rahnavard},
      year={2023},
      eprint={2305.18670},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

About

Implementation of "SAVE: Spectral-Shift-Aware Adaptation of Image Diffusion Models for Text-guided Video Editing" Paper

https://save-textguidedvideoediting.github.io/

diffusion-models generative-ai text-to-image-generation text-to-video-generation video-editing

MIT License

Languages

Language:Python 100.0%