chenlin9's starred repositories
PhotoMaker
PhotoMaker
datasciencecoursera
For the Data Science class on Coursera
videocomposer
Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability
generative-models
Generative Models by Stability AI
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Medium-Articles-Notebooks
Medium Articles Notebooks and Media Files
clip-retrieval
Easily compute clip embeddings and build a clip retrieval system with them
imagecorruptions
Python package to corrupt arbitrary images.
T2I-Adapter
T2I-Adapter
so-vits-svc
SoftVC VITS Singing Voice Conversion
MultiDiffusion
Official PyTorch implementation of "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" (ICML 2023)
watermark-removal
A machine learning image inpainting task that removes watermarks from images, producing results indistinguishable from the ground truth image
watermark-removal
Removes watermarks from videos using a watermark subtraction method; fast but not perfect
Reflected-Diffusion
[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
sd-webui-text2video
Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies
Text-To-Video-Finetuning
Finetune ModelScope's Text To Video model using Diffusers 🧨
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
frozen-in-time
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Multilingual_Text_to_Speech
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.