chenlin9

followers

following

stars

chenlin9's starred repositories

PhotoMaker

PhotoMaker

Language:Jupyter NotebookNOASSERTION869000

datasciencecoursera

for Data Science class on Coursera

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language:PythonMPL-2.03211600

videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Language:PythonMIT86300

I2VGen-XL-colab

Language:Jupyter Notebook11300

generative-models

Generative Models by Stability AI

Language:PythonMIT2339000

prompt-to-prompt

Language:Jupyter NotebookApache-2.0297800

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language:PythonBSD-3-Clause261300

Medium-Articles-Notebooks

Medium Articles Notebooks and Media Files

Language:Jupyter Notebook1500

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language:Jupyter NotebookMIT228000

imagecorruptions

Python package to corrupt arbitrary images.

Language:PythonApache-2.038900

T2I-Adapter

T2I-Adapter

Language:Python332100

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonAGPL-3.02484800

MultiDiffusion

Official Pytorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language:Jupyter Notebook94500

lambda-diffusers

Language:Jupyter NotebookMIT54800

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonUnlicense7731100

watermark-removal

a machine learning image inpainting task that instinctively removes watermarks from image indistinguishable from the ground truth image

Language:Python174900

watermark-removal

通过水印减除方法去掉视频中的水印，快速但不完美

Language:Python31000

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Language:PythonMIT15200

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.04567900

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonNOASSERTION435500

sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Language:PythonNOASSERTION127300

Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language:PythonMIT64300

webvid

Large-scale text-video dataset. 10 million captioned short videos.

Language:Python55300

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language:PythonApache-2.0415500

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language:PythonMIT33900

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonNOASSERTION392600

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language:PythonMIT3770700

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language:PythonMIT81800

HanTTS

Chinese Text-to-Speech web service

Language:PythonMIT30900