chenlin9's starred repositories

PhotoMaker

PhotoMaker

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 8690 | Issues: 0

datasciencecoursera

for Data Science class on Coursera

Stargazers: 392 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 32116 | Issues: 0

videocomposer

Official repo for VideoComposer: Compositional Video Synthesis with Motion Controllability

Language: Python | License: MIT | Stargazers: 863 | Issues: 0

Language: Jupyter Notebook | Stargazers: 113 | Issues: 0

generative-models

Generative Models by Stability AI

Language: Python | License: MIT | Stargazers: 23390 | Issues: 0

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 2978 | Issues: 0

Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 2613 | Issues: 0

Medium-Articles-Notebooks

Medium Articles Notebooks and Media Files

Language: Jupyter Notebook | Stargazers: 15 | Issues: 0

clip-retrieval

Easily compute clip embeddings and build a clip retrieval system with them

Language: Jupyter Notebook | License: MIT | Stargazers: 2280 | Issues: 0

imagecorruptions

Python package to corrupt arbitrary images.

Language: Python | License: Apache-2.0 | Stargazers: 389 | Issues: 0

T2I-Adapter

T2I-Adapter

Language: Python | Stargazers: 3321 | Issues: 0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python | License: AGPL-3.0 | Stargazers: 24848 | Issues: 0

MultiDiffusion

Official PyTorch Implementation for "MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation" presenting "MultiDiffusion" (ICML 2023)

Language: Jupyter Notebook | Stargazers: 945 | Issues: 0

Language: Jupyter Notebook | License: MIT | Stargazers: 548 | Issues: 0

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python | License: Unlicense | Stargazers: 77311 | Issues: 0

watermark-removal

A machine learning image inpainting task that instinctively removes watermarks from images, producing results indistinguishable from the ground-truth image

Language: Python | Stargazers: 1749 | Issues: 0

watermark-removal

Removes watermarks from videos using a watermark subtraction method; fast but imperfect

Language: Python | Stargazers: 310 | Issues: 0

Reflected-Diffusion

[ICML 2023] Reflected Diffusion Models (https://arxiv.org/abs/2304.04740)

Language: Python | License: MIT | Stargazers: 152 | Issues: 0

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 45679 | Issues: 0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language: Python | License: NOASSERTION | Stargazers: 4355 | Issues: 0

sd-webui-text2video

Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies

Language: Python | License: NOASSERTION | Stargazers: 1273 | Issues: 0

Text-To-Video-Finetuning

Finetune ModelScope's Text To Video model using Diffusers 🧨

Language: Python | License: MIT | Stargazers: 643 | Issues: 0

webvid

Large-scale text-video dataset. 10 million captioned short videos.

Language: Python | Stargazers: 553 | Issues: 0

Tune-A-Video

[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation

Language: Python | License: Apache-2.0 | Stargazers: 4155 | Issues: 0

frozen-in-time

Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]

Language: Python | License: MIT | Stargazers: 339 | Issues: 0

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language: Python | License: NOASSERTION | Stargazers: 3926 | Issues: 0

stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Language: Python | License: MIT | Stargazers: 37707 | Issues: 0

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language: Python | License: MIT | Stargazers: 818 | Issues: 0

HanTTS

Chinese Text-to-Speech web service

Language: Python | License: MIT | Stargazers: 309 | Issues: 0