Chris Street's starred repositories
Paints-UNDO
Understand Human Behavior to Align True Needs
tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
Scene-Change-Detection
A video scene detection algorithm that detects the different scenes within a video. A scene is a series of logically and chronologically related shots taken in a specific order to depict an overarching concept or story.
SimpleTuner
A general fine-tuning kit geared toward Stable Diffusion 2.1, Stable Diffusion 3, DeepFloyd, and SDXL.
PixArt-sigma
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
simple_transformer
Simple Transformer in JAX
IQA-PyTorch
👁️ 🖼️ 🔥 PyTorch Toolbox for Image Quality Assessment, including LPIPS, FID, NIQE, NRQM (Ma), MUSIQ, TOPIQ, NIMA, DBCNN, BRISQUE, PI and more...
CrossAttentionControl
Unofficial implementation of "Prompt-to-Prompt Image Editing with Cross Attention Control" with Stable Diffusion
ToonCrafter
A research paper for generative cartoon interpolation
image-textualization
Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions
stable-audio-tools
Generative models for conditional audio generation