llllllllllllllllllll's starred repositories
richhf-18k
RichHF-18K dataset contains rich human feedback labels we collected for our CVPR'24 paper: https://arxiv.org/pdf/2312.10240, along with the file name of the associated labeled images (no urls or images are included in this dataset).
style-aligned
Official code for "Style Aligned Image Generation via Shared Attention"
stellar-dataset
Official Code for the dataset exploration of Stellar: Systematic Evaluation of Human-Centric Personalized Text-to-Image Methods
Awesome-Controllable-T2I-Diffusion-Models
A collection of resources on controllable generation with text-to-image diffusion models.
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
so-vits-svc
SoftVC VITS Singing Voice Conversion
LayerDiffuse
Transparent Image Layer Diffusion using Latent Transparency
AnimateDiff
Official implementation of AnimateDiff.
Tune-A-Video
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Awesome-Multimodal-Large-Language-Models
:sparkles::sparkles:Latest Advances on Multimodal Large Language Models
PhotoMaker
PhotoMaker
StyleDrop-PyTorch
This is an unofficial PyTorch implementation of StyleDrop: Text-to-Image Generation in Any Style.
ImageReward
[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation
DiffusionDisentanglement
Official implementation of the paper "Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models
generative-models
Generative Models by Stability AI