C0notSilly's starred repositories
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
ControlNet
Let us control diffusion models!
PhotoMaker
PhotoMaker
IP-Adapter
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with an image prompt.
T2I-Adapter
T2I-Adapter
Kandinsky-2
Kandinsky 2 — multilingual text2image latent diffusion model
chatgpt-prompts-for-academic-writing
This list of writing prompts covers a range of topics and tasks, including brainstorming research ideas, improving language and style, conducting literature reviews, and developing research plans.
InternLM-XComposer
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output
Collaborative-Diffusion
Collaborative Diffusion (CVPR 2023)
LLM-groundedDiffusion
LLM-grounded Diffusion (LMD): Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
CelebV-Text
(CVPR 2023) CelebV-Text: A Large-Scale Facial Text-Video Dataset
FaceStudio
Put Your Face Everywhere in Seconds.
HD-Painter
HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models
ME-GraphAU
[IJCAI 2022] Learning Multi-dimensional Edge Feature-based AU Relation Graph for Facial Action Unit Recognition, PyTorch code
Focus-on-Your-Instruction
[CVPR 2024] Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation
Training-Data-Synthesis
[ICLR 2024] Real-Fake: Effective Training Data Synthesis Through Distribution Matching
CelebA-Dialog
A large-scale visual-language face dataset with fine-grained annotations (ICCV 2021)
OpenGraphAU
A tool for facial action unit analysis