MingzhouZhang's starred repositories

Language:PythonLicense:Apache-2.0Stargazers:722Issues:0Issues:0
Language:PythonLicense:MITStargazers:333Issues:0Issues:0

PuLID

Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Language:PythonLicense:Apache-2.0Stargazers:1033Issues:0Issues:0

CosmicMan

CosmicMan: A Text-to-Image Foundation Model for Humans (CVPR 2024)

Language:PythonStargazers:299Issues:0Issues:0

Diffusion-Models-Papers-Survey-Taxonomy

Diffusion model papers, survey, and taxonomy

Stargazers:2852Issues:0Issues:0

minRF

Minimal implementation of scalable rectified flow transformers, based on SD3's approach

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:344Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:7086Issues:0Issues:0

modelscope-agent

ModelScope-Agent: An agent framework connecting models in ModelScope with the world

Language:PythonLicense:Apache-2.0Stargazers:2509Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:3158Issues:0Issues:0

LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Language:PythonLicense:MITStargazers:297Issues:0Issues:0

awesome-text-to-image-studies

A collection of awesome text-to-image generation studies.

Language:TeXLicense:MITStargazers:297Issues:0Issues:0

pytorch_distribute_tutorials

pytorch distribute tutorials

Language:Jupyter NotebookStargazers:56Issues:0Issues:0
Language:Jupyter NotebookStargazers:42Issues:0Issues:0

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1404Issues:0Issues:0

LayerDiffuse

Transparent Image Layer Diffusion using Latent Transparency

License:Apache-2.0Stargazers:1947Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1570Issues:0Issues:0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookStargazers:1632Issues:0Issues:0

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Stargazers:1047Issues:0Issues:0

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language:PythonLicense:Apache-2.0Stargazers:10707Issues:0Issues:0

automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models

Language:PythonLicense:AGPL-3.0Stargazers:5400Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:9068Issues:0Issues:0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2627Issues:0Issues:0
License:Apache-2.0Stargazers:1528Issues:0Issues:0

KandinskyVideo

KandinskyVideo — multilingual end-to-end text2video latent diffusion model

Language:PythonLicense:Apache-2.0Stargazers:161Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:293Issues:0Issues:0

open_clip

An open source implementation of CLIP.

Language:PythonLicense:NOASSERTIONStargazers:9540Issues:0Issues:0

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonLicense:NOASSERTIONStargazers:1142Issues:0Issues:0

deepface

A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python

Language:PythonLicense:MITStargazers:11353Issues:0Issues:0

adetailer

Auto detecting, masking and inpainting with detection model.

Language:PythonLicense:AGPL-3.0Stargazers:4022Issues:0Issues:0

Fooocus

Focus on prompting and generating

Language:PythonLicense:GPL-3.0Stargazers:39339Issues:0Issues:0