diffusion-models

There are 57 repositories under the diffusion-models topic.

diff-usion / Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
artificial-intelligence diffusion-models generative-model machine-learning score-based score-matching
Language:HTML 11629
Tencent / HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
diffusion-models diffusion-transformer video-generation
Language:Python 9601
openvinotoolkit / openvino
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
inference deep-learning openvino ai computer-vision diffusion-models generative-ai llm-inference natural-language-processing nlp performance-boost speech-recognition stable-diffusion deploy-ai optimize-ai transformers yolo recommendation-system good-first-issue
Language:C++ 8821
FoundationVision / VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
auto-regressive-model diffusion-models image-generation transformers autoregressive-models generative-ai generative-model gpt gpt-2 large-language-models vision-transformer neurips
Language:Jupyter Notebook 8393
Tencent / Hunyuan3D-2
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
3d 3d-aigc 3d-generation diffusion-models hunyuan3d image-to-3d shape shape-generation text-to-3d texture-generation
Language:Python 8373
open-mmlab / mmagic
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awesome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
aigc computer-vision deep-learning diffusion diffusion-models generative-adversarial-network generative-ai image-editing image-generation image-processing image-synthesis inpainting matting pytorch super-resolution text2image video-frame-interpolation video-interpolation video-super-resolution
Language:Jupyter Notebook 7264
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
adversarial-training deep-learning diffusion-models gan latent-diffusion latent-diffusion-models pytorch speaker-adaptation speech-synthesis text-to-speech tts wavlm
Language:Python 5962
Fanghua-Yu / SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
deep-learning diffusion-models llava sdxl stable-diffusion super-resolution restoration pytorch pytorch-lightning
Language:Python 5230
showlab / Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, and various other applications.
awesome diffusion-models motion-customization video-editing video-generation video-generation-evaluation
4273
TingsongYu / PyTorch-Tutorial-2nd
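PyTorch Practical Tutorial (2nd Edition): whether you are starting from zero, applying PyTorch to CV, NLP, or LLM projects, or moving on to production engineering and deployment, it is all covered here. With the help of this book, readers should be able to master PyTorch with ease and become excellent deep learning engineers.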
computer-vision deepsort diffusion-models onnx pytorch pytorch-tutorial tensorrt yolov5 llm qwen
Language:Jupyter Notebook 4043
bytedance / LatentSync
Taming Stable Diffusion for Lip Sync!
diffusion-models lipsync research
Language:Python 3570
Lightricks / LTX-Video
Official repository for LTX-Video
diffusion-models dit image-to-video image-to-video-generation text-to-video text-to-video-generation
Language:Python 3291
YangLing0818 / Diffusion-Models-Papers-Survey-Taxonomy
Diffusion model papers, survey, and taxonomy
diffusion-models stable-diffusion survey text-to-3d text-to-image text-to-video
3157
ali-vilab / VGen
Official repo for VGen: a holistic video generation ecosystem built on diffusion models
diffusion-models video-synthesis
Language:Python 3096
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
automatic-speech-recognition papers roadmap rnn cnn dnn attention-mechanism seq2seq acoustic-model timit-dataset tts language-model speaker-verification speech-recognition speech-synthesis neural-network recognition-synthesis diffusion-models singing-voice-synthesis voice-conversion
3068
deepseek-ai / DreamCraft3D
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
3d-generation aigc diffusion-models generative-model image-to-3d 3d-creation
Language:Python 2969
jy0205 / Pyramid-Flow
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
diffusion-models flow-matching video-generation
Language:Python 2889
bghira / SimpleTuner
A general fine-tuning kit geared toward diffusion models.
diffusers diffusion-models fine-tuning flux-dev machine-learning stable-diffusion
Language:Python 2532
Tencent / MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
diffusion-models video-generation
Language:Python 2307
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc transformer diffusion-models diffusion diffusion-model diffusion-transformer generation-models transformers
Language:Python 2221
andreas128 / RePaint
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
cvpr2022 diffusion-models inpainting
Language:Python 2176
ChenHsing / Awesome-Video-Diffusion-Models
[CSUR] A Survey on Video Diffusion Models
awesome awesome-list diffusion diffusion-models survey text-to-video video video-diffusion video-diffusion-model video-editing
2055
open-mmlab / mmgeneration
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
diffusion-models gan generative generative-adversarial-network mmcv openmmlab pytorch
Language:Python 1970
SUDO-AI-3D / zero123plus
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
3d 3d-graphics aigc diffusers diffusion-models image-to-3d research-project text-to-3d
Language:Python 1949
adobe-research / custom-diffusion
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
computer-vision customization diffusion-models few-shot fine-tuning pytorch text-to-image-generation
Language:Python 1933
amirhossein-kz / Awesome-Diffusion-Models-in-Medical-Imaging
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
ddpm deep-learning denoising diffusion diffusion-models generation generative-models machine-learning medical-imaging ncsn reconstruction score-based score-matching sde segmentation vae
1922
yang-song / score_sde_pytorch
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
pytorch stochastic-differential-equations inverse-problems generative-models score-matching score-based-generative-modeling controllable-generation iclr-2021 diffusion-models
Language:Jupyter Notebook 1884
eloialonso / diamond
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
atari deep-learning diffusion-models machine-learning reinforcement-learning research world-models artificial-intelligence
Language:Python 1865
siliconflow / onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
aigc-serving comfyui comfyui-workflow cuda diffusers diffusion-models inference-engine lcm lcm-lora lora performance-optimization pytorch sd-webui sdxl sdxl-turbo stable-diffusion stable-video-diffusion
Language:Jupyter Notebook 1861
junshutang / Make-It-3D
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
3d-generation 3d-vision computer-vision deep-learning diffusion-models generative-art nerf
Language:Python 1842
hymie122 / RAG-Survey
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
aigc rag survey diffusion-models llm multimodality
1729
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
auto-regressive-model diffusion diffusion-models image-generation llama llm text2image
Language:Python 1682
LuChengTHU / dpm-solver
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (NeurIPS 2022 Oral)
diffusion-models machine-learning score-based-generative-models stable-diffusion
Language:Python 1657
wangkai930418 / awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
continual-learning controlnet detection diffusion diffusion-model diffusion-models few-shot image-edit inpainting inversion segmentation stable-diffusion text-guided tracking
1646
yang-song / score_sde
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
controllable-generation diffusion-models flax generative-models iclr-2021 inverse-problems jax score-based-generative-modeling score-matching stochastic-differential-equations
Language:Jupyter Notebook 1618
guochengqian / Magic123
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
3d-generation diffusion-models image-to-3d
Language:Jupyter Notebook 1574
