There are 57 repositories under diffusion-models topic.
A collection of resources and papers on Diffusion Models
HunyuanVideo: A Systematic Framework For Large Video Generation Model
OpenVINO™ is an open source toolkit for optimizing and deploying AI inference
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
High-Resolution 3D Assets Generation with Large Scale Hunyuan3D Diffusion Models.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild. Our new online demo is also released at suppixel.ai.
A curated list of recent diffusion models for video generation, editing, and various other applications.
《Pytorch实用教程》(第二版)无论是零基础入门,还是CV、NLP、LLM项目应用,或是进阶工程化部署落地,在这里都有。相信在本书的帮助下,读者将能够轻松掌握 PyTorch 的使用,成为一名优秀的深度学习工程师。
Official repository for LTX-Video
Diffusion model papers, survey, and taxonomy
Official repo for VGen: a holistic video generation ecosystem for video generation building on diffusion models
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
[ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
A general fine-tuning kit geared toward diffusion models.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Lumina-T2X is a unified framework for Text to Any Modality Generation
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
[CSUR] A Survey on Video Diffusion Models
MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.
Code repository for Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model.
Custom Diffusion: Multi-Concept Customization of Text-to-Image Diffusion (CVPR 2023)
Diffusion Models in Medical Imaging (Published in Medical Image Analysis Journal)
PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 Spotlight.
OneDiff: An out-of-the-box acceleration library for diffusion models.
[ICCV 2023] Make-It-3D: High-Fidelity 3D Creation from A Single Image with Diffusion Prior
Collecting awesome papers of RAG for AIGC. We propose a taxonomy of RAG foundations, enhancements, and applications in paper "Retrieval-Augmented Generation for AI-Generated Content: A Survey".
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Official code for "DPM-Solver: A Fast ODE Solver for Diffusion Probabilistic Model Sampling in Around 10 Steps" (Neurips 2022 Oral)
collection of diffusion model papers categorized by their subareas
Official code for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)
[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors