FishNotFish's starred repositories

ffhq-dataset

Flickr-Faces-HQ Dataset (FFHQ)

Language:PythonLicense:NOASSERTIONStargazers:3656Issues:0Issues:0

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonLicense:NOASSERTIONStargazers:424Issues:0Issues:0

U-ViT

A PyTorch implementation of the paper "All are Worth Words: A ViT Backbone for Diffusion Models".

Language:Jupyter NotebookLicense:MITStargazers:878Issues:0Issues:0

SpeeD

SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training

Language:PythonLicense:Apache-2.0Stargazers:142Issues:0Issues:0

CVPR-2023-24-Papers

CVPR 2023-2024 Papers: Dive into advanced research presented at the leading computer vision conference. Keep up to date with the latest developments in computer vision and deep learning. Code included. ⭐ support visual intelligence development!

Language:PythonLicense:MITStargazers:377Issues:0Issues:0
Language:PythonStargazers:69Issues:0Issues:0

webdataset

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Language:PythonLicense:BSD-3-ClauseStargazers:2180Issues:0Issues:0

consistencydecoder

Consistency Distilled Diff VAE

Language:PythonLicense:MITStargazers:2121Issues:0Issues:0

1d-tokenizer

This repo contains the code for our paper An Image is Worth 32 Tokens for Reconstruction and Generation

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:364Issues:0Issues:0

nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Language:PythonLicense:MITStargazers:35967Issues:0Issues:0

LLM101n

LLM101n: Let's build a Storyteller

Stargazers:27714Issues:0Issues:0

CV-VAE

CV-VAE: A Compatible Video VAE for Latent Generative Video Models

Language:Jupyter NotebookStargazers:198Issues:0Issues:0

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonLicense:NOASSERTIONStargazers:466Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:8283Issues:0Issues:0

improved_edm

Implementation of "Analyzing and Improving the Training Dynamics of Diffusion Models"

Language:PythonLicense:MITStargazers:87Issues:0Issues:0

diffusers_ddim_inversion

A simple example for using `DDIMInverseScheduler` for inverting an input image to StableDiffusion's latent space

Language:PythonLicense:CC0-1.0Stargazers:49Issues:0Issues:0

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1116Issues:0Issues:0

rfpp

The codebase of our paper "Improving the Training of Rectified Flows"

Language:PythonStargazers:57Issues:0Issues:0

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:PythonStargazers:782Issues:0Issues:0

VideoSys

VideoSys: An easy and efficient system for video generation

Language:PythonLicense:Apache-2.0Stargazers:1562Issues:0Issues:0

MiniCPM-V

MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone

Language:PythonLicense:Apache-2.0Stargazers:11289Issues:0Issues:0

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language:PythonLicense:Apache-2.0Stargazers:1091Issues:0Issues:0
Language:PythonLicense:AGPL-3.0Stargazers:6831Issues:0Issues:0

Omost

Your image is almost there!

Language:PythonLicense:Apache-2.0Stargazers:7149Issues:0Issues:0

HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Language:PythonLicense:NOASSERTIONStargazers:3239Issues:0Issues:0

ChatTTS

A generative speech model for daily dialogue.

Language:PythonLicense:AGPL-3.0Stargazers:30110Issues:0Issues:0

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4438Issues:0Issues:0

leetcode

🔥LeetCode solutions in any programming language | 多种编程语言实现 LeetCode、《剑指 Offer(第 2 版)》、《程序员面试金典(第 6 版)》题解

Language:JavaLicense:CC-BY-SA-4.0Stargazers:30589Issues:0Issues:0

ddpm-torch

Unofficial PyTorch Implementation of Denoising Diffusion Probabilistic Models (DDPM)

Language:PythonLicense:MITStargazers:177Issues:0Issues:0

diffusion

Denoising Diffusion Probabilistic Models

Language:PythonStargazers:3585Issues:0Issues:0