Zhendong Wang (ZhendongWang6)

ZhendongWang6

Geek Repo

Company:University of Science and Technology of China (USTC)

Location:Hefei, China

Home Page:https://zhendongwang6.github.io/

Github PK Tool:Github PK Tool

Zhendong Wang's starred repositories

diffusion-forcing

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Language:PythonLicense:MITStargazers:318Issues:0Issues:0

Kolors

Kolors Team

Language:PythonLicense:Apache-2.0Stargazers:2398Issues:0Issues:0

Awesome-Text-to-Image

(ෆ`꒳´ෆ) A Survey on Text-to-Image Generation/Synthesis.

License:MITStargazers:1993Issues:0Issues:0

FastV

[ECCV 2024] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models

Language:PythonStargazers:169Issues:0Issues:0

EG4D

Official implementation of EG4D: Explicit Generation of 4D Object without Score Distillation

Stargazers:17Issues:0Issues:0

Lumina-T2X

Lumina-T2X is a unified framework for Text to Any Modality Generation

Language:PythonLicense:MITStargazers:1896Issues:0Issues:0

DeepSeek-V2

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

License:MITStargazers:2977Issues:0Issues:0

InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Language:PythonLicense:Apache-2.0Stargazers:436Issues:0Issues:0

RPG-DiffusionMaster

[ICML 2024] Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs (PRG)

Language:Jupyter NotebookStargazers:1601Issues:0Issues:0

DiffusionDPO

Code for "Diffusion Model Alignment Using Direct Preference Optimization"

Language:PythonLicense:Apache-2.0Stargazers:183Issues:0Issues:0

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

Language:PythonLicense:MITStargazers:3847Issues:0Issues:0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1498Issues:0Issues:0

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1344Issues:0Issues:0

GaussianCube

GaussianCube: A Structured and Explicit Radiance Representation for 3D Generative Modeling

Language:PythonStargazers:267Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:417Issues:0Issues:0

img2img-turbo

One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more

Language:PythonLicense:MITStargazers:1330Issues:0Issues:0

grok-1

Grok open release

Language:PythonLicense:Apache-2.0Stargazers:49179Issues:0Issues:0

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:PythonStargazers:693Issues:0Issues:0

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

Language:PythonLicense:Apache-2.0Stargazers:5652Issues:0Issues:0

sd-forge-layerdiffuse

[WIP] Layer Diffusion for WebUI (via Forge)

Language:PythonLicense:Apache-2.0Stargazers:3662Issues:0Issues:0

Awesome-Controllable-T2I-Diffusion-Models

A collection of resources on controllable generation with text-to-image diffusion models.

License:MITStargazers:764Issues:0Issues:0

FiT

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

License:Apache-2.0Stargazers:343Issues:0Issues:0

fast-DiT

Fast Diffusion Models with Transformers

Language:PythonLicense:NOASSERTIONStargazers:622Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5692Issues:0Issues:0

fastcomposer

FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention

Language:PythonLicense:MITStargazers:626Issues:0Issues:0

minSDXL

Huggingface-compatible SDXL Unet implementation that is readily hackable

Language:Jupyter NotebookStargazers:366Issues:0Issues:0
Language:PythonLicense:MITStargazers:2457Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1704Issues:0Issues:0

MaskTextSpotterV3

The code of "Mask TextSpotter v3: Segmentation Proposal Network for Robust Scene Text Spotting"

Language:PythonLicense:NOASSERTIONStargazers:618Issues:0Issues:0

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:PythonLicense:MITStargazers:3453Issues:0Issues:0