Z-L-D's starred repositories

taggui

Tag manager and captioner for image datasets

Language: Python | License: GPL-3.0 | Stargazers: 523 | Issues: 0

airgen

Official source code of airsep

Language: Python | License: MIT | Stargazers: 29 | Issues: 0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language: Python | License: MIT | Stargazers: 20284 | Issues: 0

De-limiter

An official repository of "Music De-limiter Networks via Sample-wise Gain Inversion", which will be presented in WASPAA 2023.

Language: Python | License: MIT | Stargazers: 64 | Issues: 0

Stable-Diffusion

Stable Diffusion, SDXL, LoRA Training, DreamBooth Training, Automatic1111 Web UI, DeepFake, Deep Fakes, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, ComfyUI, Google Colab, RunPod, NoteBooks, ControlNet, TTS, Voice Cloning, AI, AI News, ML, ML News, News, Tech, Tech News, Kohya LoRA, Kandinsky 2, DeepFloyd IF, Midjourney

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 1907 | Issues: 0

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language: Python | License: AGPL-3.0 | Stargazers: 2575 | Issues: 0

PixArt-sigma

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Language: Python | License: AGPL-3.0 | Stargazers: 1496 | Issues: 0

rich-text-to-image

Rich-Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 748 | Issues: 0

ELLA

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

Language: Python | License: Apache-2.0 | Stargazers: 1005 | Issues: 0

LaVi-Bridge

[ECCV 2024] Bridging Different Language Models and Generative Vision Models for Text-to-Image Generation

Language: Python | License: MIT | Stargazers: 294 | Issues: 0

DiLightNet

Official code release for [SIGGRAPH 2024] DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation

Language: Python | License: MIT | Stargazers: 57 | Issues: 0

StabilityMatrix

Multi-Platform Package Manager for Stable Diffusion

Language: C# | License: AGPL-3.0 | Stargazers: 3459 | Issues: 0

RaDe-GS

RaDe-GS: Rasterizing Depth in Gaussian Splatting

Language: C++ | License: NOASSERTION | Stargazers: 388 | Issues: 0

audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 686 | Issues: 0

part123

https://liuar0512.github.io/part123_official_page/

License: MIT | Stargazers: 30 | Issues: 0

stmc

Implementation of "Multi-Track Timeline Control for Text-Driven 3D Human Motion Generation" from CVPR Workshop on Human Motion Generation 2024.

Language: Python | License: NOASSERTION | Stargazers: 67 | Issues: 0

BlockFusion

[TOG 2024] BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation

Stargazers: 16 | Issues: 0

Unique3D

Official implementation of Unique3D: High-Quality and Efficient 3D Mesh Generation from a Single Image

Language: Python | License: MIT | Stargazers: 2544 | Issues: 0

MotionDreamer

MotionDreamer: Zero-Shot 3D Mesh Animation from Video Diffusion Models

Stargazers: 15 | Issues: 0

GaussianPrediction

[SIGGRAPH Conference 2024] GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis

Stargazers: 36 | Issues: 0

ComfyUI-DynamiCrafterWrapper

Wrapper to use DynamiCrafter models in ComfyUI

Language: Python | License: NOASSERTION | Stargazers: 538 | Issues: 0

Omost

Your image is almost there!

Language: Python | License: Apache-2.0 | Stargazers: 6947 | Issues: 0

MusePose

MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation

Language: Python | License: NOASSERTION | Stargazers: 1958 | Issues: 0

ComfyUI-FlashFace

ComfyUI Node for FlashFace

Language: Python | License: MIT | Stargazers: 41 | Issues: 0

threefiner

An interface for text-guided mesh refinement.

Language: Python | License: Apache-2.0 | Stargazers: 161 | Issues: 0

Analogist

Analogist: Out-of-the-box Visual In-Context Learning with Image Diffusion Model (SIGGRAPH 2024)

Language: Python | License: MIT | Stargazers: 28 | Issues: 0