PeterYoung's starred repositories

titok-pytorch

Implementation of TiTok, proposed by Bytedance in "An Image is Worth 32 Tokens for Reconstruction and Generation"

Language:PythonLicense:MITStargazers:112Issues:0Issues:0

AutoStudio

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Language:Jupyter NotebookStargazers:92Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:225Issues:0Issues:0

BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.

Language:PythonLicense:Apache-2.0Stargazers:6398Issues:0Issues:0

CleanDiffuser

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Language:PythonLicense:Apache-2.0Stargazers:121Issues:0Issues:0

magvit

Official JAX implementation of MAGVIT: Masked Generative Video Transformer

Language:PythonLicense:Apache-2.0Stargazers:884Issues:0Issues:0

textgrad

Automatic ''Differentiation'' via Text -- using large language models to backpropagate textual gradients.

Language:PythonLicense:MITStargazers:539Issues:0Issues:0

BERT-pytorch

Google AI 2018 BERT pytorch implementation

Language:PythonLicense:Apache-2.0Stargazers:6058Issues:0Issues:0

vision-agent

Vision agent

Language:PythonLicense:Apache-2.0Stargazers:727Issues:0Issues:0

CustomTkinter

A modern and customizable python UI-library based on Tkinter

Language:PythonLicense:MITStargazers:10670Issues:0Issues:0

pytorch-lightning

Pretrain, finetune and deploy AI models on multiple GPUs, TPUs with zero code changes.

Language:PythonLicense:Apache-2.0Stargazers:27363Issues:0Issues:0

Glyph-ByT5

This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Multilingual Visual Text Rendering""

Language:Jupyter NotebookStargazers:300Issues:0Issues:0
Language:PythonStargazers:23Issues:0Issues:0

wtfpython

What the f*ck Python? 😱

Language:PythonLicense:WTFPLStargazers:35434Issues:0Issues:0

hallo

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Language:PythonLicense:MITStargazers:3935Issues:0Issues:0

MuseTalk

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Language:PythonLicense:NOASSERTIONStargazers:1703Issues:0Issues:0

bsq-vit

[BSQ-ViT] Image and Video Tokenization with Binary Spherical Quantization

Language:PythonLicense:MITStargazers:45Issues:0Issues:0

mdlm

Simplified Masked Diffusion Language Model

Language:PythonLicense:Apache-2.0Stargazers:79Issues:0Issues:0

image-textualization

Image Textualization: An Automatic Framework for Generating Rich and Detailed Image Descriptions

Language:PythonStargazers:50Issues:0Issues:0

AsyncDiff

Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0

LibriTTS-P

LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning

Stargazers:91Issues:0Issues:0

Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

Language:PythonLicense:MITStargazers:20486Issues:0Issues:0
Language:PythonLicense:MITStargazers:181Issues:0Issues:0

omniglue

Code release for CVPR'24 submission 'OmniGlue'

Language:PythonLicense:Apache-2.0Stargazers:417Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Language:PythonLicense:BSD-3-ClauseStargazers:26775Issues:0Issues:0

MQT-LLaVA

Matryoshka Query Transformer for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:69Issues:0Issues:0

cobalt

save what you love

Language:JavaScriptLicense:AGPL-3.0Stargazers:9728Issues:0Issues:0

LlamaGen

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Language:PythonLicense:MITStargazers:794Issues:0Issues:0

BIRD

This is the official implementation of "Blind Image Restoration via Fast Diffusion Inversion"

Language:PythonStargazers:206Issues:0Issues:0

flash-diffusion

Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation

Language:PythonLicense:NOASSERTIONStargazers:235Issues:0Issues:0