yapengyu's starred repositories
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
sd-webui-controlnet
WebUI extension for ControlNet
ml-stable-diffusion
Stable Diffusion with Core ML on Apple Silicon
Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
libfacedetection
An open source library for face detection in images. The face detection speed can reach 1000FPS.
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
photo2cartoon
人像卡通化探索项目 (photo-to-cartoon translation project)
PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Pytorch_Retinaface
Retinaface get 80.99% in widerface hard val using mobilenet0.25.
tmux-config
:green_book: Example tmux configuration - screen + vim key-bindings, system stat, cpu load bar.
Barbershop
Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)
awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
Generative-AI
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
distill-sd
Segmind Distilled diffusion
Rotate-and-Render
Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020)
Multi-LoRA-Composition
Repository for the Paper "Multi-LoRA Composition for Image Generation"
iCartoonFace
iCartoonFace dataset, and baseline approaches, the project is supported by iQIYI
HairCLIPv2
[ICCV 2023] HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending