yapengyu's starred repositories

vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

Language:PythonLicense:MITStargazers:19915Issues:152Issues:265

sd-webui-controlnet

WebUI extension for ControlNet

Language:PythonLicense:GPL-3.0Stargazers:16893Issues:146Issues:1503

ml-stable-diffusion

Stable Diffusion with Core ML on Apple Silicon

Language:PythonLicense:MITStargazers:16740Issues:137Issues:246

Swin-Transformer

This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".

Language:PythonLicense:MITStargazers:13674Issues:127Issues:312

libfacedetection

An open source library for face detection in images. The face detection speed can reach 1000FPS.

Language:C++License:NOASSERTIONStargazers:12263Issues:531Issues:318

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11292Issues:160Issues:301
Language:PythonLicense:Apache-2.0Stargazers:9431Issues:91Issues:1999

gemma.cpp

lightweight, standalone C++ inference engine for Google's Gemma models.

Language:C++License:Apache-2.0Stargazers:5939Issues:40Issues:86

SwinIR

SwinIR: Image Restoration Using Swin Transformer (official repository)

Language:PythonLicense:Apache-2.0Stargazers:4371Issues:52Issues:148

latent-consistency-model

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Language:PythonLicense:MITStargazers:4305Issues:62Issues:94

photo2cartoon

人像卡通化探索项目 (photo-to-cartoon translation project)

Language:PythonLicense:MITStargazers:3949Issues:83Issues:72

PixArt-alpha

PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Language:PythonLicense:AGPL-3.0Stargazers:2720Issues:47Issues:0

Pytorch_Retinaface

Retinaface get 80.99% in widerface hard val using mobilenet0.25.

Language:PythonLicense:MITStargazers:2600Issues:42Issues:199

CCPD

[ECCV 2018] CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Language:PythonLicense:MITStargazers:2225Issues:64Issues:108

FaceX-Zoo

A PyTorch Toolbox for Face Recognition

Language:PythonLicense:NOASSERTIONStargazers:1875Issues:41Issues:158

tmux-config

:green_book: Example tmux configuration - screen + vim key-bindings, system stat, cpu load bar.

onediff

OneDiff: An out-of-the-box acceleration library for diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1614Issues:40Issues:433

Barbershop

Barbershop: GAN-based Image Compositing using Segmentation Masks (SIGGRAPH Asia 2021)

Language:PythonLicense:MITStargazers:1328Issues:65Issues:65

awesome-diffusion-categorized

collection of diffusion model papers categorized by their subareas

Generative-AI

[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era

distill-sd

Segmind Distilled diffusion

Language:PythonLicense:NOASSERTIONStargazers:558Issues:17Issues:16

Rotate-and-Render

Code for Rotate-and-Render: Unsupervised Photorealistic Face Rotation from Single-View Images (CVPR 2020)

Language:PythonLicense:CC-BY-4.0Stargazers:488Issues:16Issues:55

Multi-LoRA-Composition

Repository for the Paper "Multi-LoRA Composition for Image Generation"

iCartoonFace

iCartoonFace dataset, and baseline approaches, the project is supported by iQIYI

HairCLIPv2

[ICCV 2023] HairCLIPv2: Unifying Hair Editing via Proxy Feature Blending

Language:Jupyter NotebookStargazers:182Issues:15Issues:15
Language:PythonLicense:Apache-2.0Stargazers:106Issues:9Issues:2

CCPD2COCO

Convert CCPD to COCO format, including bounding box, segmentation mask, segmentation map.