Pocky's starred repositories

chinese-programmer-wrong-pronunciation

Words that programmers often mispronounce

InstantID

InstantID : Zero-shot Identity-Preserving Generation in Seconds 🔥

Language: Python | License: Apache-2.0 | Stargazers: 10741 | Issues: 125 | Issues: 217

PhotoMaker

PhotoMaker [CVPR 2024]

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 9551 | Issues: 102 | Issues: 162

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language: Python | License: MIT | Stargazers: 9186 | Issues: 85 | Issues: 36

zotero-pdf-translate

Translate PDFs, EPUBs, webpages, metadata, annotations, and notes into the target language. Supports 20+ translation services.

Language: TypeScript | License: AGPL-3.0 | Stargazers: 7503 | Issues: 23 | Issues: 813

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 6583 | Issues: 57 | Issues: 154

IC-Light

More relighting!

Language: Python | License: Apache-2.0 | Stargazers: 5494 | Issues: 54 | Issues: 89

Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Language: Python | License: Apache-2.0 | Stargazers: 3177 | Issues: 37 | Issues: 152

Open-AnimateAnyone

Unofficial Implementation of Animate Anyone

MuseV

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Language: Python | License: NOASSERTION | Stargazers: 2460 | Issues: 34 | Issues: 116

Stable-Diffusion-WebUI-TensorRT

TensorRT Extension for Stable Diffusion Web UI

Language: Python | License: MIT | Stargazers: 1913 | Issues: 24 | Issues: 263

InstantStyle

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Language: Jupyter Notebook | Stargazers: 1559 | Issues: 21 | Issues: 36

EasyAnimate

📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion

Language: Python | License: Apache-2.0 | Stargazers: 1465 | Issues: 17 | Issues: 92

stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

Language: Python | License: MIT | Stargazers: 1185 | Issues: 17 | Issues: 126

masa

Official Implementation of CVPR24 highlight paper: Matching Anything by Segmenting Anything

Language: Python | License: Apache-2.0 | Stargazers: 1000 | Issues: 58 | Issues: 35

FollowYourClick

[arXiv 2024] Follow-Your-Click: This repo is the official implementation of "Follow-Your-Click: Open-domain Regional Image Animation via Short Prompts"

Language: Python | License: Apache-2.0 | Stargazers: 755 | Issues: 38 | Issues: 28

MistoLine

A Versatile and Robust SDXL-ControlNet Model for Adaptable Line Art Conditioning

T-GATE

T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!

Language: Python | License: MIT | Stargazers: 360 | Issues: 12 | Issues: 17

DEADiff

[CVPR 2024] Official implementation of "DEADiff: An Efficient Stylization Diffusion Model with Disentangled Representations"

Language: Python | License: Apache-2.0 | Stargazers: 227 | Issues: 11 | Issues: 17

PerspectiveFields

[CVPR 2023 Highlight] Perspective Fields for Single Image Camera Calibration

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 209 | Issues: 7 | Issues: 17

FineControlNet

Official PyTorch Implementation of "FineControlNet: Fine-level Text Control for Image Generation with Spatially Aligned Text Control Injection", 2023

Language: Python | License: NOASSERTION | Stargazers: 178 | Issues: 8 | Issues: 3

DSTA

This is the code of our paper "Video-Based Human Pose Regression via Decoupled Space-Time Aggregation".

Language: Python | License: Apache-2.0 | Stargazers: 122 | Issues: 2 | Issues: 5

3DHM

Synthesizing Moving People with 3D Control

RectifID

[NeurIPS 2024] RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance

Language: Jupyter Notebook | Stargazers: 108 | Issues: 3 | Issues: 7

bilibili-downloader

Bilibili video downloader; supports downloading premium-member 4K quality; continuously updated

Language: Python | License: MIT | Stargazers: 89 | Issues: 2 | Issues: 9

Paint-by-Inpaint

Paint by Inpaint: Learning to Add Image Objects by Removing Them First

Language: Python | License: MIT | Stargazers: 87 | Issues: 11 | Issues: 3