simongao's repositories
ControlNet_TensorRT
天池 NVIDIA TensorRT Hackathon 2023 —— 生成式AI模型优化赛 初赛第三名方案
distill-sd
Segmind Distilled diffusion
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
FastSDXL
An efficient implementation of Stable-Diffusion-XL
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
FreeDoM
[ICCV 2023] Official PyTorch implementation for the paper "FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model"
GPT-2v
Following master Karpathy with GPT-2 implementation and training, writing lots of comments cause I have memory of a goldfish
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
MimicBrush
Official implementations for paper: Zero-shot Image Editing with Reference Imitation
minSDXL
Huggingface-compatible SDXL Unet implementation that is readily hackable
ODConv
The official project website of "Omni-Dimensional Dynamic Convolution" (ODConv for short, spotlight in ICLR 2022).
OnnxStream
Running Stable Diffusion on a RPI Zero 2 (or in 260MB of RAM)
patch_conv
Patch convolution to avoid large GPU memory usage of Conv2D
sd-webui-fastblend
Make videos smooth!
SeeSR
[CVPR2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
stable-diffusion-webui
Stable Diffusion web UI
stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
tiny-llm-zh
从零实现一个小参数量中文大语言模型。
Tri-RMSNorm
Efficient kernel for RMS normalization with fused operations, includes both forward and backward passes, compatibility with PyTorch.
VideoFlow
Official implementation of ICCV2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
visual-concept-translator
Code of ICCV 2023 paper titled General Image-to-Image Translation with One-Shot Image Guidance
zest_code
This is the official implementation of ZeST