simongao's repositories
ControlNet_TensorRT
Tianchi NVIDIA TensorRT Hackathon 2023 (Generative AI Model Optimization Track): third-place solution in the preliminary round
genmoai_Mochi1
The best OSS video generation models
distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
EvTexture
[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution
FastSDXL
An efficient implementation of Stable-Diffusion-XL
flash-diffusion
Official implementation of ⚡ Flash Diffusion ⚡: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation
FreeDoM
[ICCV 2023] Official PyTorch implementation for the paper "FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model"
GPT-2v
Following master Karpathy's GPT-2 implementation and training, with lots of comments because I have the memory of a goldfish
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
img2img-turbo
One-step image-to-image with Stable Diffusion turbo: sketch2image, day2night, and more
In-Context-LoRA
Official repository of In-Context LoRA for Diffusion Transformers
LipSick
🤢 LipSick: Fast, High Quality, Low Resource Lipsync Tool 🤮
MimicBrush
Official implementation of the paper "Zero-shot Image Editing with Reference Imitation"
patch_conv
Patch convolution to avoid large GPU memory usage of Conv2D
Pyramid-Flow
Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
REPA
Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
sd-webui-fastblend
Make videos smooth!
SeeSR
[CVPR 2024] SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
stable-diffusion-webui
Stable Diffusion web UI
stable-fast
Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
tiny-llm-zh
Implementing a small-parameter Chinese large language model from scratch.
Tri-RMSNorm
Efficient kernel for RMS normalization with fused operations; includes both forward and backward passes and is compatible with PyTorch.
VideoFlow
Official implementation of ICCV 2023 VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation
zest_code
This is the official implementation of ZeST