signal processing fan's repositories
SplattingAvatar
[CVPR2024] Official implementation of SplattingAvatar.
AnimateAnyone-unofficial
Unofficial Implementation of Animate Anyone
av3d
https://aku02.github.io/projects/avatarone/
codellama
Inference code for CodeLlama models
DECO
Official PyTorch implementation of "DECO: Query-Based End-to-End Object Detection with ConvNets"
diff_instruct
Official code for the Diff-Instruct algorithm for one-step diffusion distillation
DM-NonUniform
Official code for Accelerating Diffusion Sampling with Optimized Time Steps (CVPR 2024)
dmd
PyTorch implementation of One-step Diffusion with Distribution Matching Distillation
DPE
[CVPR 2023] DPE: Disentanglement of Pose and Expression for General Video Portrait Editing
FlashAvatar-code
[CVPR 2024] The official repo for FlashAvatar
GaussianAvatar
[CVPR 2024] The official repo for "GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians"
GPS-Gaussian
[CVPR 2024] The official repo for “GPS-Gaussian: Generalizable Pixel-wise 3D Gaussian Splatting for Real-time Human Novel View Synthesis”
GPT-SoVITS
One minute of voice data is enough to train a good TTS model! (few-shot voice cloning)
gpupixel
Cross-Platform AI Beauty Effects Library, Achieving Commercial-Grade Beauty Effects. Written in C++11, Based on OpenGL/ES and VNN.
hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
InCTRL
Official implementation of CVPR'24 paper 'Toward Generalist Anomaly Detection via In-context Residual Learning with Few-shot Sample Prompts'.
Kalman-and-Bayesian-Filters-in-Python
Kalman Filter book using Jupyter Notebook. Focuses on building intuition and experience, not formal proofs. Includes Kalman filters, extended Kalman filters, unscented Kalman filters, particle filters, and more. All exercises include solutions.
latent-consistency-model
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
MagicDance
MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer
nanoGPT
The simplest, fastest repository for training/finetuning medium-sized GPTs.
Open-GroundingDino
This is a third-party implementation of the paper Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection.
SingDiffusion
[CVPR 2024] Tackling the Singularities at the Endpoints of Time Intervals in Diffusion Models
starcoder2
Home of StarCoder2!
street-gaussians-ns
Unofficial implementation of "Street Gaussians for Modeling Dynamic Urban Scenes"
vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection