艾梦's starred repositories
HeadStudio
HeadStudio: Text to Animatable Head Avatars with 3D Gaussian Splatting.
pixelsplat
[CVPR 2024 Oral] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruction" by David Charatan, Sizhe Lester Li, Andrea Tagliasacchi, and Vincent Sitzmann
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
DreamWaltz
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
AnimateLCM
AnimateLCM: Let's Accelerate the Video Generation within 4 Steps!
sdxl-koala
Compressing SDXL via knowledge distillation
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
ChatLM-mini-Chinese
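A small 0.2B Chinese dialogue model (ChatLM-Chinese-0.2B). All code for the full pipeline is open-sourced: dataset sources, data cleaning, tokenizer training, model pre-training, SFT instruction fine-tuning, and RLHF optimization. Supports SFT fine-tuning for downstream tasks, with a fine-tuning example for triple information extraction.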
I2V-Adapter-repo
I2V-Adapter: A General Image-to-Video Adapter for Video Diffusion Models
PhotoMaker
PhotoMaker
TachiyomiSY
Free and open source manga reader for Android
metahuman-stream
Real-time streaming digital human based on NeRF