YANGTAO WANG's starred repositories
Paints-UNDO
Understand Human Behavior to Align True Needs
gaussian_splatting_notes
A detailed formulae explanation on gaussian splatting
Video-LLaMA
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Track-Anything
Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.
learning_research
ćś¬äşşçš„ç§‘ç ”ç»ŹéŞŚ
Segment-Anything-NeRF
Segment-anything interactively in NeRF.
Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Dreambooth-Stable-Diffusion
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
latent-stable-dreamfusion
A variant on ashawkey/stable-dreamfusion, operating in latent space
make-a-video-pytorch
Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch
nerfstudio
A collaboration friendly studio for NeRFs