vincentliuheyang's starred repositories
ultralytics
NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite
embedchain
Memory for AI agents
stable-dreamfusion
Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
threestudio
A unified framework for 3D content generation.
ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
mmdetection3d
OpenMMLab's next-generation platform for general 3D object detection.
DiffusionDet
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)
StableVideo
[ICCV 2023] StableVideo: Text-driven Consistency-aware Diffusion Video Editing
Versatile-Diffusion
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023
Tracking-Anything-with-DEVA
[ICCV 2023] Tracking Anything with Decoupled Video Segmentation
ControlNet-for-Diffusers
Transfer the ControlNet with any basemodel in diffusers🔥
rich-text-to-image
Rich-Text-to-Image Generation
latent-nerf
Official Implementation for "Latent-NeRF for Shape-Guided Generation of 3D Shapes and Textures"
Anti-DreamBooth
Anti-DreamBooth: Protecting users from personalized text-to-image synthesis (ICCV 2023)
Speech2Lip
[ICCV2023] Speech2Lip: High-fidelity Speech to Lip Generation by Learning from a Short Video