0iui0's repositories
OpenSplat
Free and open source 3D gaussian splatting in C++ 💦
StableVITON
[CVPR2024] StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On
pl2map
Representing 3D sparse map points and lines for camera relocalization
GaussianEditor
[CVPR 2024] GaussianEditor: Swift and Controllable 3D Editing with Gaussian Splatting
ollama
Get up and running with Llama 2, Mistral, Gemma, and other large language models.
YOLOPoint
Joint Keypoint and Object Detection
gaussian-head
Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'
OmniLMM
Large Multi-modal Models for Strong Performance and Efficient Deployment
MiniCPM
MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.
sam-pt
SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.
AGI-Samantha
AGI has been achieved externally
vision_msgs
Algorithm-agnostic computer vision message types for ROS.
YOLOv8-TensorRT
YOLOv8 accelerated with TensorRT!
SwiftInfer
Efficient AI Inference & Serving
OVSG
[CoRL2023] Open-Vocabulary Scene-Graph
magic-animate
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
Osprey
The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"
CogVLM
A state-of-the-art-level open visual language model | Multimodal pre-trained model
SuGaR
Official implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering
DemoFusion
Let us democratise high-resolution generation! (arXiv 2023)
EgoThink
The official code and data for paper "Can Vision-Language Models Think from a First-Person Perspective?"
zed-open-capture
Low-level camera driver for the ZED stereo camera family. API docs available here:
dm-vio
Source code for the paper DM-VIO: Delayed Marginalization Visual-Inertial Odometry
SAM-Graph
Code for "SAM-guided Graph Cut for 3D Instance Segmentation"
Instant-angelo
Instant-angelo: Build a high-fidelity digital twin within 20 minutes!