0iui0

0iui0's repositories

dm-vio

Source code for the paper DM-VIO: Delayed Marginalization Visual-Inertial Odometry

Language:C++GPL-3.0000

AGI-Samantha

AGI has been achieved externally

MIT000

AnimateDiff

Official implementation of AnimateDiff.

Apache-2.0000

AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Language:PythonMIT000

CogVLM

a state-of-the-art-level open visual language model | 多模态预训练模型

NOASSERTION000

DemoFusion

Let us democratise high-resolution generation! (arXiv 2023)

000

Depth-Anything

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Language:PythonApache-2.0000

EgoThink

The official code and data for paper "Can Vision-Language Models Think from a First-Person Perspective?"

Apache-2.0000

embodied-generalist

Official code repository for 3D Embodied Generalist LEO

MIT000

gaussian-head

Official repository for 'GaussianHead: High-fidelity Head Avatars with Learnable Gaussian Derivation'

000

generative-models

Generative Models by Stability AI

Language:PythonMIT000

home-robot

Mobile manipulation research tools for roboticists

MIT000

HumanGaussian

Github Repo for "HumanGaussian: Text-Driven 3D Human Generation with Gaussian Splatting"

MIT000

Instant-angelo

Instant-angelo: Build high-fidelity Digital Twin within 20 Minutes!

Language:PythonMIT000

magic-animate

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

Language:PythonBSD-3-Clause000

MiniCPM

MiniCPM-2B: An end-side LLM outperforms Llama2-13B.

Apache-2.0000

OmniLMM

Large Multi-modal Models for Strong Performance and Efficient Deployment

Apache-2.0000

Osprey

The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Apache-2.0000

ovsam

NOASSERTION000

OVSG

[CoRL2023] Open-Vocabulary Scene-Graph

000

SAM-Graph

Code for "SAM-guided Graph Cut for 3D Instance Segmentation"

000

SuGaR

Official implementation of SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

NOASSERTION000

SwiftInfer

Efficient AI Inference & Serving

Apache-2.0000

TEASER-plusplus

A fast and robust point cloud registration library

MIT000

tensorrtx

Implementation of popular deep learning networks with TensorRT network definition API

Language:C++MIT000

ultralytics

NEW - YOLOv8 🚀 in PyTorch > ONNX > OpenVINO > CoreML > TFLite

Language:PythonAGPL-3.0000

vision_msgs

Algorithm-agnostic computer vision message types for ROS.

Apache-2.0000

YOLOv8-multi-task

Language:PythonAGPL-3.0000

YOLOv8-TensorRT

YOLOv8 using TensorRT accelerate !

MIT000

zed-open-capture

Low level camera driver for the ZED stereo camera family. API docs available here:

MIT000