0iui0's repositories
otel-profiling-agent
The production-scale datacenter profiler
marepo
[CVPR 2024 Highlight] Map-Relative Pose Regression for Visual Re-Localization
nersemble
[Siggraph '23] NeRSemble: Neural Radiance Field Reconstruction of Human Heads
CLoT
Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation" (CVPR 2024)
self-operating-computer
A framework to enable multimodal models to operate a computer.
pram
official implementation of PRAM: Place Recognition Anywhere Model for Efficient Visual Localization
GoMVS
[CVPR'24]🦿GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
latentsplat
Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstruction
UFO
A UI-Focused Agent for Windows OS Interaction.
Gaussian-Head-Avatar
[CVPR 2024] Official repository for "Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians"
cobra
Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference
mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
FlashAvatar-code
[CVPR 2024] The official repo for FlashAvatar
SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
XScale-NVS
The official implementation of the CVPR'24 paper titled "XScale-NVS: Cross-Scale Novel View Synthesis with Hash Featurized Manifold".
vlfm
The repository provides code associated with the paper VLFM: Vision-Language Frontier Maps for Zero-Shot Semantic Navigation (ICRA 2024)
Neural3DStrokes
Pytorch Code for "Neural 3D Strokes: Creating Stylized 3D Scenes with Vectorized 3D Strokes"
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
gaustudio
A Uniform Framework for 3D Gaussian Splatting and Beyond
GLEE
【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale
SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
myscaledb
An open-source, high-performance SQL vector database built on ClickHouse.
U-ARE-ME
Uncertainty-Aware Rotation Estimation in Manhattan Environments using only monocular cues.
OmniSeg3D-GS
3D Gaussian Splatting adapted version of OmniSeg3D (CVPR2024)