0iui0's repositories
AppFlowy
AppFlowy is an open-source alternative to Notion. You are in charge of your data and customizations. Built with Flutter and Rust.
CLoT
Official Codebase of our Paper: "Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation" (CVPR 2024)
cobra
Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference
corenet
CoreNet: A library for training deep neural networks
FlashAvatar-code
[CVPR 2024] The official repo for FlashAvatar
flowmap
Code for "FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient Descent" by Cameron Smith*, David Charatan*, Ayush Tewari, and Vincent Sitzmann
gaustudio
A Uniform Framework for 3D Gaussian Splatting and Beyond
GLEE
【CVPR2024】GLEE: General Object Foundation Model for Images and Videos at Scale
GoMVS
[CVPR'24]🦿GoMVS: Geometrically Consistent Cost Aggregation for Multi-View Stereo
GRM
Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
LeanCopilot
LLMs as Copilots for Theorem Proving in Lean
marepo
[CVPR 2024 Highlight] Map-Relative Pose Regression for Visual Re-Localization
mickey
[CVPR 2024 - Oral] Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences
MobileAgent
Mobile-Agent: Autonomous Multi-Modal Mobile Device Agent with Visual Perception
myscaledb
An open-source, high-performance SQL vector database built on ClickHouse.
OmniSeg3D-GS
3D Gaussian Splatting adapted version of OmniSeg3D (CVPR2024)
openai-translator
基于 ChatGPT API 的划词翻译浏览器插件和跨平台桌面端应用 - Browser extension and cross-platform desktop application for translation based on ChatGPT API.
otel-profiling-agent
The production-scale datacenter profiler
pram
official implementation of PRAM: Place Recognition Anywhere Model for Efficient Visual Localization
promptflow
Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
SplaTAM
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM
SWE-agent
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.29% of bugs in the SWE-bench evaluation set and takes just 1.5 minutes to run.
U-ARE-ME
Uncertainty-Aware Rotation Estimation in Manhattan Environments using only monocular cues.
UFO
A UI-Focused Agent for Windows OS Interaction.
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction"
vggsfm
[CVPR 2024 Highlight] VGGSfM Visual Geometry Grounded Deep Structure From Motion
XScale-NVS
The official implementation of the CVPR'24 paper titled "XScale-NVS: Cross-Scale Novel View Synthesis with Hash Featurized Manifold".