There are 52 repositories under the vision topic.
Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens. An open platform for various uses.
Enhanced ChatGPT Clone: Features Agents, MCP, DeepSeek, Anthropic, AWS, OpenAI, Responses API, Azure, Groq, o1, GPT-5, Mistral, OpenRouter, Vertex AI, Gemini, Artifacts, AI model switching, message search, Code Interpreter, langchain, DALL-E-3, OpenAPI Actions, Functions, Secure Multi-User Auth, Presets, open-source for self-hosting. Active.
The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra
Automate browser based workflows with AI
AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording
📸 A powerful, high-performance React Native Camera library.
One UI for it all: chatgpt web, midjourney, gpts, suno, luma, runway, viggle, flux, ideogram, realtime, pika, udio; simultaneously supports Web / PWA / Linux / Win / MacOS platforms
Open source hardware and software platform to build a small scale self driving car.
👨🏻‍💻 Examples of new iOS 11 APIs
[CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.
The Open Source Framework for Machine Vision
Java and Kotlin Code samples used on cloud.google.com
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Videos, notes and experiments to understand deep learning
Let Home Assistant see!
A vision library for genicam based cameras
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
[Deprecated] 🇨🇳 Optical recognition of second-generation ID cards
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
Multi-Sensor Fusion (GNSS, IMU, Camera): multi-source, multi-sensor fusion positioning; GPS/INS integrated navigation; PPP/INS tight coupling
3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.
🎰 A curated list of machine learning resources, preferably CoreML
Fuse multiple depth frames into a TSDF voxel volume.
CALVIN - A benchmark for Language-Conditioned Policy Learning for Long-Horizon Robot Manipulation Tasks
Implementation of Bottleneck Transformer in PyTorch
Convert any web design screenshot to clean HTML/CSS code