There are 44 repositories under vision topic.
Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3, ChatGPT Plugins, OpenAI Functions, Secure Multi-User System, Presets, completely open-source for self-hosting. More features in development
Awesome pre-trained models toolkit based on PaddlePaddle. (400+ models including Image, Text, Audio, Video and Cross-Modal with Easy Inference & Serving)
📸 A powerful, high-performance React Native Camera library.
Automate browser-based workflows with LLMs and Computer Vision
One UI is all done with chatgpt web, midjourney, gpts,suno-v3,luma ; Simultaneous support Web / PWA / Linux / Win / MacOS platform
👨🏻💻 Examples of new iOS 11 APIs
Open source hardware and software platform to build a small scale self driving car.
The Open Source Framework for Machine Vision
Java and Kotlin Code samples used on cloud.google.com
Python code to fuse multiple RGB-D images into a TSDF voxel volume.
Videos, notes and experiments to understand deep learning
[Deprecated] 🇨🇳**二代身份证光学识别
Train robotic agents to learn to plan pushing and grasping actions for manipulation with deep reinforcement learning.
Deepdrive is a simulator that allows anyone with a PC to push the state-of-the-art in self-driving
A curated collection of iOS, ML, AR resources sprinkled with some UI additions
A vision library for genicam based cameras
A CUDA implementation of SIFT for NVidia GPUs (1.2 ms on a GTX 1060)
3DMatch - a 3D ConvNet-based local geometric descriptor for aligning 3D meshes and point clouds.
🎰 A curated list of machine learning resources, preferably CoreML
Multi-Sensor Fusion (GNSS, IMU, Camera) 多源多传感器融合定位 GPS/INS组合导航 PPP/INS紧组合
Fuse multiple depth frames into a TSDF voxel volume.
Implementation of Bottleneck Transformer in Pytorch
Library to build personalized AI powered by what you've seen, said, or heard. Works with Ollama. Alternative to Rewind.ai. Open. Secure. You own your data. Rust.
Computer vision based ML training data generation tool :rocket:
Code for "Dense Object Nets: Learning Dense Visual Object Descriptors By and For Robotic Manipulation"
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Rewriting a Deep Generative Model, ECCV 2020 (oral). Interactive tool to directly edit the rules of a GAN to synthesize scenes with objects added, removed, or altered. Change StyleGANv2 to make extravagant eyebrows, or horses wearing hats.