There are 1 repository under vision-api topic.
:scissors: Crop faces, inside of your image, with iOS 11 Vision api.
A memory efficient Android image transformation library providing cropping above Face Detection (Face Centering) for Glide.
✂️ Detect and crop faces, barcodes and texts in image with iOS 11 Vision api.
Rename images using deep learning
Smart Expense Manager App
Open Source Hackathon - Convert your database diagrams into SQL Schema using the capabilities of Vision API
Real-time Object Recognition using Apple's CoreML 2.0 and Vision API -
Code examples for Google Vision API.
Play around with code while we make sure you aren't lost!
This repository is for self driving car project developed as a part of Software Engineering project at NIIT University
An innovative AI conversation API leveraging Google's Gemini for multimodal understanding. Combines FastAPI, Langchain, and Redis for robust, scalable, and privacy-conscious text and image-based interactions
A simple Telegram bot that performs OCR on images you send to it
Customized Google Vision API Barcode Scanner
object detect and track demo using ios 11 vision api
Starter code for using GPT4o to extract text from an image
A Wordpress plugin to detect broad sets of objects in your media library images, from flowers, animals, or transportation to thousands of other object categories commonly found within images.
GPT_NEXT is an AI chat tool (OpenAI + Groq).
A WhatsApp Sticker Maker iOS App with Vision API Subject Lifting / Background Removal
This game showcases the immersive capabilities of VisionOS, where players pop floating bubbles 🫧 within a virtual space. It’s simple, addictive, and perfect for demonstrating the power of augmented reality gaming!
Medium tutorial: https://medium.com/@juancurti.it/convert-paper-based-notes-to-html-content-with-google-vision-api-e398fdb45cb9
Project is now moved to mgks.dev (Pro version of SWV with additional features)
Android Google Play API Demos
VisionDetect let you track user face gestures like blink, smile etc.
An automatic parking system solution for the modern work spaces.
A low cost reading device for blind people.
The GroqCloud API wrapper for Delphi provides access to models from Meta, OpenAI, MistralAI and Google on Groq’s LPUs, offering chat, text generation, image analysis, audio transcription, JSON output, tool integration, and content moderation capabilities.
oCaption: Leveraging OpenAI's GPT-4 Vision for Advanced Image Captioning
Contains Colab Notebooks show cool use-cases of different GCP ML APIs.
Demo repository for Google Cloud Vision API Product Search
An Android Beer Detector developed to test Google's Vision API