J0SAL / Aide

An App with Voice Assisted Image Captioning and VQA For Visually Challenged Individuals

AIDE

The aim of the application is to assist visually challenged individuals through image captioning and VQA (Visual Question Answering).

The app can be downloaded from here

Publication IEEE

About

A mobile application built with the MVP architecture in Flutter. It uses image processing and deep learning techniques to identify and describe the current scene through the camera, and conveys the description to the user through audio cues. Gesture controls (tap, double tap, and long-press) are supported for navigating through the app. The result can also be translated and read aloud in multiple languages.

Features And Interfaces

  1. Home page with inputs - takes an image and an (optional) question from the user.

  2. Output page - displays and speaks the answer to the question.

  3. Output page with translation - shows the result translated into another language.
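The input flow above (an image plus an optional question) can be sketched roughly as follows. This is only an illustration, not the code used in this repository: `describe_scene`, `AideResult`, and the two model callables are hypothetical names, and the rule "no question means captioning, a question means VQA" is an assumption based on the feature list.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class AideResult:
    """Hypothetical container for what the output page would show and speak."""
    mode: str  # "caption" or "vqa"
    text: str  # text to display and read aloud


def describe_scene(
    image: bytes,
    question: Optional[str],
    caption_model: Callable[[bytes], str],
    vqa_model: Callable[[bytes, str], str],
) -> AideResult:
    """Route a request to captioning or VQA (assumed behaviour, not the real backend)."""
    if not image:
        raise ValueError("an image is required")

    # Treat a missing or whitespace-only question as "no question asked".
    cleaned = (question or "").strip()
    if cleaned:
        return AideResult(mode="vqa", text=vqa_model(image, cleaned))
    return AideResult(mode="caption", text=caption_model(image))


# Stand-in models so the sketch runs without any deep learning dependencies.
def fake_caption_model(image: bytes) -> str:
    return "a dog sitting on a sofa"


def fake_vqa_model(image: bytes, question: str) -> str:
    return "brown"


caption_result = describe_scene(b"raw-image-bytes", None, fake_caption_model, fake_vqa_model)
vqa_result = describe_scene(
    b"raw-image-bytes", "What colour is the dog?", fake_caption_model, fake_vqa_model
)
```

In a real deployment the two callables would wrap the trained captioning and VQA models, and the returned text would be passed to text-to-speech (and, optionally, translation) on the app side.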

Built with 🛠️

Flask

Installation

To clone and run this repository you'll need Git and Flutter installed on your computer. From your command line:

# Clone this repository
git clone https://github.com/J0SAL/Aide.git
# Go into the repository
cd Aide
# Install packages
flutter pub get
# Run the app
flutter run

Developers Contact 📞

  1. Joy Almeida
  2. Kristen Pereira
  3. Rushil Patel

Languages

Dart 75.9%, HTML 16.1%, Python 5.6%, Swift 1.7%, Kotlin 0.5%, Objective-C 0.2%, Procfile 0.1%