adriaanbd / kamtutecs-api

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Kamtutecs

Getting Started

Instructions

  1. Setup Backend
  2. Setup Frontend
  3. Go to localhost:3000
  4. Upload image with horizontal text in it
  5. Draw a bounding box around the desired text to extract it
  6. Submit
  7. Open Developer Tools (F12)
  8. Look at the Console to see the response

Backend

Docker

$ git clone https://github.com/adriaanbd/kamtutecs-api.git
$ cd kamtutecs-api
$ docker-compose build
$ docker-compose up

Without Docker

$ apt-get -y install tesseract-ocr tesseract-ocr-spa
$ apt-get -y install libtesseract-dev
$ apt-get -y install libleptonica-dev
$ git clone https://github.com/adriaanbd/kamtutecs-api.git
$ cd kamtutecs-api
$ pip install -r requirements.txt
$ uvicorn app.main:app

Frontend

$ git clone https://github.com/adriaanbd/kamtutecs.git
$ cd kamtutecs
$ npm install
$ npm start

About

A Dockerized API for OCR and NLP using Tesseract, OpenCV, and spaCy.


Languages

Language:Python 82.4%Language:Dockerfile 17.6%