omoju / receiptdID

Receipt.ID is a multi-label, multi-class, hierarchical classification system implemented in a two layer feed forward network.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Receipt.ID

About

Receipt.ID is a multi-label, multi-class, hierarchical classification system. It trains individual Random Forest text-based classifiers and combines the result with other features. Receipt.ID is built to scale with an application as the taxonomy for the domain in which it is applied grows.

Dependencies

Receipt.id is tested to work under Python 2.7 and Python 3

Code

The data preprocessing code is provided in the notebook receiptID_1_Data_Preprocessing.ipynb. While the modeling code is provided in the notebook receiptID_2_Model.ipynb.

To open it, go to the top-level project directory receiptID/ and start the notebook server:

jupyter notebook

This should open a web browser to the server's dashboard (typically http://127.0.0.1:8888). Click on the appropriate notebook (.ipynb) to open it, and follow the instructions.

Run

To run a code cell in the notebook, hit Shift+Enter. Any output will be displayed below the corresponding cell.

You can also add/edit markdown text cells and render them using Shift+Enter.

License

The contents of this repository are covered under the GNU GENERAL PUBLIC LICENSE.

About

Receipt.ID is a multi-label, multi-class, hierarchical classification system implemented in a two layer feed forward network.

License:GNU General Public License v3.0


Languages

Language:Jupyter Notebook 100.0%