urigoren / contract_classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Contract classification challenge

Prerequisites

  1. Python 3.6+ installed
  2. Pip (curl https://bootstrap.pypa.io/get-pip.py -o get-pip.py && python get-pip.py)
  3. Jupyter notebook

Recommended software for Windows users

  1. Anaconda: https://www.anaconda.com/download/#windows
  2. cmder: https://github.com/cmderdev/cmder/releases/download/v1.3.6/cmder.zip

Installation steps

  1. Clone this repository
  2. Download the training data from: https://drive.google.com/file/d/1CD-wyXoZ5UKJWN20kZDWvdVqcQvRCzP5/view
  3. Download the test data from: https://drive.google.com/file/d/1d3doBKzy3TNgtAnqkiB0BouwdOuf9pPF/view
  4. Copy it to data/
  5. Make sure all the requirements are installed pip3 install -r requirements.txt OR conda install --yes --file requirements.txt if you're with Anaconda
  6. Launch Jupyter by running cd notebooks; jupyter notebook in your terminal

Dataset

  1. data.zip - The raw contracts, classified by their filename
  2. test_data.zip - Unlabeled contracts

Leaderboard

is available at http://goren.ml/uattcontract

A minimum accuracy of 90% is required

About


Languages

Language:Jupyter Notebook 96.9%Language:Python 3.1%