nvnvashisth / Document_Classification

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Document_Classification

  1. Script folder will help to download the MIT big dataset
  2. Look at the another repository for pdfminer, which will help in order to preprocess the pdf to text.
  3. Trained model a. Multinomial Naive Bias Classifier - 17% b. Neural Network - 55% c. Stochastic Gradient Descent - 72%

About


Languages

Language:Python 54.6%Language:Jupyter Notebook 45.4%