Anaisdg / predictive-author-id

NLP Analysis for identifying authors based on excerpts/text classification.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

predictive-author-id

Natural Language Analysis for identifying authors based on excerpts

Project Title:

Spooky Author - Predictive Author Identification

Project Outline:

Predict the author of excerpts from horror stories by Edgar Allan Poe, Mary Shelley, and HP Lovecraft.

Goals:

  • Analyze sentence length, word length, word variety (vocabulary), punctuation, pronoun usage, and sentiment scores for each author in a training dataset
  • Perform probability analysis for each text and predict authorship on a test dataset with no assigned authorship
  • Compare analyses on training dataset to the test dataset once authorship has been assigned

Notes:

About

NLP Analysis for identifying authors based on excerpts/text classification.


Languages

Language:Jupyter Notebook 99.9%Language:Python 0.1%