krpetrich / Movie-Data-NLTK-Analysis

Master's level class project to analyze a movie dataset with NLTK

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Movie-Data-NLTK-Analysis

Code developed for a final project in Master's level introductory Python class.

Program has two movie datasets in varying structures.

The movies are read in and instances are created for each unique movie.

The movie summaries are analyzed with NLTK to categorize which words are nouns or adjectives. Additional analyses like determining most popular movies, most frequent nouns and adjectives, and movie title length are completed.

Future improvements:

  • Use NLTK to perform part of speech counts rather than using a Pandas DataFrame
  • Determine if there is a correlation between sentiment of movie summary or title and popularity.

About

Master's level class project to analyze a movie dataset with NLTK


Languages

Language:Python 100.0%