ankit-kmar / Finding-Movie-Similarity-from-Plot-Summaries

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Finding-Movie-Similarity-from-Plot-Summaries

We all love watching movies! There are some movies we like, some we don't. Most people have a preference for movies of a similar genre. Some of us love watching action movies, while some of us like watching horror. Some of us like watching movies that have ninjas in them, while some of us like watching superheroes.

Movies within a genre often share common base parameters. Consider the following two movies:

2001: A Space Odyssey Close Encounters of the Third Kind

Both movies, 2001: A Space Odyssey and Close Encounters of the Third Kind, are movies based on aliens coming to Earth. I've seen both, and they indeed share many similarities. We could conclude that both of these fall into the same genre of movies based on intuition, but that's no fun in a data science context. In this notebook, I have quantify the similarity of movies based on their plot summaries available on IMDb and Wikipedia, then separated them into groups, also known as clusters. Then created a dendrogram to represent how closely the movies are related to each other.

About


Languages

Language:Jupyter Notebook 100.0%