polakowo / oscarobber

Network and sentiment analysis applied on the IMDB database

Home Page:https://polakowo.github.io/oscarobber/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

oscarobber

Graphs are everywhere and their possibilities are endless

The idea is to use graph mining to analyze collaboration of actors in science fiction movies, where two actors are nodes connected by an edge if they both appeared in the same movie. For this, we applied a set of social network analysis techniques on a network of 200k actors and 20k sci-fi movies with Python, sklearn and NetworkX.

Here we assessed the structure and behavior of the network using measures from complex network theory, such as clustering, degree distribution, centrality, assortativity, modularity, growth and preferential attachment. For example: "What are the most influential actors/movies in the network? How do actors choose in which movies to play? Do some attributes such as genre build good communities? Who is crucial in connecting Bollywood and Hollywood?"

We also applied NLP methods to analyze plot summaries, user reviews and the sentiment they express. To make things more acessible, the results of analysis are published on an interactive website.

About

Network and sentiment analysis applied on the IMDB database

https://polakowo.github.io/oscarobber/


Languages

Language:Jupyter Notebook 99.1%Language:Python 0.9%