wahabu / TMDb_Movie_Data_Analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

TMDb_Movie_Data_Analysis

This Project for Udacity course of Data Analysis Nonodgree. Second Project

Project: TMDb Movie Data Analysis

Introduction

We always find behind each art or product someone who works with infatuation and passion and shows to us what we do not see, therefore the author is that person who has a great interest in films and watching. And He asks us: What shows us whether the film is good or bad? there are should be factors that influence this. Through that, I tried to present something good through this project and find some a logical analysis that adds some good notes and I present them to author.

A note specifying which dataset you analyzed

Depending on link of Google doc. Then, I chose TMDb movie data. This data set contains information more than 10,000 movies collected from The Movie Database (TMDb), including user ratings and revenue.

Attribute Information

In the beginning, I knew the dataset, and then to questin what it was written and what the head of each column was and what if we could discover and see logical information, for example the value of revenue each movies in the year of issue etc. therefore decided my steps the following:

Input varibles id imdb_id popularity budget revenue original_title cast homepage director tagline kewords overview runtime genres production_companies release_date vote_count vote_average release_year budget_adj revenue_adj A statement of the question(s) you posed

I set brainstorming session some questions I could answer using the data set I chose, then I start answering those questions.

Questions To Answer:

  • What are ten movies Vs highest profit for each one?
  • We note Avatar movie is top one of profit where is more than billion and half.
  • Which movie title had the longest run time?
  • There is Band of Brothers longest run time
  • What is year has the muchest recording of movies?
  • There is 558 poducted movies at year of 2015
  • A description of what you did to investigate those questions

Top revenue

Top revenue original_title id
2781505847 Avatar 19995

Top profitable

Top profit original_title runtime
2544505847 Avatar 162

Top number release movies at year

Top release_year Number of Movies
2011 219

Top runtime movie

Top runtime original_title
705 Band of Brothers

References and Citations:

1.https://mode.com/example-gallery/python_horizontal_bar/

2.https://stackoverflow.com/search?q=sort+number+with+df+python

3.https://pandas.pydata.org/pandas-docs/stable/getting_started/basics.html?highlight=delete%20row

4.https://stackoverflow.com/questions/11285613/selecting-multiple-columns-in-a-pandas-dataframe

5.https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.plot.bar.html

6.https://stackoverflow.com/questions/26139423/plot-different-color-for-different-categorical-levels-using-matplotlib

About


Languages

Language:HTML 59.5%Language:Jupyter Notebook 40.5%