Patryk-S-W / kaggle_data_analysis

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Ramen Data Analysis

Analysis of ramen ratings dataset from Kaggle.

  • Frequency of stars given in each country
  • Most eaten Style of Ramen
  • Number of participants in each country
  • Top 10 best Brands
  • Top 10 worst Brands
  • Percentage of good and bad ramen
  • Number of Bad and Good ratings in each serving style
  • Number of brands serving ramen in specific style
  • Elbow Method
  • K-means clustering

Context

The Ramen Rater is a product review website for the hardcore ramen enthusiast (or "ramenphile"), with over 2500 reviews to date. This dataset is an export of "The Big List" (of reviews), converted to a CSV format.

Content

Each record in the dataset is a single ramen product review. Review numbers are contiguous: more recently reviewed ramen varieties have higher numbers. Brand, Variety (the product name), Country, and Style (Cup? Bowl? Tray?) are pretty self-explanatory. Stars indicate the ramen quality, as assessed by the reviewer, on a 5-point scale; this is the most important column in the dataset!

Note that this dataset does not include the text of the reviews themselves. For that, you should browse through https://www.theramenrater.com/ instead!

Acknowledgements

This dataset is republished as-is from the original BIG LIST on https://www.theramenrater.com/.

Inspiration

  • What are the best ramen brands?
  • Prediction of ramen ratings.

About


Languages

Language:Jupyter Notebook 99.1%Language:Python 0.9%