zinizhu / cs686-pp-yelp-dataset-analysis

CS686 Data Processing Personal Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Yelp Dataset Analysis With BigQuery & Data Studio

Introduction

This is the personal project for CS686 Data Processing in Cloud. In this project, a series of data analysis are conducted based on Yelp's public dataset. The analysis focused on three aspects: Yelp's development, businesses advertised on Yelp, and Yelp's users.

Data Source

Data is from Yelp's public dataset and downloaded from Kaggle.

Tools

In this project, BigQuery is used to conduct data analysis. Data Studio is used to visualize data. Both tools are from Google Cloud Platform.

Report

This repo contains both the analysis report and the pdf version of the Data Studio report.

You can access the Data Studio report here.

You can access the Google Doc Report here.

In the 'Business Analysis' Section, the report allows 'custom query' on different businesses, but this requires edit access of the report, and the link above only grants you view access. If you want to try out the feature, please contact me at zzhu35@dons.usfca.edu. To change the custom query parameter, enter the edit mode -> click on the target graph -> in DATA section, select a different business from the dropdown. Note here you can only change one graph's parameter at a time, so if you want to see the complete summary for a different business, you have to change all relevant graphs.

About

CS686 Data Processing Personal Project