dankimjw / gpuscraper-project

Web Scraping Project analyzing GPU Brand Value and Consumer Preference from Newegg.com

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Web Scraping Project: Analysis of GPU Brand Value and Consumer Preference

Update

The user reviews could use further analysis rather than analyzing individual words. Perhaps, ChatGPT can be leveraged to better analyze the review data.

Overview

Introduction: Graphics Card Value

Analysis of Specifications, Ratings, Reviews

Brief Analysis of Tokenized Reviews

Conclusion

What factors help obtain higher reviews?

Motivation

•Graphics processor manufacturers can differentiate themselves by performance.

•GPU’s provided are the same across all graphics cards manufacturers. Then, how do graphics card manufacturers outperform their competitors?

Questions

•Do they have lower or higher prices?•

•Do companies provide other sources of value that are not on the product specifications?

image-20200522173732450

image-20200522173447253

Motivation

image-20200522173639060

Data

Project achieves stated objectives and evidence-based actionable steps for future recommendations are given.

The review data was scraped from Neweg's website.

https://www.newegg.com

Newegg Inc. is an online retailer of items including computer hardware and consumer electronics. It is based in City of Industry, California, in the United States.

In 2016, Liaison Interactive, a Chinese technology company, acquired a majority stake in Newegg in an investment deal. Wikipedia

Customer service: 1 (800) 390-1119

Headquarters: City of Industry, CA

CEO: Fred Chang (Aug 2010–)

Revenue: 2.7 billion USD (2016)

Founder: Fred Chang

Parent organization: [Hangzhou New Century Information Technology Co., Ltd.](https://www.google.com/

Analysis

* NLP Preprocessing

Average Price

Reviews Per Brand

Average Score per brand

Average score per category

  1. Parse, clean, product and review data
    1. Product data -> csv
    2. Text Customer -> data
    3. Three Points -3 Subpoints

Limitations

Because a very specific product category was chosen

Presentation Slides

Slide1 Slide2 Slide3 Slide4 Slide5 Slide6 Slide7 Slide8 Slide9 Slide10 Slide11 Slide12 Slide13 Slide14 Slide15 Slide16 Slide17 Slide18 Slide19 Slide20 Slide21 Slide22 Slide23

About

Web Scraping Project analyzing GPU Brand Value and Consumer Preference from Newegg.com


Languages

Language:Jupyter Notebook 55.4%Language:HTML 44.5%Language:Python 0.1%