Maggie Han's repositories
Cancer-Sentiment-Analysis
This project was run in Databricks using Spark to analyze recent news about cancer for sentiment evaluation. The goal is to practice traditional NLP techniques such as tokenization, stopword removal, CV, TF-IDF, and n-grams. The project also used tools such as AWS S3, Athena, and QuickSight to handle big data.
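As a rough illustration of the preprocessing steps named above (tokenization, stopword removal, n-grams, TF-IDF), here is a minimal sketch in plain Python. It is not the project's Spark code: the stopword list, the sample headlines, and the function names are all hypothetical, and the TF-IDF weighting shown (term frequency times log of inverse document frequency, without smoothing) is only one common variant.

```python
import math
from collections import Counter

# Hypothetical stopword list, for illustration only.
STOPWORDS = {"the", "a", "of", "in", "for", "and", "to", "is"}


def tokenize(text):
    """Lowercase, split on whitespace, strip punctuation, drop stopwords."""
    words = (w.strip(".,!?;:'\"()") for w in text.lower().split())
    return [w for w in words if w and w not in STOPWORDS]


def ngrams(tokens, n=2):
    """Return the list of n-grams (joined by spaces) over a token list."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document."""
    n_docs = len(docs)
    # Document frequency: in how many documents each term appears.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Made-up sample headlines, not data from the project.
headlines = [
    "New cancer drug shows promise in trial",
    "Cancer screening rates fall in the region",
]
tokenized = [tokenize(h) for h in headlines]
bigrams = [ngrams(tokens, 2) for tokens in tokenized]
scores = tf_idf(tokenized)
```

A term that appears in every document (here, "cancer") receives a weight of zero under this unsmoothed variant, which is why TF-IDF tends to surface the words that distinguish one article from another.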
contributions
🚀✨ Help beginners to contribute to open source projects
Customer-Shopping-Behavior-Regression-Model
This project includes an XGBoost regression model that predicts the likelihood of a customer making a purchase based on their online shopping behavior. In addition, a recommendation model including both collaborative filtering (CF) and content-based filtering (CBF) was built using customer purchase transaction data.
Lululemon-Webscraping
This project aims to find the reasons for lululemon's success by analyzing its products, prices, and reviews. The main skill sets required are Python, Pandas, BeautifulSoup, Selenium, Matplotlib, Seaborn, etc.
maggiehan.github.io
Please click on my portfolio webpage!
Product-Review-Project
This project includes web scraping, clustering, and classification models. The ML models were applied to predict course types, completing the dataset for further analysis. The aim is to evaluate course reviews for an online education platform.
Telecom-Churn
This project provides a template for a traditional binary classification model. Feel free to check the detailed steps of the whole machine learning modelling process.
Uber-Visualization
This is a Tableau visualization of New York City's Uber pickup data from April to September.