Hannah Huang's repositories
Twitter-Sentiment-Analysis-about-ChatGPT
A quantitative study on over 1.25 million tweets about ChatGPT, employed data scrapping, data cleaning, EDA, topic modeling, and sentiment analysis.
DS-Take-Home-Challenge
My solution to the book: A Collection of Data Science Take-Home Challenges
Stroke-Prediction
A project that works on exploratory data analysis, feature engineering, and model selection for the prediction of stroke disease.
AB_Testing_Udacity_Free-trial-screener
This project aims at analyzing two versions of the Udacity course review page and determining whether the new feature will be effective in reducing the early course cancellation during the free trial period. The project includes choosing a metric, building intuition, defining hypothesis and comparing two samples for hypothesis testing.
Formula1-Racing-Cloud-Data-Platform
An end-to-end data engineering solution using Azure Databricks, PySpark, Spark SQL, Azure Data Lake Gen2, Azure Data Factory, and Power BI to analyze Formula 1 racing data.