Shan Tong 's repositories
election2016-analysis
This project aims to recreate some of the machine learning methods Nate Silver used in 2016, by using the actual election data and determining how accurate some of these methods were at predicting the final results. The methods we used were Principal Component Analysis, Hierarchical Clustering, Decision Trees, Logistic Regression and Lasso Regularization. In addition to using those methods, we performed other classification methods such as K-Nearest Neighbors and Random Forest and explored the possibility of Simpson’s Paradox in our dataset used for the algorithms.
fatalshootings
Our project aims to explore whether police shootings are racially biased and if certain racial groups are targeted more. We also aim to look specifically at California which is the State that has the most police shootings and to see if there are any racial disparities in its victims.
danny_sql_challenge
This folder contains my progress and solutions for the case studies for the 8 Week SQL Challenge prepared by Danny Ma.
college-tuition-diversity-analysis
An analysis on what factor determines in-state tuition and the difference between African-American enrollment percentage in different institutions compared to other major races
conventional-films-and-social-media-regression-analysis
This is an analysis on movies created in 2014 and 2015 which utilizes multiple linear regression analysis to predict the financial success of films and investigate the relationship between screens and year.
Covid19-Case-Forecasting
Forecasting Short-term Future COVID19 Cases
covid_sql_project
This is an SQL project focused on the use of SQL for data exploration and data cleaning on COVID data.
data-police-shootings
The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.
Data-science
Collection of useful data science topics along with code and articles
github-slideshow
A robot powered training repository :robot:
workshop2021-playground
A testing repo for the UCSB Data Science Capstone Preparation Workshop