Shan Tong's repositories

election2016-analysis

This project recreates some of the machine learning methods Nate Silver used in 2016 and applies them to the actual election data to determine how accurately they predicted the final results. The methods we used were Principal Component Analysis, Hierarchical Clustering, Decision Trees, Logistic Regression, and Lasso Regularization. We also applied other classification methods, such as K-Nearest Neighbors and Random Forest, and explored the possibility of Simpson's Paradox in the dataset used for the algorithms.

Language: HTML | Stargazers: 2 | Issues: 0 | Issues: 0

fatalshootings

Our project explores whether police shootings are racially biased and whether certain racial groups are targeted more often. We also look specifically at California, the state with the most police shootings, to see whether there are racial disparities among its victims.

Language: Jupyter Notebook | Stargazers: 2 | Issues: 1 | Issues: 0

danny_sql_challenge

This folder contains my progress on and solutions to the case studies in the 8 Week SQL Challenge prepared by Danny Ma.

Stargazers: 1 | Issues: 0 | Issues: 0

PSTAT131

These are homework assignments from my Introduction to Machine Learning course in Fall 2020. In this class, I learned the basic concepts of statistical machine learning and how to apply them to discover patterns and relationships in large data sets.

college-tuition-diversity-analysis

An analysis of which factors determine in-state tuition and of how African-American enrollment percentages at different institutions compare with those of other major racial groups.

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

conventional-films-and-social-media-regression-analysis

An analysis of movies made in 2014 and 2015 that uses multiple linear regression to predict the financial success of films and to investigate the relationship between screens and year.

Stargazers: 0 | Issues: 1 | Issues: 0

Covid19-Case-Forecasting

Forecasting Short-term Future COVID19 Cases

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

covid_sql_project

An SQL project focused on data exploration and data cleaning of COVID data.

Stargazers: 0 | Issues: 0 | Issues: 0

data-police-shootings

The Washington Post is compiling a database of every fatal shooting in the United States by a police officer in the line of duty since 2015.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

Data-science

Collection of useful data science topics along with code and articles

Language: Jupyter Notebook | Stargazers: 0 | Issues: 0 | Issues: 0

github-slideshow

A robot powered training repository 🤖

Language: HTML | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

workshop2021-playground

A testing repo for the UCSB Data Science Capstone Preparation Workshop

Stargazers: 0 | Issues: 0 | Issues: 0