Matthew Curry's repositories
RetireRight
Personal finance SPA that uses a Monte Carlo simulation to model user net worth based on various investment and lifestyle choices. Deployed using AWS serverless technologies.
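A minimal sketch of the Monte Carlo idea described above, not the code from RetireRight itself. It assumes annual returns are drawn independently from a normal distribution; the function name, parameters, and default values are hypothetical and chosen only for illustration.

```python
import random
import statistics


def simulate_net_worth(start, annual_contribution, years, mean_return=0.06,
                       stdev_return=0.15, trials=2000, seed=0):
    """Return a list of final net worth values, one per simulated trial.

    Assumption: each year's return is an independent draw from a normal
    distribution with the given mean and standard deviation.
    """
    rng = random.Random(seed)  # seeded so repeated runs give the same results
    finals = []
    for _ in range(trials):
        worth = start
        for _ in range(years):
            annual_return = rng.gauss(mean_return, stdev_return)
            # Grow the balance by this year's return, then add the contribution.
            worth = worth * (1 + annual_return) + annual_contribution
        finals.append(worth)
    return finals


if __name__ == "__main__":
    outcomes = simulate_net_worth(50_000, 10_000, 30)
    deciles = statistics.quantiles(outcomes, n=10)
    print(f"10th percentile: {deciles[0]:,.0f}")
    print(f"median:          {statistics.median(outcomes):,.0f}")
    print(f"90th percentile: {deciles[-1]:,.0f}")
```

Reporting percentiles rather than a single average shows the spread of outcomes, which is the main reason to simulate many trials instead of compounding one fixed rate.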
Census-Income-Classifier
This classifier takes Census data and builds a logistic regression model to predict whether an individual earns more or less than $50,000 a year. The script cleans the data by handling NaN values and grouping overly granular features. It then tunes hyperparameters to choose the type of regularization and the regularization constant, and evaluates the best model on the test set, where it reached 85% classification accuracy.
Coffee-Shop-Event-Driven-Simulation
This is an event-driven simulation of a coffee shop. The program reads a text file containing customer arrival times, the revenue per customer, the cost per cashier, and the time to process a transaction. It uses object-oriented programming, event-driven simulation, and various data structures to simulate the shop and calculate the net profit for the day. The program was used to find the optimal number of cashiers by changing that number between runs and comparing the resulting net profit.
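A minimal sketch of an event-driven simulation along these lines, not the repository's implementation. It keeps pending events in a priority queue ordered by time. One assumption is added so that the number of cashiers affects revenue: a customer who arrives when every cashier is busy and the line already holds max_queue people leaves without buying. That rule and all names below are hypothetical.

```python
import heapq

# Event kinds. DEPARTURE sorts first so that, at equal timestamps,
# a cashier is freed before the next arrival is handled.
DEPARTURE, ARRIVAL = 0, 1


def simulate_shop(arrivals, cashiers, service_time, revenue_per_customer,
                  cost_per_cashier, max_queue=3):
    """Return net profit for one day given customer arrival times."""
    events = [(t, ARRIVAL) for t in arrivals]
    heapq.heapify(events)
    free = cashiers   # cashiers currently idle
    waiting = 0       # customers standing in line
    served = 0        # customers who completed a transaction
    while events:
        time, kind = heapq.heappop(events)
        if kind == ARRIVAL:
            if free > 0:
                free -= 1
                heapq.heappush(events, (time + service_time, DEPARTURE))
            elif waiting < max_queue:
                waiting += 1
            # Otherwise the line is full and the customer leaves (assumption).
        else:
            served += 1
            if waiting > 0:
                # The freed cashier immediately starts on the next customer.
                waiting -= 1
                heapq.heappush(events, (time + service_time, DEPARTURE))
            else:
                free += 1
    return served * revenue_per_customer - cashiers * cost_per_cashier


if __name__ == "__main__":
    arrivals = [0, 1, 2, 3, 5, 8, 9, 9, 12, 15]
    for n in range(1, 5):
        print(n, simulate_shop(arrivals, n, 4, 6.0, 10.0))
```

Comparing the printed profit across cashier counts mirrors the approach described above: rerun the same day with a different staffing level and keep the most profitable one.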
color-my-country
Web app allowing users to view all counties visited on a US map
Community-Center-Calculus
In a capstone mathematics course at Lafayette College, I was tasked with creating a model to optimize the Community Engagement Center's transportation schedule and with clearly presenting the model's results. I worked with a partner who was primarily responsible for developing the program, while my main task was to write up our results and define the mathematical equations that describe how the program works. This is the paper, which I primarily wrote; I include it to show my ability to convey technical information in writing.
Data-Structures
What I had remaining from CS 150 at Lafayette College: implementations of various data structures in Java.
keploy
Unit Testing Toolkit that creates test-cases and data mocks from API calls.
re-region-api
API to access demographic statistics and individual taxation estimates for US states and counties deployed on the AWS cloud.
re-region-etl
Dockerized ETL CLI tool that loads source data for the re-region-api (https://github.com/Matthew-Curry/re-region-api) into a Postgres DB from the Census Bureau Data API and Excel files produced by the Tax Foundation.
sql_on_excel
A simple command-line tool that runs SQL against Excel and CSV files by constructing temporary SQLite databases. Query results are written to an Excel file.
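A minimal sketch of the underlying technique, not the tool's own code. It loads CSV text into a temporary in-memory SQLite table and runs a query against it, using only the standard library. Reading and writing Excel files is left out because it needs a third-party package. The function name and its parameters are hypothetical.

```python
import csv
import io
import sqlite3


def query_csv(csv_text, table_name, sql):
    """Load CSV text into an in-memory SQLite table and run one query.

    Returns (column_names, rows). Every column is stored as text, so
    numeric comparisons need an explicit CAST in the SQL.
    """
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader)  # first row supplies the column names
    conn = sqlite3.connect(":memory:")  # temporary database, gone on close
    try:
        columns = ", ".join(f'"{name}"' for name in header)
        placeholders = ", ".join("?" for _ in header)
        conn.execute(f'CREATE TABLE "{table_name}" ({columns})')
        # Remaining rows are inserted as-is; each must match the header width.
        conn.executemany(
            f'INSERT INTO "{table_name}" VALUES ({placeholders})', reader
        )
        cursor = conn.execute(sql)
        names = [description[0] for description in cursor.description]
        return names, cursor.fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    data = "name,amount\nana,5\nbob,20\ncy,15\n"
    print(query_csv(
        data, "sales",
        "SELECT name FROM sales WHERE CAST(amount AS INTEGER) > 10",
    ))
```

Using an in-memory database keeps the source file untouched and leaves nothing to clean up afterwards.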
what-aspects-of-culture-impact-happiness
This script compiles data from multiple Excel files to create three separate regression models that use the cultural indices defined by Professor Geert Hofstede to explain national happiness levels from the 2018 World Happiness Report. The first model regresses happiness on the cultural variables alone, while the latter two models add features to limit omitted variable bias. Finally, the script includes several functions that analyze how the statistical significance of the cultural indices changes between models, to identify which omitted variables were responsible for the biased estimates and how they caused them.
NBA_POWER
This Jupyter Notebook compiles data from three Excel documents to relate a player's Twitter engagement to the player's salary and stats, in order to see which player-specific parameters best predict social media popularity. The approach is to build a multiple regression model, test and verify its assumptions, and then examine the p-values of the different features.
Stanford-Machine-Learning-Week-1
This is my solution to the week 1 programming assignment for the CS 229 course on machine learning offered through Coursera. The assignment involved implementing linear regression with the gradient descent algorithm. To complete it, I altered the warmUpExercise.m, plotData.m, gradientDescent.m, computeCost.m, gradientDescentMulti.m, computeCostMulti.m, featureNormalize.m, and normalEqn.m files.
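A minimal sketch of linear regression fitted by batch gradient descent, written in Python rather than the assignment's Octave/MATLAB files and not taken from the repository. It handles a single feature, and the function names, learning rate, and iteration count are illustrative assumptions.

```python
def compute_cost(xs, ys, theta0, theta1):
    """Half the mean squared error of the line theta0 + theta1 * x."""
    m = len(xs)
    return sum((theta0 + theta1 * x - y) ** 2 for x, y in zip(xs, ys)) / (2 * m)


def gradient_descent(xs, ys, alpha=0.05, iterations=2000):
    """Fit theta0 and theta1 by batch gradient descent, starting from zero."""
    theta0 = theta1 = 0.0
    m = len(xs)
    for _ in range(iterations):
        # Prediction error for every training example at the current parameters.
        errors = [theta0 + theta1 * x - y for x, y in zip(xs, ys)]
        grad0 = sum(errors) / m
        grad1 = sum(e * x for e, x in zip(errors, xs)) / m
        # Update both parameters together, using gradients from the same pass.
        theta0 -= alpha * grad0
        theta1 -= alpha * grad1
    return theta0, theta1


if __name__ == "__main__":
    xs = [0.0, 1.0, 2.0, 3.0, 4.0]
    ys = [1.0, 3.0, 5.0, 7.0, 9.0]  # generated from y = 1 + 2x
    theta0, theta1 = gradient_descent(xs, ys)
    print(theta0, theta1, compute_cost(xs, ys, theta0, theta1))
```

With features on very different scales, a fixed learning rate like this one can converge slowly or diverge, which is why feature normalization is usually applied before gradient descent in the multi-feature case.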
swagger-ui
Swagger UI is a collection of HTML, JavaScript, and CSS assets that dynamically generate beautiful documentation from a Swagger-compliant API.