Oliver Zagorin's repositories
Credit-Risk-Classification-using-Logistic-Regression
Trained and evaluated two supervised machine learning models using original and resampled data to identify 'healthy loan' and 'high risk loan' applicants from financial disclosures.
Business-Data-Analysis-Queries-in-SQL
This project shows the creation of an SQL database from six related csv datasets, focused on an organization containing over 300,000 employees. I also conducted queries in order to obtain specific pre-defined results.
Crowdfunding-ETL
My team built an ETL pipeline using Python, Pandas, and regular expressions to extract and transform the data. After we transformed the data, we created four CSV files and used the CSV file data to create an Entity Relationship Diagram. Finally, we uploaded the CSV file data into a Postgres database.
Neural-Network-Analysis-of-Corporate-Funding-Allocation
Analysis of over 34,000 businesses that received funding, to generate 184 Neural Network algorithm to predict effective allocation of funding.
Polling-and-Banking-Data-Analysis-in-Python
We are performing two analyses using Python. The first analysis consists analyzing polling data for a small town, while the second consists of analyzing financial data.
Predicting-Autism-with-Machine-Learning
Development and comparison of 12 machine learning models to predict autism as well as a discussion of the process.
School-Resources-and-Outcomes-Analysis-in-Jupyter-Notebook
Using Pandas and Jupyter Notebook, I created a report that analyzes Student and School data together in order to gain insights into how student education outcomes relate to a variety of school-level factors.
Stock-Analysis-in-VBA
Outcome and insights analysis on "Multiple_year_stock_data" dataset, which contains stock data over the course of 3 years: 2018, 2019, and 2020. This dataset includes over 2.26 million individual entries.
Autism-Diagnosis-with-Linear-Regression-and-Neural-Networks-using-Random-Oversampling
We used a dataset that included birth and personal data as well as Autism Spectrum Quotient test scores to train machine learning algorithms to predict autism. We used Logistic Regression, Neural Network Models and Keras Tuner with Random Oversampling to train one with 90% accuracy.
Cryptocurrency-Cluster-Analysis-with-K-Means-and-PCA
Used K Means and PCA to analyze 42 cryptocurrencies in order to determine the effect of price changes over different periods of time.
Crime-and-Cannabis-Legal-Status-Analysis-Team-Project
This was a team project focused on analyzing the relationship between the legal status of cannabis and crime rates in 2020. This includes the analysis, data visualizations, and a powerpoint presentation that discusses our insights and findings.
Crowdfunding-Analysis-in-Excel
Analysis of crowdfunding campaigns to uncover actionable insights.
Food-Standards-Evaluation-in-NoSQL
This repository focuses on loading and performing exploratory analysis on a dataset in NoSQL and MongoDB which contains rating data for restaurants
Mars-Data-Web-Scraping
Web Scraping HTML/CSS using BeautifulSoup on Mars News and Weather websites.
Microbe-Biodiversity-Analysis-and-Dashboard-Creation-
The analysis reveals that a small handful of microbial species (also called operational taxonomic units, or OTUs, in the study) were present in more than 70% of people, while the rest were relatively rare.
Weather-Station-Analysis-with-SQLAlchemy
Analysis of two datasets which relate to weather station information over the course of one year in Hawaii in 2016-2017. Analysis using SQLAlchemy ORM queries, and Pandas and Matplotlib in Python. A Flask API is created to store all the information.