Emily Maynard's repositories
Amazon_Vine_Analysis
Pick one of 50 datasets and use PySpark to perform the ETL process to extract the dataset, transform the data, connect to an AWS RDS instance, and load the transformed data into pgAdmin. Next, use PySpark to determine if there is any bias toward favorable reviews from Vine members.
bikesharing
Use bike sharing data from one city to help determine if the program will be successful in another city. Use Tableau to create a dashboard with data visualizations.
Credit_Risk_Analysis
Using supervised machine learning to predict credit risk. Trying oversampling, under sampling, combination sampling and ensemble learning to find the model with the best fit
Crime-Data
Data in Motion weekly challenges
Crop_Yield_Analysis
Columbia University final group project
Cryptocurrencies
Unsupervised machine learning to preform principal component analysis, and use clustering to determine trends in different crytocurrencies.
Mapping_Earthquakes
Using D3, geoJson, Mapbox, API calls, and Leaflet to create a map plotting earthquakes over the last seven days from USGS data with magnitude.
MechaCar_Statistical_Analysis
Using R and statistics in RStudio to analyze different variables for review. Use the production data for insights that may help the manufacturing team. Perform multiple linear regression analysis to identify which variables in the dataset predict the mpg .
Movies-ETL
This project uses exploratory data analysis or EDA to scrape movie data from the internet. The ETL process cleans files, merges dataframes, and publishes to a SQL database. This code creates functions, uses list comprehension, and utilizes regular expressions to clean the web data. After cleaning the data step by step, the code was refactored to take three files in a function order to automate the pipeline.
Neural_network_charity_analysis
Exploring machine learning with nueral networks for a charity analysis. Adjusting the model to try and improve accuracy to predict which projects are likely to be successful.
Portfolio
A portfolio with info about me and some of my projects
Election_Analysis
Using Python to automate an election analysis. Using for loops and practice coding!
emaynard10
Config files for my GitHub profile.
energy-stock-analysis
An analysis of renewable energy stocks using VBA
Mission-to-Mars
Using webscraping, this analysis visits the NASA Mars news. page and Space images pages to scrape data which is then displayed on a website about Mars.
Model-Optimization
Learning about model optimization, choosing good metrics, resampling, and hyperparameters.
Pewlett-Hackard-Analysis
Using SQL to analyze HR data and link tables to determine a process for company retirement planning and mentorship programs.
Phenology_Analysis
Plant Phenology study with IPF
Plotly_Biodiversity
This analysis uses Plotly to make interactive charts for a study on belly button biodiversity. The analysis uses
PyBer_Analysis
Use Matplotlib to visualize ride sharing data in different types of cities to determine trends for program improvements.
School_District_Analysis
This analyzes and compares schools in a district for a school board to allocate funding and resources based on school performance
Spam-Detection-Model
Using logistic regression and random forest classifier to compare models and try to create a span detection model.
UFOs
Use D3 and javascript to create a website that allows users to filter for UFO criteria to find out about various UFO sightings. Using bootstrap for formatting.
World_Weather_Analysis
Using API calls to GoogleMaps and OpenWeather, this analysis uses certain weather criteria to plan a vacation itinerary with hotel stops.