Elliott Einstein's repositories
Machine-Learning-Cryptocurrency-Clusters
I am on the Advisory Services Team of a financial consultancy. One of MY clients, a prominent investment bank, is interested in offering a new cryptocurrency investment portfolio for its customers. The company, however, is lost in the vast universe of cryptocurrencies. They’ve asked me to create a report that includes what cryptocurrencies are on the trading market and determine whether they can be grouped to create a classification system for this new investment.
Stock_Correlation-Analysis
In this Notebook, I analyze the following five semiconductor stocks: HD, INTC, AMD, MU, NVDA, and TSM. Then, I choose the stock with the least correlation to JNJ in order to diversify a portfolio. The data was generated using the GOOGLEFINANCE historical market data script.
Indicators-of-Heart-Disease-Analysis
This project is about statistically analyzing risk factors for heart disease and performing A/B testing, descriptive and inferential statistics to provide health care plans and strategies to better understand the risk factors assocaited with heart disease and give key insights into what factors contribute most heavily and least heavily to the development of heart disease.
Machine-Learning-Predicting-Credit-Risk
In this assignment, I will be building a machine learning model that attempts to predict whether a loan from LendingClub will become high risk or not.
Pymaceutical_ANOVA_Tukey
For this Project, I first applied an analysis of variance (ANOVA) model to the Pymaceutical dataset and then did a post-hoc analysis of the results by using Tukey Honest Significant Difference (HSD) to determine which drug treatments in the dataset significantly reduce tumor volume and metastasis. I then wrote a summary of my findings.
Tableau-Project-Citi-Bike-Analysis
As the new lead analyst for the New York Citi Bike Program, I am now responsible for overseeing the largest bike sharing program in the United States. In this new role, I will be expected to generate regular reports for city officials looking to publicize and improve the city program.City officials have a number of questions on the program, so my first task on the job is to build a set of data reports to provide the answers to key business needs to drive decision making.
Top-1000-IMDB-Rated-Movies-Analysis
Group Project: Top 1000 Movies by IMDB Rating Exploratory Data Analysis
Big-Data-Challenge--Amazon-Shoppers-Product-Reviews
In this assignment I will put my ETL skills to the test. Many of Amazon's shoppers depend on product reviews to make a purchase. Amazon makes these datasets publicly available. However, they are quite large and can exceed the capacity of local machines to handle. One dataset alone contains over 1.5 million rows; with over 40 datasets, this can be quite taxing on the average local computer. My first goal for this project will be to perform the ETL process completely in the cloud and upload a DataFrame to an RDS instance. The second goal will be to use PySpark or SQL to perform a statistical analysis of selected data.
Deep-Learning-Charity-Funding-Predictor
The non-profit foundation Alphabet Soup wants to create an algorithm to predict whether or not applicants for funding will be successful. With my knowledge of machine learning and neural networks, I'll use the features in the provided dataset to create a binary classifier that is capable of predicting whether applicants will be successful if funded by Alphabet Soup.
Elliott-dev.github.io
Student Biography
ETL-with-Pandas-Project
Project uses Pandas to create multiple DataFrames from CSV files containing Disneyland Reviews and Chocolate Reviews.. Cleaned those DataFrames, then loaded to PostgreSQL to create a relational database to join everything together.
Heroes-Of-Pymoli-Video-Game-Analysis-
Heroes Of Pymoli: Like many others in its genre, the game is free-to-play, but players are encouraged to purchase optional items that enhance their playing experience. As a first task, the company would like you to generate a report that breaks down the game's purchasing data into meaningful insights.
Identify-Multiple-Faces-in-a-Photo
Use this python script to identify any faces in any images.
Intro_Data_Mining_Analytics
Read through from book.
Netflix-Data-Science-Midterm-Project
Project Name :Analysis of Video Games Sales Project description This project is about statistically analyzing platform, genre, game rating, user score, and regional user-preferences against 11563 video games dating back from 1984 to 2016 for effective marketing strategies. We use descriptive statistic to understand user trends which is necessary to target our audiences and appeal to their preferences.
NETFLIX-DataFrame_Functions
Practicing and exploring tools by Pandas
NYC_Bike_Counts_Retrospective_Analysis
I perform a retrospective analysis on the linear regression analysis that I previously performed on the NYC Bike Counts dataset. Specifically, I analyze my linear regression analysis to identify anything that I could have done differently.
NYC_Bike_Linear_Regression-
I used the New York Bike Counts dataset to formulate a hypothesis about the number of bikes crossing the Brooklyn Bridge. This dataset contains the number of bikes that crossed each bridge during each day. I first used this dataset to formulate a hypothesis and then used linear regression to test if my hypothesis was correct.
PROJECT-Investigating-Netflix-Movies-and-Guest-Stars-in-The-Office
Netflix Pandas Project (Investigation of Netflix Movies and Guest Stars in the Office)
Pymaceuticals
In this study, 249 mice identified with SCC tumor growth were treated through a variety of drug regimens. Over the course of 45 days, tumor development was observed and measured. The purpose of this study was to compare the performance of Pymaceuticals' drug of interest, Capomulin, with the other treatment regimens. You have been tasked by the senior scientist team to generate an initial drug regimen comparison and a summary of your findings.
Pymaceuticals-Continued-Making-Matplotlib-Magic
It has been a few days since you sent your boxplot to the senior scientist at Pymaceuticals and today they finally got back to you with feedback. They said your inital For this, I will leverage the same drug regimen data from last class and utilize subplots to create an advanced visualization that is packed with insightful information!
Python-API-Weather-Project
A weather analysis that randomly selects more than 500 cities across the globe, pulls data from the OpenWeatherMap API for each city. Analysis of the weather and perfect vacation spot is viewable on my Jupyter Notebook.
Python-Challenge
In this challenge, I am are tasked with creating a Python script for analyzing the financial records of my company. I will give a set of financial data called budget_data.csv. The dataset is composed of two columns: Date and Profit/Losses. (Thankfully, my company has rather lax standards for accounting so the records are simple.)In this challenge, I am tasked with helping a small, rural town modernize its vote counting process.
Simple-Facial-Recognition-Application
Face Recognition Image Matcher: This script uses the face_recognition library to compare a reference image with a set of images in a directory, identifying matches based on facial features. It efficiently handles face encoding, comparison, and reports matches or no-face-found cases.
SQLAlchemy-Project-Advanced-Data-Storage-Retrieval-Flask-
I used Python and SQLAlchemy to do basic climate analysis and data exploration of my climate database. All of the following analysis were completed using SQLAlchemy ORM queries, Pandas, and Matplotlib.
Web-Scraping-Project--Mission-to-Mars
In this Project, I will build a web application that scrapes various websites for data related to the Mission to Mars and displays the information in a single HTML page.
Web-Visualization-Dashboard-Latitude-
Latitude - Latitude Analysis Dashboard with Attitude For this Project, I created a visualization dashboard website using visualizations. I created in a past assignment. Specifically, I plotted weather data. In building this dashboard, I created individual pages for each plot and a means by which we can navigate between them. These pages contain the visualizations and their corresponding explanations. I also have a landing page, a page where we can see a comparison of all of the plots, and another page where we can view the data used to build them.