Phyllis Gitu's repositories
CovidProject
SQL code to query data from https://ourworldindata.org/covid-deaths . Download the latest data to run these SQL codes.
Baseball-Prediction-Model
To predict future season stats for baseball players a player will generate next season, we'll first download baseball season data using pybaseball and clean it. Link to the data: https://www.fangraphs.com/players/shohei-ohtani/19755/stats?position=DH
Car-Sales-Data-from-Ebay-Kleinanzeigen
This project is an analysis of the sales dataset of used cars from eBay Kleinanzeigen, a classified section of the German eBay website. This analysis provides useful insights to enable one to sell, buy or investigate the best value deals for used cars.
Clean-and-Analyze-Employee-Exit-Surveys-
Clean and Analyze Employee Exit Surveys. We'll work with exit surveys from employees of the Department of Education, Training and Employment (DETE) and the Technical and Further Education (TAFE) institute in Queensland, Australia.¶
Data-Analysis-with-SQL
In this project, we'll work with data from the CIA World Factbook, containing statistics about all of the countries on Earth. The Factbook contains demographic information like the following: population — the global population. population_growth — the annual population growth rate, as a percentage. area — the total land and water area.
Data-Visualization-on-Exchange-Rates-2008-Financial-Crisis-
The dataset we'll use describes Euro daily exchange rates between 1999 and 2021.
KaggleXProject
This repository contains a Python application that uses an XGBoost classifier to make predictions based on a CSV dataset. The application is designed to run on your local machine and use it for making predictions.
Profitable-Apps
Profitable App Profiles for the App Store and Google Play Markets Our aim in this project is to find mobile app profiles that are profitable for the App Store and Google Play markets. We're working as data analysts for a company that builds Android and iOS mobile apps, and our job is to enable our team of developers to make data-driven decisions with respect to the kind of apps they build. At our company, we only build apps that are free to download and install, and our main source of revenue consists of in-app ads. This means that our revenue for any given app is mostly influenced by the number of users that use our app. Our goal for this project is to analyze data to help our developers understand what kinds of apps are likely to attract more users.
SAT_Analysis
The SAT, or Scholastic Aptitude Test, is a test that high school seniors in the U.S. take every year. The SAT has three sections, each is worth 800 points. Colleges use the SAT to determine which students to admit. High average SAT scores are usually indicative of a good school.
Using-SQL-for-Business-Analysis
The Chinook Record Store signed a deal with a new record label, and the task is selecting the first three albums that will be added to the store, from a list of four. All four albums are by artists that don't have any tracks in the store right now - we have the artist names, and the genre of music they produce.
PrisonBreak
There have been multiple prison escapes where an inmate escapes by means of a helicopter. Using data from Wikipedia to do this project, the aim is to explore the data. Link to the data: "https://en.wikipedia.org/wiki/List_of_helicopter_prison_escapes"
Survey-Analysis-Star-Wars-Films
Among the six Star Wars movies released between 1977 and 2003; which is the most favorite and what is the order of movie preference from first to sixth among the viewers surveyed?
Visualizations
A sample