There are 2 repositories under the datanalysis topic.
A data science and analytics project that performs descriptive and exploratory data analysis, then applies predictive modelling to determine which of the best and most experienced employees are leaving prematurely, and why.
A Machine Learning model for generating gravitational waves
Hospital Database Management System (DBMS) is a comprehensive SQL project designed to streamline and optimize the management of hospital operations. This project aims to provide an efficient and user-friendly solution for storing, retrieving, and manipulating various types of healthcare-related data.
Data Analysis Project on Power BI
Hydro Analyser is the first-ever open-source tool designed for hydrogeologists, engineers, and other professionals in the water resources industry to accurately analyze and optimize groundwater pumping tests.
Data Analysis and Data Visualization with Python & PowerBI
Basic and advanced code, notes, and exercises for data analysis and data visualization. Includes notes on statistics and advanced statistical implementations in Python, plus folders on exploratory data analysis, regex, linear algebra, etc. Libraries: Pandas, NumPy, Seaborn, Matplotlib, SciPy, Researchpy.
Introduction to Data, Signal, and Image Analysis with MATLAB. Welcome to Introduction to Data, Signal, and Image Analysis with MATLAB! MATLAB is an extremely versatile programming language for data, signal, and image analysis tasks, including hundreds if not thousands of functions. With such a comprehensive tool set, knowing where to start can be overwhelming. My goal is to help you learn the basics, with video lessons and assignments that introduce you to the most fundamental functions, show you how to write new code, and demonstrate how to learn to use functions you have not used before. This course is designed to introduce data, signal, and image processing and analysis to students who have little or no experience with data and signals but have basic programming experience in the MATLAB programming language, for example, those who have completed the Introduction to Programming with MATLAB course. The level is targeted at first-year college students and high school seniors, but really this course is suitable for anybody who wants to learn about data and signal analysis and has experience with linear algebra. The length of the course is five weeks: four weeks of video lectures plus an extra week for a final project submission. Schedule: Week 1: Introduction; Week 2: Data analysis in MATLAB; Week 3: Signal analysis in MATLAB; Week 4: Image analysis in MATLAB; Week 5: Course Project. Course objectives: after completing this course, a learner will be able to use MATLAB to understand how signals, images, and data are represented; load and save datasets; visualize high-dimensional data; apply machine learning methods for data classification; perform signal frequency analysis; design signal and image filters; and process and analyze image content.
Prediction using Decision Tree Algorithm to create Decision Tree Classifier.
User Guide of the Data Interpolating Variational Analysis (Diva) software tool
The dataset that we will be wrangling (and analysing and visualising) is the tweet archive of Twitter user @dog_rates, also known as WeRateDogs. WeRateDogs is a Twitter account that rates people's dogs with a humorous comment about the dog.
Predicting which areas in L.A. are more prone to severe crime using decision trees and logistic regression models in R.
Food Market Analysis: a comprehensive analysis of the food market, covering data preparation, exploratory data analysis (EDA), feature engineering, and machine learning modeling. Gain insights into customer demographics, meal preferences, and dining trends. Leverage findings for business decisions and further analysis.
Restaurant Tips Optimization Project: Unlock the potential of your restaurant's earnings with our comprehensive Tips Optimization Project. We analyze customer data, unveil insights, and employ advanced machine learning models to predict tip amounts. This project empowers restaurant owners and staff to enhance services, boost earni
An ensemble of three models (AdaBoost, XGBoost, and Random Forest) to classify machine failures.
In this repository I will study data science from the beginning
In this project I used the EasyOCR library, an open-source deep learning project that makes it easy to extract text from images.
Python programme for scraping live football data from NaijaBet using Selenium.
Normalizing Covid-19 data for efficient analysis using SQL.
This is a project for RajasthanHackathon 4.0
Analysis of the recognized moons of the planets and of the largest potential dwarf planets of the Solar System
Greetings! This repository showcases the continuous assessment for CCT College Dublin's "Machine Learning for Business" course, specifically focusing on the application of machine learning models to time series data. In this project, we applied a total of 8 time series models to gain comprehensive insights into the dataset.
The Kings County Housing Project analyzes house sales in a northwestern county, examining features like bedrooms, square footage, location, and condition to understand their impact on prices. Insights and recommendations are provided to sellers and buyers, helping them estimate home values and make informed decisions.
Data Analysis of Bicycle Manufacturing Company Using Python, SQL and Power BI
Web scraping and analysis project using Python, Pandas, Selenium, and Beautiful Soup. Gathers data from a government website, converts it to a structured format, generates daily sales totals, and identifies potential customers, yielding insights for data-driven decisions.
An SQL analysis of traffic crash reports on city streets within the City of Chicago limits and under the jurisdiction of the Chicago Police Department (CPD). Data are shown as-is from the electronic crash reporting system (E-Crash) at CPD, excluding any personally identifiable information.
A dataset of OSHA fatality reports of work-related deaths from June 2009 through Dec 31, 2021.
This project uses machine learning to cluster loans based on their similarities. It uses a dataset of loan applications that includes the loan amount and balance, and applies a clustering algorithm to group similar loans together.
Exploratory Data Analysis of Dress Sales Data from a raw dataset
mooKIT is an open-source MOOC management system designed and developed at IIT Kanpur to address the challenges of hosting, managing, and scaling MOOC courses to local needs.
EDA on the Zomato dataset
Dashboards (Netflix overview - Part 1, Single Title View - Part 2) were created in Power BI, using Excel and SQL to extract, organize, and manipulate the data.