There are 1 repository under datapreparation topic.
Make your dataset talk to you. The AI assistant for data preparation.
This repo includes codes for ML Zoomcamp. If you can follow the tutorials from the link here: https://www.youtube.com/watch?v=rowoDjPc8HU&list=PL3MmuxUbc_hIhxl5Ji8t4O6lPAOpHaCLR 👩🏼💻
Trifacta Flows Examples and Templates. Flows zip files, recipes and datasets.
Public repository for custom blocks for Omniscope
Learn data visualization through Tableau 2020 and create opportunities for you or key decision-makers to discover data patterns such as customer purchase behavior, sales trends, or production bottlenecks. This Course on Udemy
Data Science materials
This repository demonstrates data imputation using Scikit-Learn's SimpleImputer, KNNImputer, and IterativeImputer.
The project deals with determining and predicting the type of accident taking place in the city of Austin. The data would help in understanding what possible factors are leading to the accidents based on the severity of the incident that has occurred.
Preparing and Cleaning Data
This project involves the use of K-Means Clustering to find the best accommodation for students in Bangalore (or any other city of your choice) by classifying accommodation for incoming students on the basis of their preferences on amenities, budget and proximity to the location.
In this project, I have used logistic regression, a supervised machine learning algorithm, to predict whether a person has diabetes or not based on various features such as age, blood pressure, glucose level, body mass index, etc. I have used Python and popular libraries such as Pandas, Scikit-Learn, and Matplotlib to perfom model building
India GDP Analysis using Python
This is my 6th semester Essentials of Data Analytics project.
CAN U PREDICT CORRECTLY WHETHER A CUSTOMER WILL CANCEL THE RESERVATION?? You can find the dataset from this kaggle website: https://www.kaggle.com/datasets/ahsan81/hotel-reservations-classification-dataset
The data is available in kaggle competitions. https://www.kaggle.com/competitions/spaceship-titanic I have participated and completed the competition on my own.
Trying to predict survival rate of passengers using algorithms like Logistic Regression, Ada Boost, Gradient Boost , Decision Tree Classifiers , Extra Tree Classifiers , Random Forest Classifiers and XG Boost with appropriate data preprocessing techniques.
For a real estate firm, building a house price prediction model based upon various factors. Problem - Regression | Algorithm used -Linear Regression using OLS
Crime and Incarceration in the United States contain data on crimes that are committed, and the prisoner counts in every 50 states, for which the data is analyzed using various analytical methods.
A one stop shop for all tools to prepare datasets for generative ai
The Bikes Sales Analysis Excel Project is a practical exploration of sales data analysis using Microsoft Excel. This project showcases how Excel can be a powerful tool for data cleaning, preprocessing, visualization, and dashboard creation, all within a familiar spreadsheet environment.
Nordstrom Products dataset preparation includes collection, discovery, cleaning, normalization, enrichment, and validation using SQL
Using Power BI to analyze the competitors sales
This project estimates Sentiment Analysis by calculating text Metrices to drive sentimental opinion, sentiment scores, readability, passive words, personal pronouns, etc, etc.
Data Preparation and Data Visualization
NLP Analysis on Tripadvisor Restaurant Reviews
End-to-end real time project from hicounselor
Cryptocurrency Clusters
Analysed Retail data to understand customer behavior, transaction pattern using SQL
Proyecto de segmentación de clientes, mediante clusterización.
This project was completed for the data preparation techniques course.
WELCOME. This is my project about a comprehernsive analysis of financial performance Insights from a leading banks