There are 7 repositories under data-cleansing topic.
The JavaScript data transformation and analysis toolkit inspired by Pandas and LINQ.
Desbordante is a high-performance data profiler that is capable of discovering many different patterns in data using various algorithms. It also allows to run data cleaning scenarios using these algorithms. Desbordante has a console version and an easy-to-use web application.
Exploratory data analysis 📊using python 🐍of used car 🚘 database taken from ⓚ𝖆𝖌𝖌𝖑𝖊
Quizzes & Assignment Solutions for Google Data Analytics Professional Certificate on Coursera. Also included a few resources on side that I found helpful.
Wrangler Transform: A DMD system for transforming Big Data
XGBoost, LightGBM, LSTM, Linear Regression, Exploratory Data Analysis
This is a binary classification problem related with Autistic Spectrum Disorder (ASD) screening in Adult individual. Given some attributes of a person, my model can predict whether the person would have a possibility to get ASD using different Supervised Learning Techniques and Multi-Layer Perceptron.
This repo created for sharing the required/discussed files during Online Internship training program on Data Science Using Python in May-2021
An SQL data cleaning project
Make quick and dirty data mining made easier in Sublime Text
Predict if a driver will file an insurance claim next year. (Kaggle Competition)
This library contains the file system extensions to Data-Forge that allow it to directly read and write CSV and JSON files in Node.js
Data cleaning tool.
Data cleanse, clustering with Vector Quantization and Adaptive Resonance Theory
Product Rationalization of Pro Bikes Inc using Power BI
Data Structures project in C++11 language, uses custom Vector & String structures with Move Semantics (Rule of Five)
This is Repository containing two .csv files , one is cleaned and another one is very unconsistent and non-regular data of Mobile Phones. See the difference between both the data files.
Comprehensive Power BI dashboards showcasing insights on Call Centre Trends, Customer Retention, and Diversity & Inclusion to drive business impact.
Data Cleaning is a python package for data preprocessing. This cleans the CSV file and returns the cleaned data frame. It does the work of imputation, removing duplicates, replacing special characters, and many more.
This repository contains all the files related to project's data collection, data normalization / cleansing and database management.
Power BI based Data\Business analysis of e-commerce company with focus on demographic analysis
A dataset of waste vehicles with 350,000+ rows is cleaned using pandas and jupyter notebook in python.
Here is some implementation and using methods in Topics on Data mining course
A Python script to Parse data from Non-Meaningful data to Meaningful and save it to .csv File
This is the source code for the paper "A probabilistic database approach to autoencoder-based data cleaning".
Some little json tools for my own use and maybe can help you
Commercial banks receive a lot of applications for credit cards. Many of them get rejected for many reasons, like high loan balances, low income levels, or too many inquiries on an individual's credit report, for example. Manually analyzing these applications is mundane, error-prone, and time-consuming. Luckily, this task can be automated with the power of machine learning. Here is an automatic credit card approval predictor using machine learning techniques.
This project showcase the business dashboard for an E-Commerce company based on its 2008 - 2012 sales records
⭐️ Google Data Analytics + Coursera ⭐️ 👩💻 Datos, datos, en todas partes(este curso) 🔍 Skills : Spreadsheet, Data Cleansing, Data Analysis, Data Visualization (DataViz), SQL
Categorical Binary Feature encoding script