There are 9 repositories under data-preparation topic.
Prepping tables for machine learning
Data Preparation for Satellite Machine Learning
An open source book to learn data science, data analysis and machine learning, suitable for all ages!
A New, Interactive Approach to Learning Data Science
convtools is a python library to declaratively define conversions for processing collections, doing complex aggregations and joins.
ABAP unit testing framework, prepare in Excel, reuse in abap code
This repository contains my implementations of the algorithms which MoNuSAC participants could use for data preparation to train their models at ISBI 2020.
“Data science” is just about as broad of a term as they come. It may be easiest to describe what it is by listing its more concrete components: Data exploration & analysis. Included here: Pandas; NumPy; SciPy; a helping hand from Python's Standard Library.
Market Mix Modelling for an eCommerce firm to estimate the impact of various marketing levers on sales
GWAS summary statistics files QC tool
Data preparation for data science projects.
Foofah: programming-by-example data transformation program synthesizer
Pipeline module for parallel real-time data processing for machine learning models development and production purposes.
Extract and evaluate radiomics for liver cancer tumors from DICOM segmentation masks. Using SimpleITK, PyRadiomics and PyDicom.
The data is related with direct marketing campaigns (phone calls) of a Portuguese banking institution. The classification goal is to predict if the client will subscribe a term deposit.
Data preprocessing is a data mining technique that involves transforming raw data into an understandable format.
Integrate SageMaker Data Wrangler into your MLOps workflows with Amazon SageMaker Pipelines, AWS Step Functions, and Amazon Managed Workflow for Apache Airflow (MWAA)
A python script to convert and down-sample mesh data into pointclouds using FPS algorithm.
BIOBOT: A Fall Detection System (FDS) using Artificial Intelligence
general-purpose fast, stateless, and deterministic feature extractor written in golang for use in machine learning
Developing self learning robot
Documenting the data cleaning process on a bank statement dataset using the python libraries, NumPy and Pandas.
Image classification svm with simple neural network.
🐍 Mental Maps Related to Contents in Data Science 🐍
Forecast Apple stock prices using Python, machine learning, and time series analysis. Compare performance of four models for comprehensive analysis and prediction.
This Dataiku DSS plugin provides visual recipes to perform resampling, windowing, interval extraction, extrema extraction, and decomposition on time series data.
Performed data pre-processing, optimized data warehousing, applied statistics and machine learning, and used Power BI for insightful visualizations to support informed decisions
🚀SpAnnor annotator for Named Entity Recognition easy to use tool. The annotator allows users to quickly assign custom labels to one or more entities in the text. Easy to setup for Data Training for SpaCy 🔥.