crcrcry / Self-Service-Data-Preparation

Project overview and links to various resources

"Self Service" Data Preparation

Overview

It is widely cited that data analysts and data scientists spend a large fraction of their time (up to 80%) preparing and cleaning data.

At Microsoft Research, we are looking at ways to automate common data preparation tasks. The goal is to empower enterprise workers and less-technical end users (e.g., users of Excel and Power BI) to solve their data preparation challenges and improve their productivity.

Technologies developed in this project have shipped as features in Microsoft products, including Power Query (natively integrated in Excel under the “Data” tab and also available in Power BI) and Azure Machine Learning Data Prep.

List of benchmark data sets used in published work

From time to time we receive requests from researchers for the benchmark data sets used in our projects. We have compiled a list here on GitHub to facilitate future research.
