HDI-Project / FeatureHub

A collaborative feature engineering system built on JupyterHub

Home Page:https://hdi-project.github.io/FeatureHub

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

“SDV” An open source project from Data to AI Lab at MIT.

FeatureHub

FeatureHub is a web application built on top of JupyterHub and an accompanying Python package that together facilitate collaborative data science efforts as well as data collection experiments on those efforts.

The FeatureHub Python package is used to administer a data science collaboration. It allows users to "register" features in a database backend and discover features written by other users. These features are automatically scored by the system so that users can see realtime feedback on the quality of their features.

The FeatureHub app includes a JupyterHub server, containerized Jupyter notebooks for each user with the FeatureHub package installed, a database backend to store the features, and data- and user-management tools.

Citing FeatureHub

Please consider referencing our paper about FeatureHub:

@inproceedings{smith2017featurehub,
  title={FeatureHub: Towards collaborative data science},
  author={Smith, Micah J and Wedge, Roy and Veeramachaneni, Kalyan},
  booktitle={2017 IEEE International Conference on Data Science and Advanced Analytics (DSAA)},
  pages={590--600},
  year={2017},
  organization={IEEE}
}

About

A collaborative feature engineering system built on JupyterHub

https://hdi-project.github.io/FeatureHub

License:MIT License


Languages

Language:Python 85.2%Language:Jupyter Notebook 9.3%Language:Shell 3.2%Language:Makefile 2.2%