5hirish / dabl

Data Analysis Baseline Library

Home Page: https://amueller.github.io/dabl/

dabl

The data analysis baseline library.

  • "Mr Sanchez, are you a data scientist?"
  • "I dabl, Mr president."

Warning

This is pre-alpha software and is still very much in flux.

Current scope and upcoming features

This library is still under active development. The current code focuses mostly on exploratory visualization and preprocessing. There are also drop-in replacements for GridSearchCV and RandomizedSearchCV that use successive halving. The next development step will be adding portfolios in the style of POSH auto-sklearn to find strong models quickly. In essence, that boils down to a quick search over different gradient boosting models, other tree ensembles, and potentially kernel methods.
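For readers unfamiliar with successive halving, the idea can be sketched in a few lines of plain Python. This is a minimal illustration under stated assumptions, not dabl's implementation: the function `successive_halving`, its parameters `min_budget` and `eta`, and the toy scoring function `toy_evaluate` are all hypothetical names invented for this example. Each round scores every surviving candidate on a small budget, keeps the top `1/eta` of them, and multiplies the budget by `eta`.

```python
import random


def successive_halving(candidates, evaluate, min_budget=1, eta=3):
    """Return the surviving candidate and a per-round history.

    candidates: list of hyperparameter settings (any objects)
    evaluate:   callable (candidate, budget) -> score, higher is better
    min_budget: resources given to each candidate in the first round
    eta:        keep the top 1/eta candidates, then multiply the budget by eta
    """
    survivors = list(candidates)
    budget = min_budget
    history = []  # (budget, candidates evaluated, candidates kept)
    while len(survivors) > 1:
        # Rank all survivors using the current (cheap) budget.
        ranked = sorted(survivors, key=lambda c: evaluate(c, budget), reverse=True)
        keep = max(1, len(ranked) // eta)
        history.append((budget, len(survivors), keep))
        survivors = ranked[:keep]
        budget *= eta  # the remaining candidates get more resources
    return survivors[0], history


# Hypothetical stand-in for cross-validated model scoring: the true optimum is
# at 0.3, and the measurement noise shrinks as the budget grows.
rng = random.Random(0)


def toy_evaluate(candidate, budget):
    noise = rng.uniform(-0.01, 0.01) / budget
    return -abs(candidate - 0.3) + noise


candidates = [i / 10 for i in range(9)]  # 0.0, 0.1, ..., 0.8
best, history = successive_halving(candidates, toy_evaluate)
print(best)
print(history)
```

In a real search, `evaluate` would fit a model on a subsample whose size is given by the budget and return a validation score, so most of the compute is spent only on the candidates that looked promising on small samples.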

Stay Tuned!

License: BSD 3-Clause "New" or "Revised" License


Languages

  • Jupyter Notebook 90.7%
  • Python 9.0%
  • Batchfile 0.2%
  • Shell 0.2%