genfifth / cvopt

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

cvopt -to simplify Data Science-

cvopt (cross validation optimizer) is python module for machine learning's parameter search and feature selection. To simplify modeling, in cvopt, log management and visualization are integrated and the API like scikit-learn is provided.

readme_00

In Data Science modeling, sometimes would like to ...

  • Use various search algorithms on the same interface.
  • Optimize parameters and feature selections simultaneously.
  • Integrate log management and its visualization into search API.

To make these simpler, cvopt was created.

Features

  • API like scikit-learn.
    • Support Algorithm:
      • Sequential Model Based Global Optimization (Hyperopt)
      • Bayesian Optimization (GpyOpt)
      • Genetic Algorithm
      • Random Search
  • Optimization of parameters and feature selections.
  • Integration of log management and visualization.

Installation

$ pip install Gpy
$ pip install cvopt

Requires:

  • Python3
  • NumPy
  • pandas
  • scikit-learn
  • Hyperopt
  • Gpy
  • GpyOpt
  • bokeh

Quick start -search can be written in 5 lines.-

param_distributions = {"penalty": search_category(['l1', 'l2']), "C": search_numeric(0, 3, "float"), 
                       "tol" : search_numeric(0, 4, "float"),  "class_weight" : search_category([None, "balanced"])}
feature_groups = np.random.randint(0, 5, Xtrain.shape[1]) 
opt = SimpleoptCV(estimator=LogisticRegression(), param_distributions=param_distributions)
opt.fit(Xtrain, ytrain, feature_groups=feature_groups)

Documents

Basic usage(en)/(jp)

Keras sample

API Reference

Changelog

Log

About

Machine learning's parameter search and feature selection module which is integrated log management and visualization.

License:BSD 2-Clause "Simplified" License


Languages

Language:Python 72.8%Language:Jupyter Notebook 25.9%Language:Makefile 1.3%