padreati / rapaio

statistics, data mining and machine learning toolbox

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

rapaio logo

Disambiguation

  1. (Italian dictionary) Field of turnips. It is also a place where there is confusion, where tricks and sims are plotted.

  2. (Computer science) Statistics, data mining and machine learning library written in Java.

Try it online

Launch rapaio with rapaio-jupyter-kernel jupyter binder Launch rapaio with rapaio-jupyter-kernel jupyter-lab binder

Build status

build codecov.io

Documentation

Rapaio is a rich collection of data mining, statistics and machine learning tools written completely in Java. Documentation for this library is hosted as github pages. Most of the documentation is written as Jupyter notebooks and hosted at rapaio-notebooks github repository. The notebooks repository can also be spin up through binder.

The complete list of features is presented here. An incomplete list of implemented algorithms and features includes: core statistical tools, common distributions and hypothesis testing, Naive Bayes, Binary Logistic Regression, Decision Trees (regression and classification), Random Forests (regression and classification), AdaBoost, Gradient Boosting Trees (regression and classification), BinarySMO, SVM, Relevant Vector Machines (regression), Linear and Ridge Regression, PCA and KMeans. Additionally there is a fair share of graphical tools and linear algebra stuff. And the list is growing periodically.

Installation

Last published release on maven central is 6.0.0

<dependency>
    <groupId>io.github.padreati</groupId>
    <artifactId>rapaio-core</artifactId>
    <version>6.0.0</version>
</dependency>

The best way for exploration is through jupyter / jupyter-lab notebooks. This is excellent for experimenting with interactive notebooks or to document the ideas you are working on. You have to install jupyter / jupyter-lab and rapaio-jupyter-kernel kernel. For more information you can follow the instruction from Rapaio Jupyter Kernel.

%dependency /add io.github.padreati:rapaio-core:6.0.0
%dependency /resolve

Acknowledgements

Many thanks to JetBrains who provided open source licenses for their brilliant IDE a.

About

statistics, data mining and machine learning toolbox

License:Other


Languages

Language:Java 99.9%Language:R 0.1%