elhachimi-ch / dst

Data Science Toolkit

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Data Science Toolkit

readthedocs License Reproducible Capsule

Data Science Toolkit (DST) is a Python library that helps implement data science related project with ease.

Simple Demo

from data_science_toolkit.dataframe import DataFrame
from data_science_toolkit.model import Model

data = DataFrame()
data.load_dataset('iris')
y = data.get_column('target')
data.drop_column('target')

# decision tree model
model = Model(data_x=data.get_dataframe(), data_y=y, model_type='dt', training_percent=0.8)

# train the model
model.train()

# get all classification evaluation metrics
model.report()

#get the cross validation
model.cross_validation(5)

Documentation

More information can be found on the DST documentation site.

Contributing

Contrubution and suggestions are welcome via GitHub Pull Requests.

Maintainership

We're actively enhacing the repo with new algorithms.

How to cite

About

Data Science Toolkit

License:MIT License


Languages

Language:Python 99.7%Language:Jupyter Notebook 0.3%