ijeffking / Machine_Learning_Problems

Testing repository for various machine learning algorithms and methods using public test data sets

Machine Learning Algorithm/Coding Test Repository

Sample test data sets are published at the UCI Machine Learning Repository.

Python Packages to install

  • A useful list of Python packages for Python coding is here. Another such list is here.
  • Standard Python packages include pandas, numpy, scipy, matplotlib, scikit-learn, seaborn
  • H2O for R and Python
    • H2O runs on the Java SDK, so Java must be installed first.
    • For the Python version, see the installation instructions here. A basic introduction to its usage is here. If the Linux system pip conflicts with Anaconda's pip and you want to install into the Anaconda environment, use /root/anaconda/bin/pip install *** instead of pip install ***.
    • Two of its most powerful tools are Deep Learning and GBM (gradient boosting machine). Its deep learning so far focuses primarily on multi-layer feedforward neural networks for predictive modeling.
  • Other useful packages include
    • seaborn for data visualization.
    • Orange for graphical modeling instead of writing source code.
    • Theano for deep learning and tensor calculation.
    • numba for JIT (just-in-time) compilation, which helps when generic Python code (especially loops) is combined with numpy.
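
The pip conflict mentioned in the H2O notes can also be avoided by calling pip through the interpreter you want to install into. The following is a minimal sketch, not part of the repository: the helper name `pip_install_command` is hypothetical, `h2o` is assumed to be the package name, and the Anaconda path is the one quoted in the list. It only builds and prints the command rather than running it.

```python
import os
import sys


def pip_install_command(package, python=sys.executable):
    """Build a pip command bound to one specific Python interpreter.

    Running pip as `<python> -m pip` installs the package for that
    interpreter, whichever `pip` happens to come first on PATH.
    """
    return [python, "-m", "pip", "install", package]


if __name__ == "__main__":
    # Show which interpreter is active, then the command that targets it.
    print("Active interpreter:", sys.executable)
    print("Install command:   ", " ".join(pip_install_command("h2o")))

    # Assumed Anaconda location, taken from the path quoted in the list.
    anaconda_python = os.path.join("/root/anaconda/bin", "python")
    print("Anaconda command:  ", " ".join(pip_install_command("h2o", anaconda_python)))
```

Passing the resulting list to `subprocess.run` would perform the install. Building the command from an explicit interpreter path keeps the system pip and Anaconda's pip from being confused.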

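As an illustration of where numba's JIT compilation tends to pay off, here is a small sketch of a loop-heavy function over a numpy array. The function name `pairwise_sq_dist_sum` and the sample data are made up for this example. The fallback decorator is an assumption added so the code still runs, uncompiled, when numba is not installed.

```python
import numpy as np

try:
    from numba import njit  # compiles the decorated function on first call
except ImportError:
    # Fallback so the sketch still runs (slowly) without numba installed.
    def njit(func):
        return func


@njit
def pairwise_sq_dist_sum(points):
    """Sum of squared Euclidean distances over all pairs of rows."""
    n, d = points.shape
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            for k in range(d):
                diff = points[i, k] - points[j, k]
                total += diff * diff
    return total


if __name__ == "__main__":
    pts = np.array([[0.0, 0.0], [3.0, 4.0], [6.0, 8.0]])
    print(pairwise_sq_dist_sum(pts))  # 25 + 100 + 25 = 150.0
```

The first call includes compilation time, so any speed-up only shows on later calls or on larger inputs.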
Languages

Jupyter Notebook 74.9%, HTML 13.6%, Python 5.5%, Java 4.6%, R 1.1%, TypeScript 0.1%, MATLAB 0.1%; Scala, Fortran, JavaScript, Shell, CSS, Batchfile, and C++ each under 0.1%.