Bridging trees and neural networks.
Code here converts sklearn tree -> pytorch DNN
rf_to_dnn_ex.ipynb shows a simple example of converting a random forest to a DNN.
Both random forests and DNNs are very strong predictive models. This cool recent paper, titled "Neural Random Forests", gives a simple algorithm for exactly rewriting any random forest as a sparse neural network. This could be useful for combining the inductive biases of both, among other interesting things.
How does it work?
The idea is to rewrite each tree as a neural network with 3 layers:
- The first layer identifies whether a point is on the left or right side of each split (outputs -1 or 1):
$out_{split} = \text{sign}(in - thresh)$ - this can be made faster by doing indexing rather than a matrix multiply (maybe a sparse tensor will be fast enough)
- The second layer determines whether a point is in a leaf or not (outputs 0/1):
$out_{leaf} = (\sum w \cdot in) == depth(leaf)$
- The final layer simply multiplies the vector of 0s/1s by the leaf values:
$out_{pred} = \sum_{leaf} out_{leaf} \cdot val_{leaf}$ - the 0/1 outputs help keep it sparse (and simple 😄)
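
The three layers above can be sketched in numpy for a hypothetical hand-built depth-2 tree (this is an illustration of the idea, not the repo's actual PyTorch implementation; the tree, thresholds, and leaf values are made up):

```python
import numpy as np

# Hypothetical toy tree (hand-built, not from sklearn):
#   node0: is x[0] < 0.5 ?  yes -> leaf L1 (value 1.0, depth 1)
#                           no  -> node1: is x[1] < 0.3 ?
#                                    yes -> leaf L2 (value 2.0, depth 2)
#                                    no  -> leaf L3 (value 3.0, depth 2)

def forward(x):
    # Layer 1: one unit per split, outputs -1 (left of split) or +1 (right)
    s0 = np.sign(x[0] - 0.5)
    s1 = np.sign(x[1] - 0.3)

    # Layer 2: one unit per leaf; weights are -1/+1 for a left/right turn
    # at each split on the path to that leaf. The unit fires (1) iff the
    # weighted sum equals the leaf's depth, i.e. every turn matched.
    leaf = np.array([
        (-s0      == 1),  # L1: left at node0, depth 1
        (s0 - s1  == 2),  # L2: right at node0, left at node1, depth 2
        (s0 + s1  == 2),  # L3: right at node0, right at node1, depth 2
    ], dtype=float)

    # Layer 3: dot the 0/1 leaf indicators with the leaf values
    vals = np.array([1.0, 2.0, 3.0])
    return leaf @ vals

print(forward(np.array([0.2, 0.9])))  # falls in L1 -> 1.0
print(forward(np.array([0.9, 0.1])))  # falls in L2 -> 2.0
print(forward(np.array([0.9, 0.9])))  # falls in L3 -> 3.0
```

For a forest, each tree gets its own block of split, leaf, and output units, and the final prediction averages the per-tree outputs.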