activation-function deep-neural-networks deep-learning machine-learning approximate-algorithms polynomial

Improved Polynomial Neural Networks with Normalised Activations

Polynomials, which are widely used to study non-linear systems, have been shown to be extremely useful in analyzing neural networks (NNs). However, the existing methods for training neural networks with polynomial activation functions (PAFs), called as PNNs, are applicable for shallow networks and give a stable performance with quadratic PAFs only. This is due to the optimization issues encountered during training PNNs. We propose a working model for PAFs using a novel normalizing transformation which alleviates the problem of training PNNs with arbitrary degree. Our PAF can be directly used to train shallow PNNs in practice for degrees as high as ten. It can also be utilized to learn multivariate sparse polynomials of small degrees. We also propose a way to train deep CNNs with PAFs which achieve performance similar to deep CNNs with standard activations. Through rigorous experimentation on multiple data sets, we show that PNNs can be effectively trained in practice. This also highlights the potential of the proposed method to support the research on using polynomials to study deep learning.

Requirements

python 2.x/3.x
tensorflow 1.8+
keras 2.2.2

About

Understanding DNN

activation-function deep-neural-networks deep-learning machine-learning approximate-algorithms polynomial

Languages

Language:Python 100.0%