Uploaded 2023/12/11

A Max-Affine Spline Perspective of Recurrent Neural Networks

Authors

Zichao Wang, Randall Balestriero, Richard G. Baraniuk

Publication date

2019

Conference

ICLR 2019

Description

We develop a framework for understanding and improving recurrent neural networks (RNNs) using max-affine spline operators (MASOs). We prove that RNNs using piecewise affine and convex nonlinearities can be written as a simple piecewise affine spline operator. The resulting representation provides several new perspectives for analyzing RNNs, three of which we study in this paper. First, we show that an RNN internally partitions the input space during training and that it builds up the partition through time. Second, we show that the affine slope parameter of an RNN corresponds to an input-specific template, from which we can interpret an RNN as performing a simple template matching (matched filtering) given the input. Third, by carefully examining the MASO RNN affine mapping, we prove that using a random initial hidden state corresponds to an explicit ℓ2 regularization of the affine parameters, which can mollify exploding gradients and improve generalization. Extensive experiments on several datasets of various modalities demonstrate and validate each of the above conclusions. In particular, using a random initial hidden state elevates simple RNNs to near state-of-the-art performers on these datasets.
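The piecewise affine claim in the description can be illustrated with a minimal sketch, assuming a single-layer ReLU RNN with the update h = max(W x + U h_prev + b, 0). The names `relu_rnn_step`, `affine_view`, and `apply_affine`, the layer sizes, and the Gaussian initial hidden state are illustrative assumptions and are not taken from the paper. The sketch checks that, for a given input, the ReLU step equals an affine map whose slope rows are selected by the active units, which is one way to read the "input-specific template" statement.

```python
import random


def relu_rnn_step(W, U, b, x, h_prev):
    """One ReLU RNN step; returns the new hidden state and the pre-activation."""
    pre = [
        sum(w * xi for w, xi in zip(W[k], x))
        + sum(u * hi for u, hi in zip(U[k], h_prev))
        + b[k]
        for k in range(len(b))
    ]
    return [max(p, 0.0) for p in pre], pre


def affine_view(W, U, b, pre):
    """Affine parameters selected by the region that the pre-activation falls in.

    Rows belonging to inactive units (pre-activation <= 0) are zeroed out.
    """
    mask = [1.0 if p > 0 else 0.0 for p in pre]
    A_x = [[m * w for w in row] for m, row in zip(mask, W)]
    A_h = [[m * u for u in row] for m, row in zip(mask, U)]
    c = [m * bk for m, bk in zip(mask, b)]
    return A_x, A_h, c, mask


def apply_affine(A_x, A_h, c, x, h_prev):
    """Evaluate the affine map A_x x + A_h h_prev + c."""
    return [
        sum(a * xi for a, xi in zip(A_x[k], x))
        + sum(a * hi for a, hi in zip(A_h[k], h_prev))
        + c[k]
        for k in range(len(c))
    ]


rng = random.Random(0)
d_in, d_h = 3, 4  # hypothetical sizes, chosen only for the demo

W = [[rng.gauss(0, 1) for _ in range(d_in)] for _ in range(d_h)]
U = [[rng.gauss(0, 1) for _ in range(d_h)] for _ in range(d_h)]
b = [rng.gauss(0, 1) for _ in range(d_h)]
x = [rng.gauss(0, 1) for _ in range(d_in)]

# Random initial hidden state; the standard deviation 0.1 is an assumption.
h0 = [rng.gauss(0, 0.1) for _ in range(d_h)]

h1, pre = relu_rnn_step(W, U, b, x, h0)
A_x, A_h, c, mask = affine_view(W, U, b, pre)
h1_affine = apply_affine(A_x, A_h, c, x, h0)

# Largest disagreement between the ReLU step and its affine view.
max_gap = max(abs(p - q) for p, q in zip(h1, h1_affine))
print("active units:", mask)
print("max gap between ReLU step and affine view:", max_gap)
```

The mask depends on the input and the previous hidden state, so different inputs select different affine maps. The set of inputs sharing one mask forms one region of the partition the description refers to.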

Total citations

Cited by 20

2019: 1, 2020: 3, 2021: 5, 2022: 6, 2023: 4, 2024: 1

Scholar articles

A max-affine spline perspective of recurrent neural networks

Z Wang, R Balestriero, R Baraniuk - International Conference on Learning Representations, 2018