301

Analytical Solution of a Three-layer Network with a Matrix Exponential Activation Function

Main:7 Pages
1 Figures
Bibliography:1 Pages
Abstract

In practice, deeper networks tend to be more powerful than shallow ones, but this has not been understood theoretically. In this paper, we find the analytical solution of a three-layer network with a matrix exponential activation function, i.e., f(X)=W3exp(W2exp(W1X)),XCd×d f(X)=W_3\exp(W_2\exp(W_1X)), X\in \mathbb{C}^{d\times d} have analytical solutions for the equations Y1=f(X1),Y2=f(X2) Y_1=f(X_1),Y_2=f(X_2) for X1,X2,Y1,Y2X_1,X_2,Y_1,Y_2 with only invertible assumptions. Our proof shows the power of depth and the use of a non-linear activation function, since one layer network can only solve one equation,i.e.,Y=WXY=WX.

View on arXiv
Comments on this paper