51
0

Fast Heavy Inner Product Identification Between Weights and Inputs in Neural Network Training

Abstract

In this paper, we consider a heavy inner product identification problem, which generalizes the Light Bulb problem~(\cite{prr89}): Given two sets A{1,+1}dA \subset \{-1,+1\}^d and B{1,+1}dB \subset \{-1,+1\}^d with A=B=n|A|=|B| = n, if there are exact kk pairs whose inner product passes a certain threshold, i.e., {(a1,b1),,(ak,bk)}A×B\{(a_1, b_1), \cdots, (a_k, b_k)\} \subset A \times B such that i[k],ai,biρd\forall i \in [k], \langle a_i,b_i \rangle \geq \rho \cdot d, for a threshold ρ(0,1)\rho \in (0,1), the goal is to identify those kk heavy inner products. We provide an algorithm that runs in O(n2ω/3+o(1))O(n^{2 \omega / 3+ o(1)}) time to find the kk inner product pairs that surpass ρd\rho \cdot d threshold with high probability, where ω\omega is the current matrix multiplication exponent. By solving this problem, our method speed up the training of neural networks with ReLU activation function.

View on arXiv
Comments on this paper

We use cookies and other tracking technologies to improve your browsing experience on our website, to show you personalized content and targeted ads, to analyze our website traffic, and to understand where our visitors are coming from. See our policy.