ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1909.13384
76
45

Optimal Sketching for Kronecker Product Regression and Low Rank Approximation

29 September 2019
H. Diao
Rajesh Jayaram
Zhao Song
Wen Sun
David P. Woodruff
ArXiv (abs)PDFHTML
Abstract

We study the Kronecker product regression problem, in which the design matrix is a Kronecker product of two or more matrices. Given Ai∈Rni×diA_i \in \mathbb{R}^{n_i \times d_i}Ai​∈Rni​×di​ for i=1,2,…,qi=1,2,\dots,qi=1,2,…,q where ni≫din_i \gg d_ini​≫di​ for each iii, and b∈Rn1n2⋯nqb \in \mathbb{R}^{n_1 n_2 \cdots n_q}b∈Rn1​n2​⋯nq​, let A=A1⊗A2⊗⋯⊗Aq\mathcal{A} = A_1 \otimes A_2 \otimes \cdots \otimes A_qA=A1​⊗A2​⊗⋯⊗Aq​. Then for p∈[1,2]p \in [1,2]p∈[1,2], the goal is to find x∈Rd1⋯dqx \in \mathbb{R}^{d_1 \cdots d_q}x∈Rd1​⋯dq​ that approximately minimizes ∥Ax−b∥p\|\mathcal{A}x - b\|_p∥Ax−b∥p​. Recently, Diao, Song, Sun, and Woodruff (AISTATS, 2018) gave an algorithm which is faster than forming the Kronecker product A\mathcal{A}A Specifically, for p=2p=2p=2 their running time is O(∑i=1qnnz(Ai)+nnz(b))O(\sum_{i=1}^q \text{nnz}(A_i) + \text{nnz}(b))O(∑i=1q​nnz(Ai​)+nnz(b)), where nnz(Ai)(A_i)(Ai​) is the number of non-zero entries in AiA_iAi​. Note that nnz(b)(b)(b) can be as large as n1⋯nqn_1 \cdots n_qn1​⋯nq​. For p=1,p=1,p=1, q=2q=2q=2 and n1=n2n_1 = n_2n1​=n2​, they achieve a worse bound of O(n13/2poly(d1d2)+nnz(b))O(n_1^{3/2} \text{poly}(d_1d_2) + \text{nnz}(b))O(n13/2​poly(d1​d2​)+nnz(b)). In this work, we provide significantly faster algorithms. For p=2p=2p=2, our running time is O(∑i=1qnnz(Ai))O(\sum_{i=1}^q \text{nnz}(A_i) )O(∑i=1q​nnz(Ai​)), which has no dependence on nnz(b)(b)(b). For p<2p<2p<2, our running time is O(∑i=1qnnz(Ai)+nnz(b))O(\sum_{i=1}^q \text{nnz}(A_i) + \text{nnz}(b))O(∑i=1q​nnz(Ai​)+nnz(b)), which matches the prior best running time for p=2p=2p=2. We also consider the related all-pairs regression problem, where given A∈Rn×d,b∈RnA \in \mathbb{R}^{n \times d}, b \in \mathbb{R}^nA∈Rn×d,b∈Rn, we want to solve min⁡x∥Aˉx−bˉ∥p\min_{x} \|\bar{A}x - \bar{b}\|_pminx​∥Aˉx−bˉ∥p​, where Aˉ∈Rn2×d,bˉ∈Rn2\bar{A} \in \mathbb{R}^{n^2 \times d}, \bar{b} \in \mathbb{R}^{n^2}Aˉ∈Rn2×d,bˉ∈Rn2 consist of all pairwise differences of the rows of A,bA,bA,b. We give an O(nnz(A))O(\text{nnz}(A))O(nnz(A)) time algorithm for p∈[1,2]p \in[1,2]p∈[1,2], improving the Ω(n2)\Omega(n^2)Ω(n2) time needed to form Aˉ\bar{A}Aˉ. Finally, we initiate the study of Kronecker product low rank and low ttt-rank approximation. For input A\mathcal{A}A as above, we give O(∑i=1qnnz(Ai))O(\sum_{i=1}^q \text{nnz}(A_i))O(∑i=1q​nnz(Ai​)) time algorithms, which is much faster than computing A\mathcal{A}A.

View on arXiv
Comments on this paper