Sub-quadratic Algorithms for Kernel Matrices via Kernel Density Estimation

1 December 2022
Ainesh Bakshi
Piotr Indyk
Praneeth Kacham
Sandeep Silwal
Samson Zhou
Abstract

Kernel matrices, as well as weighted graphs represented by them, are ubiquitous objects in machine learning, statistics and other related fields. The main drawback of using kernel methods (learning and inference using kernel matrices) is efficiency -- given $n$ input points, most kernel-based algorithms need to materialize the full $n \times n$ kernel matrix before performing any subsequent computation, thus incurring $\Omega(n^2)$ runtime. Breaking this quadratic barrier for various problems has therefore been a subject of extensive research efforts. We break the quadratic barrier and obtain \textit{subquadratic} time algorithms for several fundamental linear-algebraic and graph processing primitives, including approximating the top eigenvalue and eigenvector, spectral sparsification, solving linear systems, local clustering, low-rank approximation, arboricity estimation and counting weighted triangles. We build on the recent Kernel Density Estimation framework, which (after preprocessing in time subquadratic in $n$) can return estimates of row/column sums of the kernel matrix. In particular, we develop efficient reductions from \textit{weighted vertex} and \textit{weighted edge sampling} on kernel graphs, \textit{simulating random walks} on kernel graphs, and \textit{importance sampling} on matrices to Kernel Density Estimation, and show that we can generate samples from these distributions in \textit{sublinear} (in the support of the distribution) time. Our reductions are the central ingredient in each of our applications and we believe they may be of independent interest. We empirically demonstrate the efficacy of our algorithms on low-rank approximation (LRA) and spectral sparsification, where we observe a \textbf{9x} decrease in the number of kernel evaluations over baselines for LRA and a \textbf{41x} reduction in the graph size for spectral sparsification.
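As a rough illustration of the sampling primitives the abstract describes (not code from the paper), the sketch below implements weighted vertex and weighted edge sampling on a Gaussian kernel graph. The `kde_row_sums` oracle is a hypothetical stand-in that computes exact row sums in quadratic time; in the paper's framework, a KDE data structure would answer these row-sum queries in sublinear time after subquadratic preprocessing. The Gaussian kernel choice, all function names, and the rejection step are assumptions made for illustration.

```python
import numpy as np

def gaussian_kernel(x, y, bandwidth=1.0):
    """Gaussian kernel k(x, y) = exp(-||x - y||^2 / bandwidth); values lie in (0, 1]."""
    return np.exp(-np.sum((x - y) ** 2) / bandwidth)

def kde_row_sums(X, bandwidth=1.0):
    """Hypothetical stand-in for a KDE oracle: returns the row sums of the
    kernel matrix K, i.e. the weighted degrees d_i = sum_j K_ij of the kernel
    graph. A real KDE data structure would estimate these in sublinear time;
    here we compute them exactly in O(n^2) for clarity."""
    n = len(X)
    return np.array([
        sum(gaussian_kernel(X[i], X[j], bandwidth) for j in range(n))
        for i in range(n)
    ])

def sample_weighted_vertex(rng, degrees):
    """Weighted vertex sampling: draw vertex i with probability d_i / sum_j d_j."""
    p = degrees / degrees.sum()
    return rng.choice(len(degrees), p=p)

def sample_weighted_edge(rng, X, degrees, bandwidth=1.0):
    """Weighted edge sampling: draw (i, j) with probability K_ij / sum(K).
    First pick i proportional to d_i, then rejection-sample j: propose j
    uniformly and accept with probability K_ij (valid since K_ij <= 1), so
    conditioned on acceptance P(j | i) = K_ij / d_i and overall
    P(i, j) = K_ij / sum(K)."""
    n = len(X)
    i = sample_weighted_vertex(rng, degrees)
    while True:
        j = rng.integers(n)
        if rng.random() < gaussian_kernel(X[i], X[j], bandwidth):
            return i, int(j)

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
d = kde_row_sums(X)
print(sample_weighted_vertex(rng, d))
print(sample_weighted_edge(rng, X, d))
```

Note that the conditional draw of j given i, with probability K_ij / d_i, is exactly one step of a random walk on the weighted kernel graph, so the same rejection loop also illustrates random walk simulation; the paper's contribution is performing these draws via KDE queries rather than the naive rejection shown here.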
