Giga-scale Kernel Matrix Vector Multiplication on GPU

2 February 2022
Robert Hu
Siu Lun Chau
Dino Sejdinovic
Joan Alexis Glaunès
Abstract

Kernel matrix-vector multiplication (KMVM) is a foundational operation in machine learning and scientific computing. However, as KMVM tends to scale quadratically in both memory and time, applications are often limited by these computational constraints. In this paper, we propose a novel approximation procedure coined \textit{Faster-Fast and Free Memory Method} (F$^3$M) to address these scaling issues of KMVM for tall ($10^8 \sim 10^9$) and skinny ($D \leq 7$) data. Extensive experiments demonstrate that F$^3$M has empirical \emph{linear time and memory} complexity with a relative error of order $10^{-3}$ and can compute a full KMVM for a billion points \emph{in under a minute} on a high-end GPU, leading to a significant speed-up in comparison to existing CPU methods. We demonstrate the utility of our procedure by applying it as a drop-in for the state-of-the-art GPU-based linear solver FALKON, \emph{improving speed 1.5-5.5 times} at the cost of $<1\%$ drop in accuracy. We further demonstrate competitive results on \emph{Gaussian Process regression} coupled with significant speedups on a variety of real-world datasets.
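For reference, exact KMVM with a Gaussian kernel computes $(Kb)_i = \sum_j \exp(-\|x_i - x_j\|^2 / 2\sigma^2)\, b_j$, which is quadratic in time and, if $K$ is materialized, quadratic in memory as well. The sketch below (plain NumPy, not the paper's F$^3$M procedure; the names kmvm_chunked, sigma, and chunk are illustrative) shows the standard chunked exact evaluation that keeps memory linear while time stays quadratic, i.e. the baseline that an approximation like F$^3$M improves on.

import numpy as np

def kmvm_chunked(X, b, sigma=1.0, chunk=1024):
    # Exact Gaussian-kernel matrix-vector product K @ b, computed one
    # block of rows at a time: O(N^2) time, but only O(N * chunk) peak
    # memory, since the full N x N kernel matrix is never materialized.
    N = X.shape[0]
    out = np.empty(N)
    sq = np.sum(X**2, axis=1)  # squared norms, reused for every chunk
    for start in range(0, N, chunk):
        rows = slice(start, min(start + chunk, N))
        # pairwise squared distances between this row block and all points
        d2 = sq[rows, None] - 2.0 * X[rows] @ X.T + sq[None, :]
        d2 = np.maximum(d2, 0.0)  # guard against negative round-off
        out[rows] = np.exp(-d2 / (2.0 * sigma**2)) @ b
    return out

# Usage on "tall and skinny" data (here 10^5 points in D=3):
rng = np.random.default_rng(0)
X = rng.standard_normal((100_000, 3))
b = rng.standard_normal(100_000)
y = kmvm_chunked(X, b)

Even with chunking, a billion points would require on the order of $10^{18}$ kernel evaluations, which is why an approximation with controlled relative error (here, of order $10^{-3}$) is needed to reach the under-a-minute figure quoted above.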

@article{hu2022_2202.01085,
  title={Giga-scale Kernel Matrix Vector Multiplication on GPU},
  author={Robert Hu and Siu Lun Chau and Dino Sejdinovic and Joan Alexis Glaunès},
  journal={arXiv preprint arXiv:2202.01085},
  year={2022}
}