ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2006.01225
21
14

Streaming Coresets for Symmetric Tensor Factorization

1 June 2020
Rachit Chhaya
Jayesh Choudhari
A. Dasgupta
Supratim Shit
ArXivPDFHTML
Abstract

Factorizing tensors has recently become an important optimization module in a number of machine learning pipelines, especially in latent variable models. We show how to do this efficiently in the streaming setting. Given a set of nnn vectors, each in Rd\mathbb{R}^dRd, we present algorithms to select a sublinear number of these vectors as coreset, while guaranteeing that the CP decomposition of the ppp-moment tensor of the coreset approximates the corresponding decomposition of the ppp-moment tensor computed from the full data. We introduce two novel algorithmic techniques: online filtering and kernelization. Using these two, we present six algorithms that achieve different tradeoffs of coreset size, update time and working space, beating or matching various state of the art algorithms. In the case of matrices (222-ordered tensor), our online row sampling algorithm guarantees (1±ϵ)(1 \pm \epsilon)(1±ϵ) relative error spectral approximation. We show applications of our algorithms in learning single topic modeling.

View on arXiv
Comments on this paper