ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2209.03997
40
9

Online Low Rank Matrix Completion

8 September 2022
Prateek Jain
S. Pal
ArXivPDFHTML
Abstract

We study the problem of {\em online} low-rank matrix completion with M\mathsf{M}M users, N\mathsf{N}N items and T\mathsf{T}T rounds. In each round, the algorithm recommends one item per user, for which it gets a (noisy) reward sampled from a low-rank user-item preference matrix. The goal is to design a method with sub-linear regret (in T\mathsf{T}T) and nearly optimal dependence on M\mathsf{M}M and N\mathsf{N}N. The problem can be easily mapped to the standard multi-armed bandit problem where each item is an {\em independent} arm, but that leads to poor regret as the correlation between arms and users is not exploited. On the other hand, exploiting the low-rank structure of reward matrix is challenging due to non-convexity of the low-rank manifold. We first demonstrate that the low-rank structure can be exploited using a simple explore-then-commit (ETC) approach that ensures a regret of O(polylog(M+N)T2/3)O(\mathsf{polylog} (\mathsf{M}+\mathsf{N}) \mathsf{T}^{2/3})O(polylog(M+N)T2/3). That is, roughly only polylog(M+N)\mathsf{polylog} (\mathsf{M}+\mathsf{N})polylog(M+N) item recommendations are required per user to get a non-trivial solution. We then improve our result for the rank-111 setting which in itself is quite challenging and encapsulates some of the key issues. Here, we propose \textsc{OCTAL} (Online Collaborative filTering using iterAtive user cLustering) that guarantees nearly optimal regret of O(polylog(M+N)T1/2)O(\mathsf{polylog} (\mathsf{M}+\mathsf{N}) \mathsf{T}^{1/2})O(polylog(M+N)T1/2). OCTAL is based on a novel technique of clustering users that allows iterative elimination of items and leads to a nearly optimal minimax rate.

View on arXiv
Comments on this paper