36
0

Online Matrix Completion: A Collaborative Approach with Hott Items

Abstract

We investigate the low rank matrix completion problem in an online setting with M{M} users, N{N} items, T{T} rounds, and an unknown rank-rr reward matrix RRM×N{R}\in \mathbb{R}^{{M}\times {N}}. This problem has been well-studied in the literature and has several applications in practice. In each round, we recommend S{S} carefully chosen distinct items to every user and observe noisy rewards. In the regime where M,N>>T{M},{N} >> {T}, we propose two distinct computationally efficient algorithms for recommending items to users and analyze them under the benign \emph{hott items} assumption.1) First, for S=1{S}=1, under additional incoherence/smoothness assumptions on R{R}, we propose the phased algorithm \textsc{PhasedClusterElim}. Our algorithm obtains a near-optimal per-user regret of O~(NM1(Δ1+Δhott2))\tilde{O}({N}{M}^{-1}(\Delta^{-1}+\Delta_{{hott}}^{-2})) where Δhott,Δ\Delta_{{hott}},\Delta are problem-dependent gap parameters with Δhott>>Δ\Delta_{{hott}} >> \Delta almost always. 2) Second, we consider a simplified setting with S=r{S}=r where we make significantly milder assumptions on R{R}. Here, we introduce another phased algorithm, \textsc{DeterminantElim}, to derive a regret guarantee of O~(NM1/rΔdet1))\widetilde{O}({N}{M}^{-1/r}\Delta_{det}^{-1})) where Δdet\Delta_{{det}} is another problem-dependent gap. Both algorithms crucially use collaboration among users to jointly eliminate sub-optimal items for groups of users successively in phases, but with distinctive and novel approaches.

View on arXiv
Comments on this paper