ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1711.02317
  4. Cited By
Multi-Player Bandits Revisited
v1v2v3 (latest)

Multi-Player Bandits Revisited

7 November 2017
Lilian Besson
E. Kaufmann
ArXiv (abs)PDFHTML

Papers citing "Multi-Player Bandits Revisited"

10 / 10 papers shown
Title
Bandits with Movement Costs and Adaptive Pricing
Bandits with Movement Costs and Adaptive Pricing
Tomer Koren
Roi Livni
Yishay Mansour
38
20
0
24 Feb 2017
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Explore First, Exploit Next: The True Shape of Regret in Bandit Problems
Aurélien Garivier
Pierre Ménard
Gilles Stoltz
59
214
0
23 Feb 2016
Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed
  Bandit Problem with Multiple Plays
Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays
Junpei Komiyama
Junya Honda
Hiroshi Nakagawa
71
134
0
02 Jun 2015
Multi-user lax communications: a multi-armed bandit approach
Multi-user lax communications: a multi-armed bandit approach
Orly Avner
Shie Mannor
26
11
0
30 Apr 2015
Kullback-Leibler upper confidence bounds for optimal sequential
  allocation
Kullback-Leibler upper confidence bounds for optimal sequential allocation
Olivier Cappé
Aurélien Garivier
Odalric-Ambrym Maillard
Rémi Munos
Gilles Stoltz
129
395
0
03 Oct 2012
Decentralized Learning for Multi-player Multi-armed Bandits
Decentralized Learning for Multi-player Multi-armed Bandits
D. Kalathil
Naumaan Nayyar
R. Jain
98
44
0
14 Jun 2012
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
Thompson Sampling: An Asymptotically Optimal Finite Time Analysis
E. Kaufmann
N. Korda
Rémi Munos
162
588
0
18 May 2012
Distributed Algorithms for Learning and Cognitive Medium Access with
  Logarithmic Regret
Distributed Algorithms for Learning and Cognitive Medium Access with Logarithmic Regret
Anima Anandkumar
Nithin Michael
A. Tang
A. Swami
108
348
0
08 Jun 2010
A Contextual-Bandit Approach to Personalized News Article Recommendation
A Contextual-Bandit Approach to Personalized News Article Recommendation
Lihong Li
Wei Chu
John Langford
Robert Schapire
471
2,954
0
28 Feb 2010
Distributed Learning in Multi-Armed Bandit with Multiple Players
Distributed Learning in Multi-Armed Bandit with Multiple Players
Keqin Liu
Qing Zhao
97
440
0
12 Oct 2009
1