1302.1611
Cited By
Bounded regret in stochastic multi-armed bandits
6 February 2013
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
Papers citing
"Bounded regret in stochastic multi-armed bandits"
17 / 17 papers shown
Title
Communication-Efficient Collaborative Regret Minimization in Multi-Armed Bandits
Nikolai Karpov
Qin Zhang
21
1
0
26 Jan 2023
Anonymous Bandits for Multi-User Systems
Hossein Esfandiari
Vahab Mirrokni
Jon Schneider
PICV
18
0
0
21 Oct 2022
Collaborative Linear Bandits with Adversarial Agents: Near-Optimal Regret Bounds
A. Mitra
Arman Adibi
George J. Pappas
Hamed Hassani
44
6
0
06 Jun 2022
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli
Matteo Papini
P. D'Oro
Marcello Restelli
OffRL
27
10
0
15 Dec 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
10
10
0
15 Jun 2020
Categorized Bandits
Matthieu Jedor
Jonathan Louëdec
Vianney Perchet
12
11
0
04 May 2020
Bounded Regret for Finitely Parameterized Multi-Armed Bandits
Kishan Panaganti
D. Kalathil
8
1
0
03 Mar 2020
Online Allocation and Pricing: Constant Regret via Bellman Inequalities
Alberto Vera
Siddhartha Banerjee
I. Gurvich
OffRL
14
48
0
14 Jun 2019
Batched Multi-armed Bandits Problem
Zijun Gao
Yanjun Han
Zhimei Ren
Zhengqing Zhou
8
138
0
03 Apr 2019
Beating Stochastic and Adversarial Semi-bandits Optimally and Simultaneously
Julian Zimmert
Haipeng Luo
Chen-Yu Wei
11
79
0
25 Jan 2019
Regional Multi-Armed Bandits
Zhiyang Wang
Ruida Zhou
Cong Shen
13
18
0
22 Feb 2018
On Explore-Then-Commit Strategies
Aurélien Garivier
E. Kaufmann
Tor Lattimore
24
104
0
29 May 2016
Refined Lower Bounds for Adversarial Bandits
Sébastien Gerchinovitz
Tor Lattimore
AAML
17
58
0
24 May 2016
Risk-Averse Multi-Armed Bandit Problems under Mean-Variance Measure
Sattar Vakili
Qing Zhao
13
88
0
18 Apr 2016
Batched bandit problems
Vianney Perchet
Philippe Rigollet
Sylvain Chassang
E. Snowberg
OffRL
40
199
0
02 May 2015
Learning Nash Equilibria in Congestion Games
Walid Krichene
Benjamin Drighès
Alexandre M. Bayen
30
9
0
31 Jul 2014
On the Complexity of Best Arm Identification in Multi-Armed Bandit Models
E. Kaufmann
Olivier Cappé
Aurélien Garivier
48
1,013
0
16 Jul 2014