Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2008.03959
Cited By
Lenient Regret for Multi-Armed Bandits
10 August 2020
Nadav Merlis
Shie Mannor
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Lenient Regret for Multi-Armed Bandits"
2 / 2 papers shown
Title
Gap-Dependent Unsupervised Exploration for Reinforcement Learning
Jingfeng Wu
Vladimir Braverman
Lin F. Yang
33
12
0
11 Aug 2021
Bounded regret in stochastic multi-armed bandits
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
71
91
0
06 Feb 2013
1