Bounded Regret for Finite-Armed Structured Bandits


11 November 2014
Tor Lattimore
Rémi Munos

Papers citing "Bounded Regret for Finite-Armed Structured Bandits"

19 / 19 papers shown
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
43
3
0
26 Apr 2024
Quantum contextual bandits and recommender systems for quantum data
Shrigyan Brahmachari
Josep Lumbreras
Marco Tomamichel
44
3
0
31 Jan 2023
Deep Hierarchy in Bandits
Joey Hong
Branislav Kveton
S. Katariya
Manzil Zaheer
Mohammad Ghavamzadeh
38
20
0
03 Feb 2022
Best Arm Identification under Additive Transfer Bandits
Ojash Neopane
Aaditya Ramdas
Aarti Singh
26
2
0
08 Dec 2021
Bad-Policy Density: A Measure of Reinforcement Learning Hardness
David Abel
Cameron Allen
Dilip Arumugam
D Ellis Hershkowitz
Michael L. Littman
Lawson L. S. Wong
31
2
0
07 Oct 2021
Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning
Tong Zhang
29
64
0
02 Oct 2021
TSEC: a framework for online experimentation under experimental constraints
Simon Mak
Yuanshuo Zhou
Lavonne Hoang
C. F. J. Wu
28
2
0
17 Jan 2021
Policy Optimization as Online Learning with Mediator Feedback
Alberto Maria Metelli
Matteo Papini
P. D'Oro
Marcello Restelli
OffRL
32
10
0
15 Dec 2020
Multi-Armed Bandits with Dependent Arms
Rahul Singh
Fang Liu
Yin Sun
Ness B. Shroff
29
11
0
13 Oct 2020
Effects of Model Misspecification on Bayesian Bandits: Case Studies in UX Optimization
Mack Sweeney
M. Adelsberg
Kathryn B. Laskey
C. Domeniconi
31
1
0
07 Oct 2020
Continuous-Time Multi-Armed Bandits with Controlled Restarts
Semih Cayci
A. Eryilmaz
R. Srikant
24
4
0
30 Jun 2020
Crush Optimism with Pessimism: Structured Bandits Beyond Asymptotic Optimality
Kwang-Sung Jun
Chicheng Zhang
36
10
0
15 Jun 2020
Categorized Bandits
Matthieu Jedor
Jonathan Louëdec
Vianney Perchet
30
11
0
04 May 2020
Bounded Regret for Finitely Parameterized Multi-Armed Bandits
Kishan Panaganti
D. Kalathil
26
1
0
03 Mar 2020
Multi-Armed Bandits with Correlated Arms
Samarth Gupta
Shreyas Chaudhari
Gauri Joshi
Osman Yağan
27
51
0
06 Nov 2019
Optimal Learning for Dynamic Coding in Deadline-Constrained Multi-Channel Networks
Semih Cayci
A. Eryilmaz
19
3
0
27 Nov 2018
Learning to reinforcement learn
Jane X. Wang
Z. Kurth-Nelson
Dhruva Tirumala
Hubert Soyer
Joel Z Leibo
Rémi Munos
Charles Blundell
D. Kumaran
M. Botvinick
OffRL
19
975
0
17 Nov 2016
Global Bandits
Onur Atan
Cem Tekin
Mihaela van der Schaar
39
16
0
29 Mar 2015
Bounded regret in stochastic multi-armed bandits
Sébastien Bubeck
Vianney Perchet
Philippe Rigollet
85
92
0
06 Feb 2013