Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1805.11845
Cited By
An Information-Theoretic Analysis for Thompson Sampling with Many Actions
30 May 2018
Shi Dong
Benjamin Van Roy
Re-assign community
ArXiv
PDF
HTML
Papers citing
"An Information-Theoretic Analysis for Thompson Sampling with Many Actions"
19 / 19 papers shown
Title
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces
Amaury Gouverneur
Borja Rodríguez Gálvez
T. Oechtering
Mikael Skoglund
56
0
0
04 Feb 2025
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
Ruitao Chen
Liwei Wang
75
1
0
18 May 2024
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
23
1
0
26 Jun 2023
Incentivizing Exploration with Linear Contexts and Combinatorial Actions
Mark Sellke
24
3
0
03 Jun 2023
Adaptive Sampling for Discovery
Ziping Xu
Eunjae Shim
Ambuj Tewari
Paul M. Zimmerman
OffRL
19
4
0
30 May 2022
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
Gergely Neu
Julia Olkhovskaya
Matteo Papini
Ludovic Schwartz
33
16
0
27 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Gaussian Imagination in Bandit Learning
Yueyang Liu
Adithya M. Devraj
Benjamin Van Roy
Kuang Xu
34
7
0
06 Jan 2022
The Value of Information When Deciding What to Learn
Dilip Arumugam
Benjamin Van Roy
37
12
0
26 Oct 2021
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification
James A. Grant
David S. Leslie
44
3
0
29 Sep 2021
A Payload Optimization Method for Federated Recommender Systems
Farwa K. Khan
Adrian Flanagan
K. E. Tan
Z. Alamgir
Muhammad Ammad-ud-din
82
29
0
27 Jul 2021
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
37
18
0
12 Jul 2021
Information Directed Sampling for Sparse Linear Bandits
Botao Hao
Tor Lattimore
Wei Deng
25
19
0
29 May 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits
Sanae Amani
Christos Thrampoulidis
34
10
0
21 Mar 2021
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling
N. Hamidi
Mohsen Bayati
14
1
0
16 Feb 2021
Improved Optimistic Algorithms for Logistic Bandits
Louis Faury
Marc Abeille
Clément Calauzènes
Olivier Fercoq
20
85
0
18 Feb 2020
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
27
42
0
06 Nov 2019
Connections Between Mirror Descent, Thompson Sampling and the Information Ratio
Julian Zimmert
Tor Lattimore
22
34
0
28 May 2019
1