An Information-Theoretic Analysis for Thompson Sampling with Many Actions

30 May 2018

Papers citing "An Information-Theoretic Analysis for Thompson Sampling with Many Actions"

19 / 19 papers shown

Title
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces Amaury Gouverneur Borja Rodríguez Gálvez T. Oechtering Mikael Skoglund 56 0 0 04 Feb 2025
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback Ruitao Chen Liwei Wang 75 1 0 18 May 2024
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits Yuwei Luo Mohsen Bayati 23 1 0 26 Jun 2023
Incentivizing Exploration with Linear Contexts and Combinatorial Actions Mark Sellke 24 3 0 03 Jun 2023
Adaptive Sampling for Discovery Ziping Xu Eunjae Shim Ambuj Tewari Paul M. Zimmerman OffRL 19 4 0 30 May 2022
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits Gergely Neu Julia Olkhovskaya Matteo Papini Ludovic Schwartz 33 16 0 27 May 2022
Non-Stationary Bandit Learning via Predictive Sampling Yueyang Liu Kuang Xu Benjamin Van Roy 24 19 0 04 May 2022
Gaussian Imagination in Bandit Learning Yueyang Liu Adithya M. Devraj Benjamin Van Roy Kuang Xu 34 7 0 06 Jan 2022
The Value of Information When Deciding What to Learn Dilip Arumugam Benjamin Van Roy 37 12 0 26 Oct 2021
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification James A. Grant David S. Leslie 44 3 0 29 Sep 2021
A Payload Optimization Method for Federated Recommender Systems Farwa K. Khan Adrian Flanagan K. E. Tan Z. Alamgir Muhammad Ammad-ud-din 82 29 0 27 Jul 2021
Metalearning Linear Bandits by Prior Update Amit Peleg Naama Pearl Ron Meir 37 18 0 12 Jul 2021
Information Directed Sampling for Sparse Linear Bandits Botao Hao Tor Lattimore Wei Deng 25 19 0 29 May 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits Sanae Amani Christos Thrampoulidis 34 10 0 21 Mar 2021
Reinforcement Learning, Bit by Bit Xiuyuan Lu Benjamin Van Roy Vikranth Dwaracherla M. Ibrahimi Ian Osband Zheng Wen 30 70 0 06 Mar 2021
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling N. Hamidi Mohsen Bayati 14 1 0 16 Feb 2021
Improved Optimistic Algorithms for Logistic Bandits Louis Faury Marc Abeille Clément Calauzènes Olivier Fercoq 20 85 0 18 Feb 2020
Safe Linear Thompson Sampling with Side Information Ahmadreza Moradipari Sanae Amani M. Alizadeh Christos Thrampoulidis 27 42 0 06 Nov 2019
Connections Between Mirror Descent, Thompson Sampling and the Information Ratio Julian Zimmert Tor Lattimore 22 34 0 28 May 2019