ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.11845
  4. Cited By
An Information-Theoretic Analysis for Thompson Sampling with Many
  Actions

An Information-Theoretic Analysis for Thompson Sampling with Many Actions

30 May 2018
Shi Dong
Benjamin Van Roy
ArXivPDFHTML

Papers citing "An Information-Theoretic Analysis for Thompson Sampling with Many Actions"

19 / 19 papers shown
Title
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces
An Information-Theoretic Analysis of Thompson Sampling with Infinite Action Spaces
Amaury Gouverneur
Borja Rodríguez Gálvez
T. Oechtering
Mikael Skoglund
56
0
0
04 Feb 2025
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
The Power of Active Multi-Task Learning in Reinforcement Learning from Human Feedback
Ruitao Chen
Liwei Wang
75
1
0
18 May 2024
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Geometry-Aware Approaches for Balancing Performance and Theoretical Guarantees in Linear Bandits
Yuwei Luo
Mohsen Bayati
23
1
0
26 Jun 2023
Incentivizing Exploration with Linear Contexts and Combinatorial Actions
Incentivizing Exploration with Linear Contexts and Combinatorial Actions
Mark Sellke
24
3
0
03 Jun 2023
Adaptive Sampling for Discovery
Adaptive Sampling for Discovery
Ziping Xu
Eunjae Shim
Ambuj Tewari
Paul M. Zimmerman
OffRL
19
4
0
30 May 2022
Lifting the Information Ratio: An Information-Theoretic Analysis of
  Thompson Sampling for Contextual Bandits
Lifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits
Gergely Neu
Julia Olkhovskaya
Matteo Papini
Ludovic Schwartz
33
16
0
27 May 2022
Non-Stationary Bandit Learning via Predictive Sampling
Non-Stationary Bandit Learning via Predictive Sampling
Yueyang Liu
Kuang Xu
Benjamin Van Roy
24
19
0
04 May 2022
Gaussian Imagination in Bandit Learning
Gaussian Imagination in Bandit Learning
Yueyang Liu
Adithya M. Devraj
Benjamin Van Roy
Kuang Xu
34
7
0
06 Jan 2022
The Value of Information When Deciding What to Learn
The Value of Information When Deciding What to Learn
Dilip Arumugam
Benjamin Van Roy
37
12
0
26 Oct 2021
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored
  Online Binary Classification
Apple Tasting Revisited: Bayesian Approaches to Partially Monitored Online Binary Classification
James A. Grant
David S. Leslie
44
3
0
29 Sep 2021
A Payload Optimization Method for Federated Recommender Systems
A Payload Optimization Method for Federated Recommender Systems
Farwa K. Khan
Adrian Flanagan
K. E. Tan
Z. Alamgir
Muhammad Ammad-ud-din
82
29
0
27 Jul 2021
Metalearning Linear Bandits by Prior Update
Metalearning Linear Bandits by Prior Update
Amit Peleg
Naama Pearl
Ron Meir
37
18
0
12 Jul 2021
Information Directed Sampling for Sparse Linear Bandits
Information Directed Sampling for Sparse Linear Bandits
Botao Hao
Tor Lattimore
Wei Deng
25
19
0
29 May 2021
UCB-based Algorithms for Multinomial Logistic Regression Bandits
UCB-based Algorithms for Multinomial Logistic Regression Bandits
Sanae Amani
Christos Thrampoulidis
34
10
0
21 Mar 2021
Reinforcement Learning, Bit by Bit
Reinforcement Learning, Bit by Bit
Xiuyuan Lu
Benjamin Van Roy
Vikranth Dwaracherla
M. Ibrahimi
Ian Osband
Zheng Wen
30
70
0
06 Mar 2021
The Elliptical Potential Lemma for General Distributions with an
  Application to Linear Thompson Sampling
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling
N. Hamidi
Mohsen Bayati
14
1
0
16 Feb 2021
Improved Optimistic Algorithms for Logistic Bandits
Improved Optimistic Algorithms for Logistic Bandits
Louis Faury
Marc Abeille
Clément Calauzènes
Olivier Fercoq
20
85
0
18 Feb 2020
Safe Linear Thompson Sampling with Side Information
Safe Linear Thompson Sampling with Side Information
Ahmadreza Moradipari
Sanae Amani
M. Alizadeh
Christos Thrampoulidis
27
42
0
06 Nov 2019
Connections Between Mirror Descent, Thompson Sampling and the
  Information Ratio
Connections Between Mirror Descent, Thompson Sampling and the Information Ratio
Julian Zimmert
Tor Lattimore
22
34
0
28 May 2019
1