ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2010.00827
  4. Cited By
Neural Thompson Sampling

Neural Thompson Sampling

2 October 2020
Weitong Zhang
Dongruo Zhou
Lihong Li
Quanquan Gu
ArXivPDFHTML

Papers citing "Neural Thompson Sampling"

24 / 24 papers shown
Title
Neural Logistic Bandits
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
177
0
0
04 May 2025
Online Clustering of Dueling Bandits
Online Clustering of Dueling Bandits
Zhiyong Wang
Jiahang Sun
Mingze Kong
Jize Xie
Qinghua Hu
J. C. Lui
Zhongxiang Dai
83
0
0
04 Feb 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
144
0
0
08 Nov 2024
Batched Bayesian optimization by maximizing the probability of including the optimum
Batched Bayesian optimization by maximizing the probability of including the optimum
Jenna C. Fromer
Runzhong Wang
Mrunali Manjrekar
Austin Tripp
José Miguel Hernández-Lobato
Connor W. Coley
47
0
0
08 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
37
5
0
24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using
  Normalized Weight Functions
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions
Kai Xu
Farid Tajaddodianfar
Ben Allison
21
0
0
16 Jun 2024
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
51
0
0
15 Jun 2024
VITS : Variational Inference Thompson Sampling for contextual bandits
VITS : Variational Inference Thompson Sampling for contextual bandits
Pierre Clavier
Tom Huix
Alain Durmus
27
3
0
19 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary
  Contextual Bandits
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
38
0
0
07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
42
8
0
05 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Adaptive Endpointing with Deep Contextual Multi-armed Bandits
Do June Min
A. Stolcke
A. Raju
Colin Vaz
Di He
Venkatesh Ravichandran
V. Trinh
OffRL
35
0
0
23 Mar 2023
Learning When to Use Adaptive Adversarial Image Perturbations against
  Autonomous Vehicles
Learning When to Use Adaptive Adversarial Image Perturbations against Autonomous Vehicles
Hyung-Jin Yoon
H. Jafarnejadsani
P. Voulgaris
AAML
19
5
0
28 Dec 2022
Global Optimization with Parametric Function Approximation
Global Optimization with Parametric Function Approximation
Chong Liu
Yu-Xiang Wang
36
7
0
16 Nov 2022
A Provably Efficient Model-Free Posterior Sampling Method for Episodic
  Reinforcement Learning
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning
Christoph Dann
M. Mohri
Tong Zhang
Julian Zimmert
OffRL
18
33
0
23 Aug 2022
Graph Neural Network Bandits
Graph Neural Network Bandits
Parnian Kassraie
Andreas Krause
Ilija Bogunovic
26
11
0
13 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling
POEM: Out-of-Distribution Detection with Posterior Sampling
Yifei Ming
Ying Fan
Yixuan Li
OODD
29
114
0
28 Jun 2022
Neural Collaborative Filtering Bandits via Meta Learning
Neural Collaborative Filtering Bandits via Meta Learning
Yikun Ban
Yunzhe Qi
Tianxin Wei
Jingrui He
OffRL
31
9
0
31 Jan 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error:
  An Enhanced Bayesian Upper Confidence Bound Framework
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework
Ziyi Huang
H. Lam
A. Meisami
Haofeng Zhang
36
4
0
31 Jan 2022
Quantifying Epistemic Uncertainty in Deep Learning
Quantifying Epistemic Uncertainty in Deep Learning
Ziyi Huang
H. Lam
Haofeng Zhang
UQCV
BDL
UD
PER
24
12
0
23 Oct 2021
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
OffRL
29
39
0
07 Oct 2021
Deep Exploration for Recommendation Systems
Deep Exploration for Recommendation Systems
Zheqing Zhu
Benjamin Van Roy
32
11
0
26 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-shan Shiu
31
51
0
20 Aug 2021
Neural Active Learning with Performance Guarantees
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
29
22
0
06 Jun 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Online Limited Memory Neural-Linear Bandits with Likelihood Matching
Ofir Nabati
Tom Zahavy
Shie Mannor
21
18
0
07 Feb 2021
1