Neural Thompson Sampling

2 October 2020

Quanquan Gu

Papers citing "Neural Thompson Sampling"

24 / 24 papers shown

Title | Authors | Tags | Counts | Date
Neural Logistic Bandits | Seoungbin Bae, Dabeen Lee | | 177 0 0 | 04 May 2025
Online Clustering of Dueling Bandits | Zhiyong Wang, Jiahang Sun, Mingze Kong, Jize Xie, Qinghua Hu, J. C. Lui, Zhongxiang Dai | | 83 0 0 | 04 Feb 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits | H. Bui, Enrique Mallada, Anqi Liu | | 144 0 0 | 08 Nov 2024
Batched Bayesian optimization by maximizing the probability of including the optimum | Jenna C. Fromer, Runzhong Wang, Mrunali Manjrekar, Austin Tripp, José Miguel Hernández-Lobato, Connor W. Coley | | 47 0 0 | 08 Oct 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback | Arun Verma, Zhongxiang Dai, Xiaoqiang Lin, Patrick Jaillet, K. H. Low | | 37 5 0 | 24 Jul 2024
Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions | Kai Xu, Farid Tajaddodianfar, Ben Allison | | 21 0 0 | 16 Jun 2024
Graph Neural Thompson Sampling | Shuang Wu, Arash A. Amini | | 51 0 0 | 15 Jun 2024
VITS: Variational Inference Thompson Sampling for contextual bandits | Pierre Clavier, Tom Huix, Alain Durmus | | 27 3 0 | 19 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits | Nicklas Werge, Abdullah Akgul, M. Kandemir | | 38 0 0 | 07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits | Yikun Ban, Yuchen Yan, A. Banerjee, Jingrui He | | 42 8 0 | 05 May 2023
Adaptive Endpointing with Deep Contextual Multi-armed Bandits | Do June Min, A. Stolcke, A. Raju, Colin Vaz, Di He, Venkatesh Ravichandran, V. Trinh | OffRL | 35 0 0 | 23 Mar 2023
Learning When to Use Adaptive Adversarial Image Perturbations against Autonomous Vehicles | Hyung-Jin Yoon, H. Jafarnejadsani, P. Voulgaris | AAML | 19 5 0 | 28 Dec 2022
Global Optimization with Parametric Function Approximation | Chong Liu, Yu-Xiang Wang | | 36 7 0 | 16 Nov 2022
A Provably Efficient Model-Free Posterior Sampling Method for Episodic Reinforcement Learning | Christoph Dann, M. Mohri, Tong Zhang, Julian Zimmert | OffRL | 18 33 0 | 23 Aug 2022
Graph Neural Network Bandits | Parnian Kassraie, Andreas Krause, Ilija Bogunovic | | 26 11 0 | 13 Jul 2022
POEM: Out-of-Distribution Detection with Posterior Sampling | Yifei Ming, Ying Fan, Yixuan Li | OODD | 29 114 0 | 28 Jun 2022
Neural Collaborative Filtering Bandits via Meta Learning | Yikun Ban, Yunzhe Qi, Tianxin Wei, Jingrui He | OffRL | 31 9 0 | 31 Jan 2022
Optimal Regret Is Achievable with Bounded Approximate Inference Error: An Enhanced Bayesian Upper Confidence Bound Framework | Ziyi Huang, H. Lam, A. Meisami, Haofeng Zhang | | 36 4 0 | 31 Jan 2022
Quantifying Epistemic Uncertainty in Deep Learning | Ziyi Huang, H. Lam, Haofeng Zhang | UQCV, BDL, UD, PER | 24 12 0 | 23 Oct 2021
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits | Yikun Ban, Yuchen Yan, A. Banerjee, Jingrui He | OffRL | 29 39 0 | 07 Oct 2021
Deep Exploration for Recommendation Systems | Zheqing Zhu, Benjamin Van Roy | | 32 11 0 | 26 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits | Sattar Vakili, N. Bouziani, Sepehr Jalali, A. Bernacchia, Da-shan Shiu | | 31 51 0 | 20 Aug 2021
Neural Active Learning with Performance Guarantees | Pranjal Awasthi, Christoph Dann, Claudio Gentile, Ayush Sekhari, Zhilei Wang | | 29 22 0 | 06 Jun 2021
Online Limited Memory Neural-Linear Bandits with Likelihood Matching | Ofir Nabati, Tom Zahavy, Shie Mannor | | 21 18 0 | 07 Feb 2021