Randomized Exploration in Generalized Linear Bandits

21 June 2019

Papers citing "Randomized Exploration in Generalized Linear Bandits"

Title | Authors | Tag | Counts | Date
Neural Logistic Bandits | Seoungbin Bae, Dabeen Lee | | 506 0 0 | 04 May 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits | H. Bui, Enrique Mallada, Anqi Liu | | 485 1 0 | 08 Nov 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits | Junghyun Lee, Se-Young Yun, Kwang-Sung Jun | | 141 6 0 | 19 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning | Srinath Mahankali, Zhang-Wei Hong, Ayush Sekhari, Alexander Rakhlin, Pulkit Agrawal | | 230 3 0 | 18 Jul 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits | Nicolas Nguyen, Imad Aouali, András Gyorgy, Claire Vernade | | 77 2 0 | 08 Feb 2024
Truncated LinUCB for Stochastic Linear Bandits | Yanglei Song, Meng Zhou | | 203 0 0 | 23 Feb 2022
On the Performance of Thompson Sampling on Logistic Bandits | Shi Dong, Tengyu Ma, Benjamin Van Roy | | 49 39 0 | 12 May 2019
Perturbed-History Exploration in Stochastic Linear Bandits | Branislav Kveton, Csaba Szepesvári, Mohammad Ghavamzadeh, Craig Boutilier | | 36 43 0 | 21 Mar 2019
Perturbed-History Exploration in Stochastic Multi-Armed Bandits | Branislav Kveton, Csaba Szepesvári, Mohammad Ghavamzadeh, Craig Boutilier | | 44 31 0 | 26 Feb 2019
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits | Branislav Kveton, Csaba Szepesvári, Sharan Vaswani, Zheng Wen, Mohammad Ghavamzadeh, Tor Lattimore | | 142 70 0 | 13 Nov 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | C. Riquelme, George Tucker, Jasper Snoek | BDL | 76 366 0 | 26 Feb 2018
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models | Bing-Quan Liu, Tong Yu, Ian Lane, Ole J. Mengshoel | OffRL | 47 29 0 | 22 Nov 2017
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems | Zachary Chase Lipton, Xiujun Li, Jianfeng Gao, Lihong Li, Faisal Ahmed, Li Deng | | 68 172 0 | 15 Nov 2017
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms | Han Xiao, Kashif Rasul, Roland Vollgraf | | 283 8,904 0 | 25 Aug 2017
Scalable Generalized Linear Bandits: Online Computation and Hashing | Kwang-Sung Jun, Aniruddha Bhargava, Robert D. Nowak, Rebecca Willett | | 74 126 0 | 01 Jun 2017
Ensemble Sampling | Xiuyuan Lu, Benjamin Van Roy | | 129 121 0 | 20 May 2017
Online Stochastic Linear Optimization under One-bit Feedback | Lijun Zhang, Tianbao Yang, Rong Jin, Zhi Zhou | | 52 66 0 | 25 Sep 2015
Adam: A Method for Stochastic Optimization | Diederik P. Kingma, Jimmy Ba | ODL | 2.0K 150,260 0 | 22 Dec 2014
Generalization and Exploration via Randomized Value Functions | Ian Osband, Benjamin Van Roy, Zheng Wen | | 91 314 0 | 04 Feb 2014
Thompson Sampling for Complex Bandit Problems | Aditya Gopalan, Shie Mannor, Yishay Mansour | | 152 203 0 | 03 Nov 2013
Further Optimal Regret Bounds for Thompson Sampling | Shipra Agrawal, Navin Goyal | | 107 442 0 | 15 Sep 2012
Thompson Sampling for Contextual Bandits with Linear Payoffs | Shipra Agrawal, Navin Goyal | | 195 1,006 0 | 15 Sep 2012