Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1906.08947
Cited By
v1
v2
v3 (latest)
Randomized Exploration in Generalized Linear Bandits
21 June 2019
Branislav Kveton
Manzil Zaheer
Csaba Szepesvári
Lihong Li
Mohammad Ghavamzadeh
Craig Boutilier
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Randomized Exploration in Generalized Linear Bandits"
22 / 22 papers shown
Title
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
506
0
0
04 May 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
485
1
0
08 Nov 2024
A Unified Confidence Sequence for Generalized Linear Models, with Applications to Bandits
Junghyun Lee
Se-Young Yun
Kwang-Sung Jun
141
6
0
19 Jul 2024
Random Latent Exploration for Deep Reinforcement Learning
Srinath Mahankali
Zhang-Wei Hong
Ayush Sekhari
Alexander Rakhlin
Pulkit Agrawal
230
3
0
18 Jul 2024
Prior-Dependent Allocations for Bayesian Fixed-Budget Best-Arm Identification in Structured Bandits
Nicolas Nguyen
Imad Aouali
András Gyorgy
Claire Vernade
77
2
0
08 Feb 2024
Truncated LinUCB for Stochastic Linear Bandits
Yanglei Song
Meng zhou
203
0
0
23 Feb 2022
On the Performance of Thompson Sampling on Logistic Bandits
Shi Dong
Tengyu Ma
Benjamin Van Roy
49
39
0
12 May 2019
Perturbed-History Exploration in Stochastic Linear Bandits
Branislav Kveton
Csaba Szepesvári
Mohammad Ghavamzadeh
Craig Boutilier
36
43
0
21 Mar 2019
Perturbed-History Exploration in Stochastic Multi-Armed Bandits
Branislav Kveton
Csaba Szepesvári
Mohammad Ghavamzadeh
Craig Boutilier
44
31
0
26 Feb 2019
Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits
Branislav Kveton
Csaba Szepesvári
Sharan Vaswani
Zheng Wen
Mohammad Ghavamzadeh
Tor Lattimore
142
70
0
13 Nov 2018
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling
C. Riquelme
George Tucker
Jasper Snoek
BDL
76
366
0
26 Feb 2018
Customized Nonlinear Bandits for Online Response Selection in Neural Conversation Models
Bing-Quan Liu
Tong Yu
Ian Lane
Ole J. Mengshoel
OffRL
47
29
0
22 Nov 2017
BBQ-Networks: Efficient Exploration in Deep Reinforcement Learning for Task-Oriented Dialogue Systems
Zachary Chase Lipton
Xiujun Li
Jianfeng Gao
Lihong Li
Faisal Ahmed
Li Deng
68
172
0
15 Nov 2017
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
Han Xiao
Kashif Rasul
Roland Vollgraf
283
8,904
0
25 Aug 2017
Scalable Generalized Linear Bandits: Online Computation and Hashing
Kwang-Sung Jun
Aniruddha Bhargava
Robert D. Nowak
Rebecca Willett
74
126
0
01 Jun 2017
Ensemble Sampling
Xiuyuan Lu
Benjamin Van Roy
129
121
0
20 May 2017
Online Stochastic Linear Optimization under One-bit Feedback
Lijun Zhang
Tianbao Yang
Rong Jin
Zhi Zhou
52
66
0
25 Sep 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.0K
150,260
0
22 Dec 2014
Generalization and Exploration via Randomized Value Functions
Ian Osband
Benjamin Van Roy
Zheng Wen
91
314
0
04 Feb 2014
Thompson Sampling for Complex Bandit Problems
Aditya Gopalan
Shie Mannor
Yishay Mansour
152
203
0
03 Nov 2013
Further Optimal Regret Bounds for Thompson Sampling
Shipra Agrawal
Navin Goyal
107
442
0
15 Sep 2012
Thompson Sampling for Contextual Bandits with Linear Payoffs
Shipra Agrawal
Navin Goyal
195
1,006
0
15 Sep 2012
1