Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1309.6869
Cited By
Finite-Time Analysis of Kernelised Contextual Bandits
26 September 2013
Michal Valko
N. Korda
Rémi Munos
I. Flaounas
N. Cristianini
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Finite-Time Analysis of Kernelised Contextual Bandits"
50 / 66 papers shown
Title
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
243
0
0
04 May 2025
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
81
0
0
03 Apr 2025
Bandit Optimal Transport
Lorenzo Croissant
84
0
0
11 Feb 2025
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance
S. Iwazaki
Shion Takeno
85
1
0
10 Feb 2025
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants
Fares Fourati
Salma Kharrat
Vaneet Aggarwal
Mohamed-Slim Alouini
73
0
0
06 Feb 2025
Differentially Private Kernelized Contextual Bandits
Nikola Pavlovic
Sudeep Salgia
Qing Zhao
42
1
0
13 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
219
0
0
08 Nov 2024
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
41
0
0
22 Oct 2024
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
45
2
0
17 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
84
1
0
22 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
42
5
0
24 Jul 2024
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Hongyu Zhou
Vasileios Tzoumas
101
4
0
04 Jul 2024
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
56
0
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
89
2
0
07 Jun 2024
Learning Decision Policies with Instrumental Variables through Double Machine Learning
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
56
1
0
14 May 2024
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
38
2
0
26 Apr 2024
Stochastic Graph Bandit Learning with Side-Observations
Xueping Gong
Jiheng Zhang
34
1
0
29 Aug 2023
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
34
1
0
07 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
42
0
0
07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
44
8
0
05 May 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
55
5
0
24 Feb 2023
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
Kush S. Bhatia
Wenshuo Guo
Jacob Steinhardt
27
0
0
23 Feb 2023
Sequential Counterfactual Risk Minimization
Houssam Zenati
Eustache Diemert
Matthieu Martin
Julien Mairal
Pierre Gaillard
OffRL
29
3
0
23 Feb 2023
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
25
1
0
13 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
40
9
0
23 Jul 2022
Collaborative Learning in Kernel-based Bandits for Distributed Users
Sudeep Salgia
Sattar Vakili
Qing Zhao
FedML
44
6
0
16 Jul 2022
Graph Neural Network Bandits
Parnian Kassraie
Andreas Krause
Ilija Bogunovic
31
11
0
13 Jul 2022
Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback
Adhyyan Narang
Omid Sadeghi
Lillian J. Ratliff
Maryam Fazel
J. Bilmes
OffRL
18
1
0
07 Jul 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
62
6
0
24 Jun 2022
Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization
Shouri Hu
Haowei Wang
Zhongxiang Dai
K. H. Low
Szu Hui Ng
33
4
0
10 May 2022
Provably Efficient Kernelized Q-Learning
Shuang Liu
H. Su
MLT
34
4
0
21 Apr 2022
Instance-Dependent Regret Analysis of Kernelized Bandits
S. Shekhar
T. Javidi
29
3
0
12 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
35
5
0
04 Mar 2022
Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network
Yun-Da Tsai
Shou-De Lin
51
5
0
17 Feb 2022
Efficient Kernel UCB for Contextual Bandits
Houssam Zenati
A. Bietti
Eustache Diemert
Julien Mairal
Matthieu Martin
Pierre Gaillard
29
3
0
11 Feb 2022
Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning
Sattar Vakili
Jonathan Scarlett
Da-Shan Shiu
A. Bernacchia
38
18
0
08 Feb 2022
Neural Collaborative Filtering Bandits via Meta Learning
Yikun Ban
Yunzhe Qi
Tianxin Wei
Jingrui He
OffRL
39
9
0
31 Jan 2022
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
36
29
0
27 Nov 2021
Misspecified Gaussian Process Bandit Optimization
Ilija Bogunovic
Andreas Krause
57
43
0
09 Nov 2021
Empirical analysis of representation learning and exploration in neural kernel bandits
Michal Lisicki
Arash Afkanpour
Graham W. Taylor
26
0
0
05 Nov 2021
Nearly Optimal Algorithms for Level Set Estimation
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin G. Jamieson
Robert D. Nowak
Lalit P. Jain
32
22
0
02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
Wei Chen
Yuko Kuroki
Longbo Huang
45
10
0
29 Oct 2021
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
OffRL
39
39
0
07 Oct 2021
Uniform Generalization Bounds for Overparameterized Neural Networks
Sattar Vakili
Michael Bromberg
Jezabel R. Garcia
Da-Shan Shiu
A. Bernacchia
35
19
0
13 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-Shan Shiu
39
51
0
20 Aug 2021
Pure Exploration in Kernel and Neural Bandits
Yinglun Zhu
Dongruo Zhou
Ruoxi Jiang
Quanquan Gu
Rebecca Willett
Robert D. Nowak
13
15
0
22 Jun 2021
Ada-BKB: Scalable Gaussian Process Optimization on Continuous Domains by Adaptive Discretization
Marco Rando
Luigi Carratino
S. Villa
Lorenzo Rosasco
44
5
0
16 Jun 2021
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
32
22
0
06 Jun 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
41
17
0
18 Mar 2021
On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces
Zhuoran Yang
Chi Jin
Zhaoran Wang
Mengdi Wang
Michael I. Jordan
44
18
0
09 Nov 2020
1
2
Next