ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1309.6869
  4. Cited By
Finite-Time Analysis of Kernelised Contextual Bandits

Finite-Time Analysis of Kernelised Contextual Bandits

26 September 2013
Michal Valko
N. Korda
Rémi Munos
I. Flaounas
N. Cristianini
ArXivPDFHTML

Papers citing "Finite-Time Analysis of Kernelised Contextual Bandits"

50 / 66 papers shown
Title
Neural Logistic Bandits
Neural Logistic Bandits
Seoungbin Bae
Dabeen Lee
243
0
0
04 May 2025
Prompt Optimization with Logged Bandit Data
Prompt Optimization with Logged Bandit Data
Haruka Kiyohara
Daniel Yiming Cao
Yuta Saito
Thorsten Joachims
81
0
0
03 Apr 2025
Bandit Optimal Transport
Bandit Optimal Transport
Lorenzo Croissant
84
0
0
11 Feb 2025
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance
S. Iwazaki
Shion Takeno
85
1
0
10 Feb 2025
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants
Fares Fourati
Salma Kharrat
Vaneet Aggarwal
Mohamed-Slim Alouini
73
0
0
06 Feb 2025
Differentially Private Kernelized Contextual Bandits
Differentially Private Kernelized Contextual Bandits
Nikola Pavlovic
Sudeep Salgia
Qing Zhao
42
1
0
13 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits
H. Bui
Enrique Mallada
Anqi Liu
219
0
0
08 Nov 2024
Lower Bounds for Time-Varying Kernelized Bandits
Lower Bounds for Time-Varying Kernelized Bandits
Xu Cai
Jonathan Scarlett
41
0
0
22 Oct 2024
An Online Learning Approach to Prompt-based Selection of Generative Models
An Online Learning Approach to Prompt-based Selection of Generative Models
Xiaoyan Hu
Ho-fung Leung
Farzan Farnia
45
2
0
17 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning
Yen-Ru Lai
Fu-Chieh Chang
Pei-Yuan Wu
OffRL
84
1
0
22 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback
Arun Verma
Zhongxiang Dai
Xiaoqiang Lin
Patrick Jaillet
K. H. Low
42
5
0
24 Jul 2024
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret
Hongyu Zhou
Vasileios Tzoumas
101
4
0
04 Jul 2024
Graph Neural Thompson Sampling
Graph Neural Thompson Sampling
Shuang Wu
Arash A. Amini
56
0
0
15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning
Subhojyoti Mukherjee
Josiah P. Hanna
Qiaomin Xie
Robert Nowak
89
2
0
07 Jun 2024
Learning Decision Policies with Instrumental Variables through Double
  Machine Learning
Learning Decision Policies with Instrumental Variables through Double Machine Learning
Daqian Shao
Ashkan Soleymani
Francesco Quinzan
Marta Z. Kwiatkowska
56
1
0
14 May 2024
Causally Abstracted Multi-armed Bandits
Causally Abstracted Multi-armed Bandits
Fabio Massimo Zennaro
Nicholas Bishop
Joel Dyer
Yorgos Felekis
Anisoara Calinescu
Michael Wooldridge
Theodoros Damoulas
38
2
0
26 Apr 2024
Stochastic Graph Bandit Learning with Side-Observations
Stochastic Graph Bandit Learning with Side-Observations
Xueping Gong
Jiheng Zhang
34
1
0
29 Aug 2023
Online Network Source Optimization with Graph-Kernel MAB
Online Network Source Optimization with Graph-Kernel MAB
Laura Toni
P. Frossard
34
1
0
07 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary
  Contextual Bandits
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits
Nicklas Werge
Abdullah Akgul
M. Kandemir
42
0
0
07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits
Neural Exploitation and Exploration of Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
44
8
0
05 May 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function
  Approximation
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation
Thanh Nguyen-Tang
R. Arora
OffRL
55
5
0
24 Feb 2023
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and
  Scaling Laws
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws
Kush S. Bhatia
Wenshuo Guo
Jacob Steinhardt
27
0
0
23 Feb 2023
Sequential Counterfactual Risk Minimization
Sequential Counterfactual Risk Minimization
Houssam Zenati
Eustache Diemert
Matthieu Martin
Julien Mairal
Pierre Gaillard
OffRL
29
3
0
23 Feb 2023
Reward Imputation with Sketching for Contextual Batched Bandits
Reward Imputation with Sketching for Contextual Batched Bandits
Xiao Zhang
Ninglu Shao
Zihua Si
Jun Xu
Wen Wang
Hanjing Su
Jirong Wen
OffRL
25
1
0
13 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications
  for Inference
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference
Debangshu Banerjee
Avishek Ghosh
Sayak Ray Chowdhury
Aditya Gopalan
40
9
0
23 Jul 2022
Collaborative Learning in Kernel-based Bandits for Distributed Users
Collaborative Learning in Kernel-based Bandits for Distributed Users
Sudeep Salgia
Sattar Vakili
Qing Zhao
FedML
44
6
0
16 Jul 2022
Graph Neural Network Bandits
Graph Neural Network Bandits
Parnian Kassraie
Andreas Krause
Ilija Bogunovic
31
11
0
13 Jul 2022
Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback
Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback
Adhyyan Narang
Omid Sadeghi
Lillian J. Ratliff
Maryam Fazel
J. Bilmes
OffRL
18
1
0
07 Jul 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and
  Conditional Embeddings
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings
Masatoshi Uehara
Ayush Sekhari
Jason D. Lee
Nathan Kallus
Wen Sun
62
6
0
24 Jun 2022
Adjusted Expected Improvement for Cumulative Regret Minimization in
  Noisy Bayesian Optimization
Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization
Shouri Hu
Haowei Wang
Zhongxiang Dai
K. H. Low
Szu Hui Ng
33
4
0
10 May 2022
Provably Efficient Kernelized Q-Learning
Provably Efficient Kernelized Q-Learning
Shuang Liu
H. Su
MLT
34
4
0
21 Apr 2022
Instance-Dependent Regret Analysis of Kernelized Bandits
Instance-Dependent Regret Analysis of Kernelized Bandits
S. Shekhar
T. Javidi
29
3
0
12 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal
  Adaptive Interventions
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions
Nina Deliu
Joseph Jay Williams
B. Chakraborty
OffRL
35
5
0
04 Mar 2022
Fast online inference for nonlinear contextual bandit based on
  Generative Adversarial Network
Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network
Yun-Da Tsai
Shou-De Lin
51
5
0
17 Feb 2022
Efficient Kernel UCB for Contextual Bandits
Efficient Kernel UCB for Contextual Bandits
Houssam Zenati
A. Bietti
Eustache Diemert
Julien Mairal
Matthieu Martin
Pierre Gaillard
29
3
0
11 Feb 2022
Improved Convergence Rates for Sparse Approximation Methods in
  Kernel-Based Learning
Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning
Sattar Vakili
Jonathan Scarlett
Da-Shan Shiu
A. Bernacchia
38
18
0
08 Feb 2022
Neural Collaborative Filtering Bandits via Meta Learning
Neural Collaborative Filtering Bandits via Meta Learning
Yikun Ban
Yunzhe Qi
Tianxin Wei
Jingrui He
OffRL
39
9
0
31 Jan 2022
Offline Neural Contextual Bandits: Pessimism, Optimization and
  Generalization
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization
Thanh Nguyen-Tang
Sunil R. Gupta
A. Nguyen
Svetha Venkatesh
OffRL
36
29
0
27 Nov 2021
Misspecified Gaussian Process Bandit Optimization
Misspecified Gaussian Process Bandit Optimization
Ilija Bogunovic
Andreas Krause
57
43
0
09 Nov 2021
Empirical analysis of representation learning and exploration in neural
  kernel bandits
Empirical analysis of representation learning and exploration in neural kernel bandits
Michal Lisicki
Arash Afkanpour
Graham W. Taylor
26
0
0
05 Nov 2021
Nearly Optimal Algorithms for Level Set Estimation
Nearly Optimal Algorithms for Level Set Estimation
Blake Mason
Romain Camilleri
Subhojyoti Mukherjee
Kevin G. Jamieson
Robert D. Nowak
Lalit P. Jain
32
22
0
02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit
Collaborative Pure Exploration in Kernel Bandit
Yihan Du
Wei Chen
Yuko Kuroki
Longbo Huang
45
10
0
29 Oct 2021
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits
Yikun Ban
Yuchen Yan
A. Banerjee
Jingrui He
OffRL
39
39
0
07 Oct 2021
Uniform Generalization Bounds for Overparameterized Neural Networks
Uniform Generalization Bounds for Overparameterized Neural Networks
Sattar Vakili
Michael Bromberg
Jezabel R. Garcia
Da-Shan Shiu
A. Bernacchia
35
19
0
13 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits
Optimal Order Simple Regret for Gaussian Process Bandits
Sattar Vakili
N. Bouziani
Sepehr Jalali
A. Bernacchia
Da-Shan Shiu
39
51
0
20 Aug 2021
Pure Exploration in Kernel and Neural Bandits
Pure Exploration in Kernel and Neural Bandits
Yinglun Zhu
Dongruo Zhou
Ruoxi Jiang
Quanquan Gu
Rebecca Willett
Robert D. Nowak
13
15
0
22 Jun 2021
Ada-BKB: Scalable Gaussian Process Optimization on Continuous Domains by
  Adaptive Discretization
Ada-BKB: Scalable Gaussian Process Optimization on Continuous Domains by Adaptive Discretization
Marco Rando
Luigi Carratino
S. Villa
Lorenzo Rosasco
44
5
0
16 Jun 2021
Neural Active Learning with Performance Guarantees
Neural Active Learning with Performance Guarantees
Pranjal Awasthi
Christoph Dann
Claudio Gentile
Ayush Sekhari
Zhilei Wang
32
22
0
06 Jun 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based
  Deep Reinforcement Learning
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning
Sebastian Curi
Ilija Bogunovic
Andreas Krause
41
17
0
18 Mar 2021
On Function Approximation in Reinforcement Learning: Optimism in the
  Face of Large State Spaces
On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces
Zhuoran Yang
Chi Jin
Zhaoran Wang
Mengdi Wang
Michael I. Jordan
44
18
0
09 Nov 2020
12
Next