Finite-Time Analysis of Kernelised Contextual Bandits

26 September 2013

Papers citing "Finite-Time Analysis of Kernelised Contextual Bandits"

50 / 66 papers shown

Title
Neural Logistic Bandits Seoungbin Bae Dabeen Lee 243 0 0 04 May 2025
Prompt Optimization with Logged Bandit Data Haruka Kiyohara Daniel Yiming Cao Yuta Saito Thorsten Joachims 81 0 0 03 Apr 2025
Bandit Optimal Transport Lorenzo Croissant 84 0 0 11 Feb 2025
Improved Regret Analysis in Gaussian Process Bandits: Optimality for Noiseless Reward, RKHS norm, and Non-Stationary Variance S. Iwazaki Shion Takeno 85 1 0 10 Feb 2025
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants Fares Fourati Salma Kharrat Vaneet Aggarwal Mohamed-Slim Alouini 73 0 0 06 Feb 2025
Differentially Private Kernelized Contextual Bandits Nikola Pavlovic Sudeep Salgia Qing Zhao 42 1 0 13 Jan 2025
Variance-Aware Linear UCB with Deep Representation for Neural Contextual Bandits H. Bui Enrique Mallada Anqi Liu 219 0 0 08 Nov 2024
Lower Bounds for Time-Varying Kernelized Bandits Xu Cai Jonathan Scarlett 41 0 0 22 Oct 2024
An Online Learning Approach to Prompt-based Selection of Generative Models Xiaoyan Hu Ho-fung Leung Farzan Farnia 45 2 0 17 Oct 2024
Leveraging Unlabeled Data Sharing through Kernel Function Approximation in Offline Reinforcement Learning Yen-Ru Lai Fu-Chieh Chang Pei-Yuan Wu OffRL 84 1 0 22 Aug 2024
Neural Dueling Bandits: Preference-Based Optimization with Human Feedback Arun Verma Zhongxiang Dai Xiaoqiang Lin Patrick Jaillet K. H. Low 42 5 0 24 Jul 2024
Simultaneous System Identification and Model Predictive Control with No Dynamic Regret Hongyu Zhou Vasileios Tzoumas 101 4 0 04 Jul 2024
Graph Neural Thompson Sampling Shuang Wu Arash A. Amini 56 0 0 15 Jun 2024
Pretraining Decision Transformers with Reward Prediction for In-Context Multi-task Structured Bandit Learning Subhojyoti Mukherjee Josiah P. Hanna Qiaomin Xie Robert Nowak 89 2 0 07 Jun 2024
Learning Decision Policies with Instrumental Variables through Double Machine Learning Daqian Shao Ashkan Soleymani Francesco Quinzan Marta Z. Kwiatkowska 56 1 0 14 May 2024
Causally Abstracted Multi-armed Bandits Fabio Massimo Zennaro Nicholas Bishop Joel Dyer Yorgos Felekis Anisoara Calinescu Michael Wooldridge Theodoros Damoulas 38 2 0 26 Apr 2024
Stochastic Graph Bandit Learning with Side-Observations Xueping Gong Jiheng Zhang 34 1 0 29 Aug 2023
Online Network Source Optimization with Graph-Kernel MAB Laura Toni P. Frossard 34 1 0 07 Jul 2023
BOF-UCB: A Bayesian-Optimistic Frequentist Algorithm for Non-Stationary Contextual Bandits Nicklas Werge Abdullah Akgul M. Kandemir 42 0 0 07 Jul 2023
Neural Exploitation and Exploration of Contextual Bandits Yikun Ban Yuchen Yan A. Banerjee Jingrui He 44 8 0 05 May 2023
VIPeR: Provably Efficient Algorithm for Offline RL with Neural Function Approximation Thanh Nguyen-Tang R. Arora OffRL 55 5 0 24 Feb 2023
Reward Learning as Doubly Nonparametric Bandits: Optimal Design and Scaling Laws Kush S. Bhatia Wenshuo Guo Jacob Steinhardt 27 0 0 23 Feb 2023
Sequential Counterfactual Risk Minimization Houssam Zenati Eustache Diemert Matthieu Martin Julien Mairal Pierre Gaillard OffRL 29 3 0 23 Feb 2023
Reward Imputation with Sketching for Contextual Batched Bandits Xiao Zhang Ninglu Shao Zihua Si Jun Xu Wen Wang Hanjing Su Jirong Wen OffRL 25 1 0 13 Oct 2022
Exploration in Linear Bandits with Rich Action Sets and its Implications for Inference Debangshu Banerjee Avishek Ghosh Sayak Ray Chowdhury Aditya Gopalan 40 9 0 23 Jul 2022
Collaborative Learning in Kernel-based Bandits for Distributed Users Sudeep Salgia Sattar Vakili Qing Zhao FedML 44 6 0 16 Jul 2022
Graph Neural Network Bandits Parnian Kassraie Andreas Krause Ilija Bogunovic 31 11 0 13 Jul 2022
Online SuBmodular + SuPermodular (BP) Maximization with Bandit Feedback Adhyyan Narang Omid Sadeghi Lillian J. Ratliff Maryam Fazel J. Bilmes OffRL 18 1 0 07 Jul 2022
Computationally Efficient PAC RL in POMDPs with Latent Determinism and Conditional Embeddings Masatoshi Uehara Ayush Sekhari Jason D. Lee Nathan Kallus Wen Sun 62 6 0 24 Jun 2022
Adjusted Expected Improvement for Cumulative Regret Minimization in Noisy Bayesian Optimization Shouri Hu Haowei Wang Zhongxiang Dai K. H. Low Szu Hui Ng 33 4 0 10 May 2022
Provably Efficient Kernelized Q-Learning Shuang Liu H. Su MLT 34 4 0 21 Apr 2022
Instance-Dependent Regret Analysis of Kernelized Bandits S. Shekhar T. Javidi 29 3 0 12 Mar 2022
Reinforcement Learning in Modern Biostatistics: Constructing Optimal Adaptive Interventions Nina Deliu Joseph Jay Williams B. Chakraborty OffRL 35 5 0 04 Mar 2022
Fast online inference for nonlinear contextual bandit based on Generative Adversarial Network Yun-Da Tsai Shou-De Lin 51 5 0 17 Feb 2022
Efficient Kernel UCB for Contextual Bandits Houssam Zenati A. Bietti Eustache Diemert Julien Mairal Matthieu Martin Pierre Gaillard 29 3 0 11 Feb 2022
Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning Sattar Vakili Jonathan Scarlett Da-Shan Shiu A. Bernacchia 38 18 0 08 Feb 2022
Neural Collaborative Filtering Bandits via Meta Learning Yikun Ban Yunzhe Qi Tianxin Wei Jingrui He OffRL 39 9 0 31 Jan 2022
Offline Neural Contextual Bandits: Pessimism, Optimization and Generalization Thanh Nguyen-Tang Sunil R. Gupta A. Nguyen Svetha Venkatesh OffRL 36 29 0 27 Nov 2021
Misspecified Gaussian Process Bandit Optimization Ilija Bogunovic Andreas Krause 57 43 0 09 Nov 2021
Empirical analysis of representation learning and exploration in neural kernel bandits Michal Lisicki Arash Afkanpour Graham W. Taylor 26 0 0 05 Nov 2021
Nearly Optimal Algorithms for Level Set Estimation Blake Mason Romain Camilleri Subhojyoti Mukherjee Kevin G. Jamieson Robert D. Nowak Lalit P. Jain 32 22 0 02 Nov 2021
Collaborative Pure Exploration in Kernel Bandit Yihan Du Wei Chen Yuko Kuroki Longbo Huang 45 10 0 29 Oct 2021
EE-Net: Exploitation-Exploration Neural Networks in Contextual Bandits Yikun Ban Yuchen Yan A. Banerjee Jingrui He OffRL 39 39 0 07 Oct 2021
Uniform Generalization Bounds for Overparameterized Neural Networks Sattar Vakili Michael Bromberg Jezabel R. Garcia Da-Shan Shiu A. Bernacchia 35 19 0 13 Sep 2021
Optimal Order Simple Regret for Gaussian Process Bandits Sattar Vakili N. Bouziani Sepehr Jalali A. Bernacchia Da-Shan Shiu 39 51 0 20 Aug 2021
Pure Exploration in Kernel and Neural Bandits Yinglun Zhu Dongruo Zhou Ruoxi Jiang Quanquan Gu Rebecca Willett Robert D. Nowak 13 15 0 22 Jun 2021
Ada-BKB: Scalable Gaussian Process Optimization on Continuous Domains by Adaptive Discretization Marco Rando Luigi Carratino S. Villa Lorenzo Rosasco 44 5 0 16 Jun 2021
Neural Active Learning with Performance Guarantees Pranjal Awasthi Christoph Dann Claudio Gentile Ayush Sekhari Zhilei Wang 32 22 0 06 Jun 2021
Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning Sebastian Curi Ilija Bogunovic Andreas Krause 41 17 0 18 Mar 2021
On Function Approximation in Reinforcement Learning: Optimism in the Face of Large State Spaces Zhuoran Yang Chi Jin Zhaoran Wang Mengdi Wang Michael I. Jordan 44 18 0 09 Nov 2020