Kernel-based methods for bandit convex optimization

11 July 2016

Papers citing "Kernel-based methods for bandit convex optimization"

29 / 29 papers shown

Title
Online Episodic Convex Reinforcement Learning B. Moreno Khaled Eldowa Pierre Gaillard Margaux Brégère Nadia Oudjane OffRL 29 0 0 12 May 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 72 1 0 06 Mar 2025
A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise Jingxin Zhan Yuchen Xin Kaicheng Jin Zhihua Zhang 29 0 0 19 Jan 2025
Single Point-Based Distributed Zeroth-Order Optimization with a Non-Convex Stochastic Objective Function Elissa Mhanna Mohamad Assaad 51 4 0 08 Oct 2024
Online Newton Method for Bandit Convex Optimisation Hidde Fokkema Dirk van der Hoeven Tor Lattimore Jack J. Mayo 30 5 0 10 Jun 2024
Anytime Model Selection in Linear Bandits Parnian Kassraie N. Emmenegger Andreas Krause Aldo Pacchiano 46 2 0 24 Jul 2023
Fast Submodular Function Maximization Lianke Qin Zhao-quan Song Yitan Wang 21 10 0 15 May 2023
Bandit Convex Optimisation Revisited: FTRL Achieves $\tilde{O}(t^{1/2})$ Regret David Young D. Leith Georgios Iosifidis 13 0 0 01 Feb 2023
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient Dylan J. Foster Noah Golowich Yanjun Han OffRL 28 29 0 19 Jan 2023
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique Elissa Mhanna Mohamad Assaad 19 4 0 11 Oct 2022
A Unifying Framework for Online Optimization with Long-Term Constraints Matteo Castiglioni A. Celli A. Marchesi Giulia Romano N. Gatti 20 34 0 15 Sep 2022
Learning in Stackelberg Games with Non-myopic Agents Nika Haghtalab Thodoris Lykouris Sloan Nietert Alexander Wei 15 29 0 19 Aug 2022
A Near-Optimal Algorithm for Univariate Zeroth-Order Budget Convex Optimization F. Bachoc Tommaso Cesari Roberto Colomboni Andrea Paudice 24 2 0 13 Aug 2022
A Note on Zeroth-Order Optimization on the Simplex Tijana Zrnic Eric Mazumdar 26 0 0 02 Aug 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization Quan-Wu Xiao Qing Ling Tianyi Chen 41 0 0 14 Jun 2022
Building Robust Ensembles via Margin Boosting Dinghuai Zhang Hongyang R. Zhang Aaron Courville Yoshua Bengio Pradeep Ravikumar A. Suggala AAML UQCV 45 15 0 07 Jun 2022
Uncoupled Bandit Learning towards Rationalizability: Benchmarks, Barriers, and Algorithms Jibang Wu Haifeng Xu Fan Yao 22 1 0 10 Nov 2021
Who Leads and Who Follows in Strategic Classification? Tijana Zrnic Eric Mazumdar S. Shankar Sastry Michael I. Jordan 26 50 0 23 Jun 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games Gabriele Farina Robin Schmucker T. Sandholm 23 21 0 08 Mar 2021
Quantum Algorithm for Online Convex Optimization Jianhao He Feidiao Yang Jialin Zhang Lvzhou Li 25 4 0 29 Jul 2020
Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities Zhongruo Wang Krishnakumar Balasubramanian Shiqian Ma Meisam Razaviyayn 13 25 0 22 Jan 2020
Distributed Online Optimization with Long-Term Constraints Deming Yuan Alexandre Proutière Guodong Shi 18 57 0 20 Dec 2019
Learning Strategy-Aware Linear Classifiers Yiling Chen Yang Liu Chara Podimata 13 9 0 10 Nov 2019
Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition Lin Chen Qian-long Yu Hannah Lawrence Amin Karbasi 24 20 0 24 Oct 2019
No-Regret Learning in Unknown Games with Correlated Payoffs Pier Giuseppe Sessa Ilija Bogunovic Maryam Kamgarpour Andreas Krause OffRL 32 39 0 18 Sep 2019
$Non-stationary Stochastic Optimization under $L_{p,q}$-Variation Measures$ Non-stationary Stochastic Optimization under $L_{p,q}$ -Variation Measures Xi Chen Yining Wang Yu-Xiang Wang 14 12 0 09 Aug 2017
Bandits with Movement Costs and Adaptive Pricing Tomer Koren Roi Livni Yishay Mansour 19 20 0 24 Feb 2017
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe Quentin Berthet Vianney Perchet 33 31 0 22 Feb 2017
Corralling a Band of Bandit Algorithms Alekh Agarwal Haipeng Luo Behnam Neyshabur Robert Schapire 27 154 0 19 Dec 2016