Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1607.03084
Cited By
Kernel-based methods for bandit convex optimization
11 July 2016
Sébastien Bubeck
Ronen Eldan
Y. Lee
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Kernel-based methods for bandit convex optimization"
29 / 29 papers shown
Title
Online Episodic Convex Reinforcement Learning
B. Moreno
Khaled Eldowa
Pierre Gaillard
Margaux Brégère
Nadia Oudjane
OffRL
29
0
0
12 May 2025
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
72
1
0
06 Mar 2025
A Regularized Online Newton Method for Stochastic Convex Bandits with Linear Vanishing Noise
Jingxin Zhan
Yuchen Xin
Kaicheng Jin
Zhihua Zhang
29
0
0
19 Jan 2025
Single Point-Based Distributed Zeroth-Order Optimization with a Non-Convex Stochastic Objective Function
Elissa Mhanna
Mohamad Assaad
51
4
0
08 Oct 2024
Online Newton Method for Bandit Convex Optimisation
Hidde Fokkema
Dirk van der Hoeven
Tor Lattimore
Jack J. Mayo
30
5
0
10 Jun 2024
Anytime Model Selection in Linear Bandits
Parnian Kassraie
N. Emmenegger
Andreas Krause
Aldo Pacchiano
46
2
0
24 Jul 2023
Fast Submodular Function Maximization
Lianke Qin
Zhao-quan Song
Yitan Wang
21
10
0
15 May 2023
Bandit Convex Optimisation Revisited: FTRL Achieves
O
~
(
t
1
/
2
)
\tilde{O}(t^{1/2})
O
~
(
t
1/2
)
Regret
David Young
D. Leith
Georgios Iosifidis
13
0
0
01 Feb 2023
Tight Guarantees for Interactive Decision Making with the Decision-Estimation Coefficient
Dylan J. Foster
Noah Golowich
Yanjun Han
OffRL
28
29
0
19 Jan 2023
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
19
4
0
11 Oct 2022
A Unifying Framework for Online Optimization with Long-Term Constraints
Matteo Castiglioni
A. Celli
A. Marchesi
Giulia Romano
N. Gatti
20
34
0
15 Sep 2022
Learning in Stackelberg Games with Non-myopic Agents
Nika Haghtalab
Thodoris Lykouris
Sloan Nietert
Alexander Wei
15
29
0
19 Aug 2022
A Near-Optimal Algorithm for Univariate Zeroth-Order Budget Convex Optimization
F. Bachoc
Tommaso Cesari
Roberto Colomboni
Andrea Paudice
24
2
0
13 Aug 2022
A Note on Zeroth-Order Optimization on the Simplex
Tijana Zrnic
Eric Mazumdar
26
0
0
02 Aug 2022
Lazy Queries Can Reduce Variance in Zeroth-order Optimization
Quan-Wu Xiao
Qing Ling
Tianyi Chen
41
0
0
14 Jun 2022
Building Robust Ensembles via Margin Boosting
Dinghuai Zhang
Hongyang R. Zhang
Aaron Courville
Yoshua Bengio
Pradeep Ravikumar
A. Suggala
AAML
UQCV
45
15
0
07 Jun 2022
Uncoupled Bandit Learning towards Rationalizability: Benchmarks, Barriers, and Algorithms
Jibang Wu
Haifeng Xu
Fan Yao
22
1
0
10 Nov 2021
Who Leads and Who Follows in Strategic Classification?
Tijana Zrnic
Eric Mazumdar
S. Shankar Sastry
Michael I. Jordan
26
50
0
23 Jun 2021
Bandit Linear Optimization for Sequential Decision Making and Extensive-Form Games
Gabriele Farina
Robin Schmucker
T. Sandholm
23
21
0
08 Mar 2021
Quantum Algorithm for Online Convex Optimization
Jianhao He
Feidiao Yang
Jialin Zhang
Lvzhou Li
25
4
0
29 Jul 2020
Zeroth-Order Algorithms for Nonconvex Minimax Problems with Improved Complexities
Zhongruo Wang
Krishnakumar Balasubramanian
Shiqian Ma
Meisam Razaviyayn
13
25
0
22 Jan 2020
Distributed Online Optimization with Long-Term Constraints
Deming Yuan
Alexandre Proutière
Guodong Shi
18
57
0
20 Dec 2019
Learning Strategy-Aware Linear Classifiers
Yiling Chen
Yang Liu
Chara Podimata
13
9
0
10 Nov 2019
Minimax Regret of Switching-Constrained Online Convex Optimization: No Phase Transition
Lin Chen
Qian-long Yu
Hannah Lawrence
Amin Karbasi
24
20
0
24 Oct 2019
No-Regret Learning in Unknown Games with Correlated Payoffs
Pier Giuseppe Sessa
Ilija Bogunovic
Maryam Kamgarpour
Andreas Krause
OffRL
32
39
0
18 Sep 2019
Non-stationary Stochastic Optimization under
L
p
,
q
L_{p,q}
L
p
,
q
-Variation Measures
Xi Chen
Yining Wang
Yu-Xiang Wang
14
12
0
09 Aug 2017
Bandits with Movement Costs and Adaptive Pricing
Tomer Koren
Roi Livni
Yishay Mansour
19
20
0
24 Feb 2017
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe
Quentin Berthet
Vianney Perchet
33
31
0
22 Feb 2017
Corralling a Band of Bandit Algorithms
Alekh Agarwal
Haipeng Luo
Behnam Neyshabur
Robert Schapire
27
154
0
19 Dec 2016
1