Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1209.2388
Cited By
On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization
11 September 2012
Ohad Shamir
Re-assign community
ArXiv
PDF
HTML
Papers citing
"On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization"
41 / 41 papers shown
Title
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
86
1
0
06 Mar 2025
Gradient-free stochastic optimization for additive models
A. Akhavan
Alexandre B. Tsybakov
41
0
0
03 Mar 2025
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
52
0
0
09 May 2024
Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Xinyu Tang
Ashwinee Panda
Milad Nasr
Saeed Mahloujifar
Prateek Mittal
50
18
0
09 Jan 2024
Optimal Rates for Bandit Nonstochastic Control
Y. Jennifer Sun
Stephen Newman
Elad Hazan
37
7
0
24 May 2023
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization
Le‐Yu Chen
Jing Xu
Luo Luo
36
15
0
16 Jan 2023
Invariant Lipschitz Bandits: A Side Observation Approach
Nam-Phuong Tran
Long Tran-Thanh
51
1
0
14 Dec 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
38
4
0
11 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaipeng Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Bacsar
87
27
0
10 Oct 2022
A Note on Zeroth-Order Optimization on the Simplex
Tijana Zrnic
Eric Mazumdar
36
0
0
02 Aug 2022
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback
A. Akhavan
Evgenii Chzhen
Massimiliano Pontil
Alexandre B. Tsybakov
30
19
0
27 May 2022
On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games
T. Tatarenko
Maryam Kamgarpour
22
8
0
22 Feb 2022
From the Greene--Wu Convolution to Gradient Estimation over Riemannian Manifolds
Tianyu Wang
Yifeng Huang
Didong Li
17
8
0
17 Aug 2021
Preference learning along multiple criteria: A game-theoretic perspective
Kush S. Bhatia
A. Pananjady
Peter L. Bartlett
Anca Dragan
Martin J. Wainwright
38
13
0
05 May 2021
Learning-NUM: Network Utility Maximization with Unknown Utility Functions and Queueing Delay
Xinzhe Fu
E. Modiano
21
18
0
16 Dec 2020
Quantum Algorithm for Online Convex Optimization
Jianhao He
Feidiao Yang
Jialin Zhang
Lvzhou Li
43
4
0
29 Jul 2020
Bandit Linear Control
Asaf B. Cassel
Tomer Koren
8
17
0
01 Jul 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control
Yan Zhang
Yi Zhou
Kaiyi Ji
Michael M. Zavlanos
23
40
0
18 Jun 2020
Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling
HanQin Cai
Daniel McKenzie
W. Yin
Zhenliang Zhang
63
49
0
29 Mar 2020
Naive Exploration is Optimal for Online LQR
Max Simchowitz
Dylan J. Foster
13
182
0
27 Jan 2020
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach
Yingying Li
Yujie Tang
Runyu Zhang
Na Li
24
101
0
19 Dec 2019
Bandit Convex Optimization in Non-stationary Environments
Peng Zhao
G. Wang
Lijun Zhang
Zhi-Hua Zhou
36
41
0
29 Jul 2019
Memory-Sample Tradeoffs for Linear Regression with Small Error
Vatsal Sharan
Aaron Sidford
Gregory Valiant
23
35
0
18 Apr 2019
Batched Multi-armed Bandits Problem
Zijun Gao
Yanjun Han
Zhimei Ren
Zhengqing Zhou
16
138
0
03 Apr 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
19
40
0
31 Jan 2019
Bandit learning in concave
N
N
N
-person games
Mario Bravo
David S. Leslie
P. Mertikopoulos
16
121
0
03 Oct 2018
Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points
Krishnakumar Balasubramanian
Saeed Ghadimi
ODL
22
100
0
17 Sep 2018
On the Local Minima of the Empirical Risk
Chi Jin
Lydia T. Liu
Rong Ge
Michael I. Jordan
FedML
16
56
0
25 Mar 2018
Regret Analysis for Continuous Dueling Bandit
Wataru Kumagai
34
27
0
21 Nov 2017
Stochastic Zeroth-order Optimization in High Dimensions
Yining Wang
S. Du
Sivaraman Balakrishnan
Aarti Singh
29
105
0
29 Oct 2017
Recursive Exponential Weighting for Online Non-convex Optimization
Ling Yang
Cheng Tan
W. Wong
9
2
0
13 Sep 2017
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe
Quentin Berthet
Vianney Perchet
36
31
0
22 Feb 2017
Highly-Smooth Zero-th Order Online Optimization Vianney Perchet
Francis R. Bach
Vianney Perchet
20
83
0
26 May 2016
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
115
6,690
0
19 Feb 2015
Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions
A. Belloni
Tengyuan Liang
Hariharan Narayanan
Alexander Rakhlin
30
77
0
28 Jan 2015
On the Complexity of Bandit Linear Optimization
Ohad Shamir
42
14
0
11 Aug 2014
Exploiting Smoothness in Statistical Learning, Sequential Prediction, and Stochastic Optimization
M. Mahdavi
63
4
0
19 Jul 2014
Unimodal Bandits without Smoothness
Richard Combes
Alexandre Proutiere
33
18
0
28 Jun 2014
Noisy Optimization: Convergence with a Fixed Number of Resamplings
Marie-Liesse Cauwet
25
3
0
09 Apr 2014
On Zeroth-Order Stochastic Convex Optimization via Random Walks
Tengyuan Liang
Hariharan Narayanan
Alexander Rakhlin
31
24
0
11 Feb 2014
Online Learning with Switching Costs and Other Adaptive Adversaries
Nicolò Cesa-Bianchi
O. Dekel
Ohad Shamir
21
119
0
18 Feb 2013
1