On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization

11 September 2012
Ohad Shamir
arXiv:1209.2388

Papers citing "On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization"

41 / 41 papers shown
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure
Aleksandrs Slivkins
Yunzong Xu
Shiliang Zuo
86
1
0
06 Mar 2025
Gradient-free stochastic optimization for additive models
A. Akhavan
Alexandre B. Tsybakov
41
0
0
03 Mar 2025
Batched Stochastic Bandit for Nondegenerate Functions
Yu Liu
Yunlu Shu
Tianyu Wang
52
0
0
09 May 2024
Private Fine-tuning of Large Language Models with Zeroth-order Optimization
Xinyu Tang
Ashwinee Panda
Milad Nasr
Saeed Mahloujifar
Prateek Mittal
50
18
0
09 Jan 2024
Optimal Rates for Bandit Nonstochastic Control
Y. Jennifer Sun
Stephen Newman
Elad Hazan
37
7
0
24 May 2023
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization
Le‐Yu Chen
Jing Xu
Luo Luo
36
15
0
16 Jan 2023
Invariant Lipschitz Bandits: A Side Observation Approach
Nam-Phuong Tran
Long Tran-Thanh
51
1
0
14 Dec 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique
Elissa Mhanna
Mohamad Assaad
38
4
0
11 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Bin Hu
Kaiqing Zhang
Na Li
M. Mesbahi
Maryam Fazel
Tamer Başar
89
27
0
10 Oct 2022
A Note on Zeroth-Order Optimization on the Simplex
Tijana Zrnic
Eric Mazumdar
36
0
0
02 Aug 2022
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback
A. Akhavan
Evgenii Chzhen
Massimiliano Pontil
Alexandre B. Tsybakov
32
19
0
27 May 2022
On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games
T. Tatarenko
Maryam Kamgarpour
22
8
0
22 Feb 2022
From the Greene--Wu Convolution to Gradient Estimation over Riemannian Manifolds
Tianyu Wang
Yifeng Huang
Didong Li
17
8
0
17 Aug 2021
Preference learning along multiple criteria: A game-theoretic perspective
Kush S. Bhatia
A. Pananjady
Peter L. Bartlett
Anca Dragan
Martin J. Wainwright
40
13
0
05 May 2021
Learning-NUM: Network Utility Maximization with Unknown Utility Functions and Queueing Delay
Xinzhe Fu
E. Modiano
21
18
0
16 Dec 2020
Quantum Algorithm for Online Convex Optimization
Jianhao He
Feidiao Yang
Jialin Zhang
Lvzhou Li
43
4
0
29 Jul 2020
Bandit Linear Control
Asaf B. Cassel
Tomer Koren
10
17
0
01 Jul 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control
Yan Zhang
Yi Zhou
Kaiyi Ji
Michael M. Zavlanos
23
40
0
18 Jun 2020
Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling
HanQin Cai
Daniel McKenzie
W. Yin
Zhenliang Zhang
63
49
0
29 Mar 2020
Naive Exploration is Optimal for Online LQR
Max Simchowitz
Dylan J. Foster
21
182
0
27 Jan 2020
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach
Yingying Li
Yujie Tang
Runyu Zhang
Na Li
24
101
0
19 Dec 2019
Bandit Convex Optimization in Non-stationary Environments
Peng Zhao
G. Wang
Lijun Zhang
Zhi-Hua Zhou
36
41
0
29 Jul 2019
Memory-Sample Tradeoffs for Linear Regression with Small Error
Vatsal Sharan
Aaron Sidford
Gregory Valiant
23
35
0
18 Apr 2019
Batched Multi-armed Bandits Problem
Zijun Gao
Yanjun Han
Zhimei Ren
Zhengqing Zhou
16
138
0
03 Apr 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective
Anirudh Vemula
Wen Sun
J. Andrew Bagnell
21
40
0
31 Jan 2019
Bandit learning in concave $N$-person games
Mario Bravo
David S. Leslie
P. Mertikopoulos
16
121
0
03 Oct 2018
Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points
Krishnakumar Balasubramanian
Saeed Ghadimi
24
100
0
17 Sep 2018
On the Local Minima of the Empirical Risk
Chi Jin
Lydia T. Liu
Rong Ge
Michael I. Jordan
21
56
0
25 Mar 2018
Regret Analysis for Continuous Dueling Bandit
Wataru Kumagai
34
27
0
21 Nov 2017
Stochastic Zeroth-order Optimization in High Dimensions
Yining Wang
S. Du
Sivaraman Balakrishnan
Aarti Singh
29
105
0
29 Oct 2017
Recursive Exponential Weighting for Online Non-convex Optimization
Ling Yang
Cheng Tan
W. Wong
9
2
0
13 Sep 2017
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe
Quentin Berthet
Vianney Perchet
36
31
0
22 Feb 2017
Highly-Smooth Zero-th Order Online Optimization
Francis R. Bach
Vianney Perchet
22
83
0
26 May 2016
Trust Region Policy Optimization
John Schulman
Sergey Levine
Philipp Moritz
Michael I. Jordan
Pieter Abbeel
121
6,690
0
19 Feb 2015
Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions
A. Belloni
Tengyuan Liang
Hariharan Narayanan
Alexander Rakhlin
32
77
0
28 Jan 2015
On the Complexity of Bandit Linear Optimization
Ohad Shamir
45
14
0
11 Aug 2014
Exploiting Smoothness in Statistical Learning, Sequential Prediction, and Stochastic Optimization
M. Mahdavi
63
4
0
19 Jul 2014
Unimodal Bandits without Smoothness
Richard Combes
Alexandre Proutiere
35
18
0
28 Jun 2014
Noisy Optimization: Convergence with a Fixed Number of Resamplings
Marie-Liesse Cauwet
25
3
0
09 Apr 2014
On Zeroth-Order Stochastic Convex Optimization via Random Walks
Tengyuan Liang
Hariharan Narayanan
Alexander Rakhlin
33
24
0
11 Feb 2014
Online Learning with Switching Costs and Other Adaptive Adversaries
Nicolò Cesa-Bianchi
O. Dekel
Ohad Shamir
23
119
0
18 Feb 2013