On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization

11 September 2012

Papers citing "On the Complexity of Bandit and Derivative-Free Stochastic Convex Optimization"

41 / 41 papers shown

Title
Greedy Algorithm for Structured Bandits: A Sharp Characterization of Asymptotic Success / Failure Aleksandrs Slivkins Yunzong Xu Shiliang Zuo 86 1 0 06 Mar 2025
Gradient-free stochastic optimization for additive models A. Akhavan Alexandre B. Tsybakov 41 0 0 03 Mar 2025
Batched Stochastic Bandit for Nondegenerate Functions Yu Liu Yunlu Shu Tianyu Wang 52 0 0 09 May 2024
Private Fine-tuning of Large Language Models with Zeroth-order Optimization Xinyu Tang Ashwinee Panda Milad Nasr Saeed Mahloujifar Prateek Mittal 50 18 0 09 Jan 2024
Optimal Rates for Bandit Nonstochastic Control Y. Jennifer Sun Stephen Newman Elad Hazan 37 7 0 24 May 2023
Faster Gradient-Free Algorithms for Nonsmooth Nonconvex Stochastic Optimization Le‐Yu Chen Jing Xu Luo Luo 36 15 0 16 Jan 2023
Invariant Lipschitz Bandits: A Side Observation Approach Nam-Phuong Tran Long Tran-Thanh 51 1 0 14 Dec 2022
Zero-Order One-Point Estimate with Distributed Stochastic Gradient-Tracking Technique Elissa Mhanna Mohamad Assaad 38 4 0 11 Oct 2022
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies Bin Hu Kaipeng Zhang Na Li M. Mesbahi Maryam Fazel Tamer Bacsar 87 27 0 10 Oct 2022
A Note on Zeroth-Order Optimization on the Simplex Tijana Zrnic Eric Mazumdar 36 0 0 02 Aug 2022
A gradient estimator via L1-randomization for online zero-order optimization with two point feedback A. Akhavan Evgenii Chzhen Massimiliano Pontil Alexandre B. Tsybakov 30 19 0 27 May 2022
On the Rate of Convergence of Payoff-based Algorithms to Nash Equilibrium in Strongly Monotone Games T. Tatarenko Maryam Kamgarpour 22 8 0 22 Feb 2022
From the Greene--Wu Convolution to Gradient Estimation over Riemannian Manifolds Tianyu Wang Yifeng Huang Didong Li 17 8 0 17 Aug 2021
Preference learning along multiple criteria: A game-theoretic perspective Kush S. Bhatia A. Pananjady Peter L. Bartlett Anca Dragan Martin J. Wainwright 38 13 0 05 May 2021
Learning-NUM: Network Utility Maximization with Unknown Utility Functions and Queueing Delay Xinzhe Fu E. Modiano 21 18 0 16 Dec 2020
Quantum Algorithm for Online Convex Optimization Jianhao He Feidiao Yang Jialin Zhang Lvzhou Li 43 4 0 29 Jul 2020
Bandit Linear Control Asaf B. Cassel Tomer Koren 8 17 0 01 Jul 2020
A New One-Point Residual-Feedback Oracle For Black-Box Learning and Control Yan Zhang Yi Zhou Kaiyi Ji Michael M. Zavlanos 23 40 0 18 Jun 2020
Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling HanQin Cai Daniel McKenzie W. Yin Zhenliang Zhang 63 49 0 29 Mar 2020
Naive Exploration is Optimal for Online LQR Max Simchowitz Dylan J. Foster 13 182 0 27 Jan 2020
Distributed Reinforcement Learning for Decentralized Linear Quadratic Control: A Derivative-Free Policy Optimization Approach Yingying Li Yujie Tang Runyu Zhang Na Li 24 101 0 19 Dec 2019
Bandit Convex Optimization in Non-stationary Environments Peng Zhao G. Wang Lijun Zhang Zhi-Hua Zhou 36 41 0 29 Jul 2019
Memory-Sample Tradeoffs for Linear Regression with Small Error Vatsal Sharan Aaron Sidford Gregory Valiant 23 35 0 18 Apr 2019
Batched Multi-armed Bandits Problem Zijun Gao Yanjun Han Zhimei Ren Zhengqing Zhou 16 138 0 03 Apr 2019
Contrasting Exploration in Parameter and Action Space: A Zeroth-Order Optimization Perspective Anirudh Vemula Wen Sun J. Andrew Bagnell 19 40 0 31 Jan 2019
Bandit learning in concave $N$ -person games Mario Bravo David S. Leslie P. Mertikopoulos 16 121 0 03 Oct 2018
Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points Krishnakumar Balasubramanian Saeed Ghadimi ODL 22 100 0 17 Sep 2018
On the Local Minima of the Empirical Risk Chi Jin Lydia T. Liu Rong Ge Michael I. Jordan FedML 16 56 0 25 Mar 2018
Regret Analysis for Continuous Dueling Bandit Wataru Kumagai 34 27 0 21 Nov 2017
Stochastic Zeroth-order Optimization in High Dimensions Yining Wang S. Du Sivaraman Balakrishnan Aarti Singh 29 105 0 29 Oct 2017
Recursive Exponential Weighting for Online Non-convex Optimization Ling Yang Cheng Tan W. Wong 9 2 0 13 Sep 2017
Fast Rates for Bandit Optimization with Upper-Confidence Frank-Wolfe Quentin Berthet Vianney Perchet 36 31 0 22 Feb 2017
Highly-Smooth Zero-th Order Online Optimization Vianney Perchet Francis R. Bach Vianney Perchet 20 83 0 26 May 2016
Trust Region Policy Optimization John Schulman Sergey Levine Philipp Moritz Michael I. Jordan Pieter Abbeel 115 6,690 0 19 Feb 2015
Escaping the Local Minima via Simulated Annealing: Optimization of Approximately Convex Functions A. Belloni Tengyuan Liang Hariharan Narayanan Alexander Rakhlin 30 77 0 28 Jan 2015
On the Complexity of Bandit Linear Optimization Ohad Shamir 42 14 0 11 Aug 2014
Exploiting Smoothness in Statistical Learning, Sequential Prediction, and Stochastic Optimization M. Mahdavi 63 4 0 19 Jul 2014
Unimodal Bandits without Smoothness Richard Combes Alexandre Proutiere 33 18 0 28 Jun 2014
Noisy Optimization: Convergence with a Fixed Number of Resamplings Marie-Liesse Cauwet 25 3 0 09 Apr 2014
On Zeroth-Order Stochastic Convex Optimization via Random Walks Tengyuan Liang Hariharan Narayanan Alexander Rakhlin 31 24 0 11 Feb 2014
Online Learning with Switching Costs and Other Adaptive Adversaries Nicolò Cesa-Bianchi O. Dekel Ohad Shamir 21 119 0 18 Feb 2013