
Minimizing Finite Sums with the Stochastic Average Gradient

10 September 2013

Mark Schmidt

Nicolas Le Roux

Francis R. Bach


Papers citing "Minimizing Finite Sums with the Stochastic Average Gradient"

50 / 506 papers shown

Title
Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection Abdelrhman Saleh R. Baly Alberto Barrón-Cedeño Giovanni Da San Martino Mitra Mohtarami Preslav Nakov James R. Glass 82 18 0 06 Apr 2019
Convergence rates for optimised adaptive importance samplers Ömer Deniz Akyildiz Joaquín Míguez 133 31 0 28 Mar 2019
Block stochastic gradient descent for large-scale tomographic reconstruction in a parallel network Yushan Gao A. Biguri T. Blumensath 60 3 0 28 Mar 2019
Cocoercivity, Smoothness and Bias in Variance-Reduced Stochastic Gradient Methods Martin Morin Pontus Giselsson 55 2 0 21 Mar 2019
Recovery Bounds on Class-Based Optimal Transport: A Sum-of-Norms Regularization Framework Arman Rahbar Ashkan Panahi M. Chehreghani Devdatt Dubhashi Hamid Krim 135 0 0 09 Mar 2019
SGD without Replacement: Sharper Rates for General Smooth Convex Functions Prateek Jain Dheeraj M. Nagaraj Praneeth Netrapalli 87 87 0 04 Mar 2019
Stochastic Conditional Gradient++ Hamed Hassani Amin Karbasi Aryan Mokhtari Zebang Shen 70 23 0 19 Feb 2019
ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization Nhan H. Pham Lam M. Nguyen Dzung Phan Quoc Tran-Dinh 80 141 0 15 Feb 2019
Do Subsampled Newton Methods Work for High-Dimensional Data? Xiang Li Shusen Wang Zhihua Zhang 68 13 0 13 Feb 2019
Efficient Primal-Dual Algorithms for Large-Scale Multiclass Classification Dmitry Babichev Dmitrii Ostrovskii Francis R. Bach VLM 51 3 0 11 Feb 2019
A Smoother Way to Train Structured Prediction Models Krishna Pillutla Vincent Roulet Sham Kakade Zaïd Harchaoui 77 20 0 08 Feb 2019
Momentum Schemes with Stochastic Variance Reduction for Nonconvex Composite Optimization Yi Zhou Zhe Wang Kaiyi Ji Yingbin Liang Vahid Tarokh ODL 82 14 0 07 Feb 2019
Stochastic first-order methods: non-asymptotic and computer-aided analyses via potential functions Adrien B. Taylor Francis R. Bach 79 64 0 03 Feb 2019
Stochastic Gradient Descent for Nonconvex Learning without Bounded Gradient Assumptions Yunwen Lei Ting Hu Guiying Li K. Tang MLT 93 119 0 03 Feb 2019
Sharp Analysis for Nonconvex SGD Escaping from Saddle Points Cong Fang Zhouchen Lin Tong Zhang 85 104 0 01 Feb 2019
Optimal mini-batch and step sizes for SAGA Nidham Gazagnadou Robert Mansel Gower Joseph Salmon 90 35 0 31 Jan 2019
Quasi-Newton Methods for Machine Learning: Forget the Past, Just Sample A. Berahas Majid Jahani Peter Richtárik Martin Takáč 102 41 0 28 Jan 2019
Asynchronous Accelerated Proximal Stochastic Gradient for Strongly Convex Distributed Finite Sums Hadrien Hendrikx Francis R. Bach Laurent Massoulié FedML 67 26 0 28 Jan 2019
99% of Distributed Optimization is a Waste of Time: The Issue and How to Fix it Konstantin Mishchenko Filip Hanzely Peter Richtárik 59 13 0 27 Jan 2019
Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise A. Kulunchakov Julien Mairal 88 45 0 25 Jan 2019
Don't Jump Through Hoops and Remove Those Loops: SVRG and Katyusha are Better Without the Outer Loop D. Kovalev Samuel Horváth Peter Richtárik 122 156 0 24 Jan 2019
SAGA with Arbitrary Sampling Xun Qian Zheng Qu Peter Richtárik 83 26 0 24 Jan 2019
Trajectory Normalized Gradients for Distributed Optimization Jianqiao Wangni Ke Li Jianbo Shi Jitendra Malik 44 2 0 24 Jan 2019
Finite-Sum Smooth Optimization with SARAH Lam M. Nguyen Marten van Dijk Dzung Phan Phuong Ha Nguyen Tsui-Wei Weng Jayant Kalagnanam 69 23 0 22 Jan 2019
DTN: A Learning Rate Scheme with Convergence Rate of $\mathcal{O}(1/t)$ for SGD Lam M. Nguyen Phuong Ha Nguyen Dzung Phan Jayant Kalagnanam Marten van Dijk 41 0 0 22 Jan 2019
Quantized Epoch-SGD for Communication-Efficient Distributed Learning Shen-Yi Zhao Hao Gao Wu-Jun Li FedML 53 3 0 10 Jan 2019
The Lingering of Gradients: Theory and Applications Zeyuan Allen-Zhu D. Simchi-Levi Xinshang Wang 102 4 0 09 Jan 2019
SGD Converges to Global Minimum in Deep Learning via Star-convex Path Yi Zhou Junjie Yang Huishuai Zhang Yingbin Liang Vahid Tarokh 77 74 0 02 Jan 2019
A continuous-time analysis of distributed stochastic gradient Nicholas M. Boffi Jean-Jacques E. Slotine 46 15 0 28 Dec 2018
Stochastic Trust Region Inexact Newton Method for Large-scale Machine Learning Vinod Kumar Chauhan A. Sharma Kalpana Dahiya 21 6 0 26 Dec 2018
Tight Analyses for Non-Smooth Stochastic Gradient Descent Nicholas J. A. Harvey Christopher Liaw Y. Plan Sikander Randhawa 79 138 0 13 Dec 2018
On the Ineffectiveness of Variance Reduced Optimization for Deep Learning Aaron Defazio Léon Bottou UQCV DRL 93 113 0 11 Dec 2018
Inexact SARAH Algorithm for Stochastic Optimization Lam M. Nguyen K. Scheinberg Martin Takáč 88 51 0 25 Nov 2018
Asynchronous Stochastic Composition Optimization with Variance Reduction Shuheng Shen Linli Xu Jingchang Liu Junliang Guo Qing Ling 64 2 0 15 Nov 2018
R-SPIDER: A Fast Riemannian Stochastic Optimization Algorithm with Curvature Independent Rate J.N. Zhang Hongyi Zhang S. Sra 76 39 0 10 Nov 2018
Machine Learning Methods for Track Classification in the AT-TPC M. Kuchera R. Ramanujan Jack Z. Taylor R. Strauss D. Bazin J. Bradt Ruiming Chen 47 33 0 21 Oct 2018
Multi-Agent Fully Decentralized Value Function Learning with Linear Convergence Rates Lucas Cassano Kun Yuan Ali H. Sayed 81 40 0 17 Oct 2018
Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron Sharan Vaswani Francis R. Bach Mark Schmidt 116 301 0 16 Oct 2018
Quasi-hyperbolic momentum and Adam for deep learning Jerry Ma Denis Yarats ODL 159 130 0 16 Oct 2018
Real time expert system for anomaly detection of aerators based on computer vision technology and existing surveillance cameras Yeqi Liu Yingyi Chen Huihui Yu X. Fang Chuanyang Gong 36 2 0 09 Oct 2018
Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD Marten van Dijk Lam M. Nguyen Phuong Ha Nguyen Dzung Phan 86 6 0 09 Oct 2018
ASVRG: Accelerated Proximal SVRG Fanhua Shang L. Jiao Kaiwen Zhou James Cheng Yan Ren Yufei Jin ODL 96 31 0 07 Oct 2018
A fast quasi-Newton-type method for large-scale stochastic optimisation A. Wills Carl Jidling Thomas B. Schon ODL 57 7 0 29 Sep 2018
Sparsified SGD with Memory Sebastian U. Stich Jean-Baptiste Cordonnier Martin Jaggi 106 753 0 20 Sep 2018
Quantum Algorithms for Structured Prediction Behrooz Sepehry E. Iranmanesh M. Friedlander Pooya Ronagh 32 2 0 11 Sep 2018
Compositional Stochastic Average Gradient for Machine Learning and Related Applications Tsung-Yu Hsieh Y. El-Manzalawy Yiwei Sun Vasant Honavar 44 1 0 04 Sep 2018
Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks Nikola B. Kovachki Andrew M. Stuart BDL 107 138 0 10 Aug 2018
Fast Variance Reduction Method with Stochastic Batch Size Xuanqing Liu Cho-Jui Hsieh 91 5 0 07 Aug 2018
Efficient Training on Very Large Corpora via Gramian Estimation Walid Krichene Nicolas Mayoraz Steffen Rendle Li Zhang Xinyang Yi Lichan Hong Ed H. Chi John R. Anderson 65 48 0 18 Jul 2018
On the Acceleration of L-BFGS with Second-Order Information and Stochastic Batches Jie Liu Yu Rong Martin Takáč Junzhou Huang ODL 68 7 0 14 Jul 2018