Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1309.2388
Cited By
v1
v2 (latest)
Minimizing Finite Sums with the Stochastic Average Gradient
10 September 2013
Mark Schmidt
Nicolas Le Roux
Francis R. Bach
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Minimizing Finite Sums with the Stochastic Average Gradient"
50 / 506 papers shown
Title
Team QCRI-MIT at SemEval-2019 Task 4: Propaganda Analysis Meets Hyperpartisan News Detection
Abdelrhman Saleh
R. Baly
Alberto Barrón-Cedeño
Giovanni Da San Martino
Mitra Mohtarami
Preslav Nakov
James R. Glass
82
18
0
06 Apr 2019
Convergence rates for optimised adaptive importance samplers
Ömer Deniz Akyildiz
Joaquín Míguez
133
31
0
28 Mar 2019
Block stochastic gradient descent for large-scale tomographic reconstruction in a parallel network
Yushan Gao
A. Biguri
T. Blumensath
60
3
0
28 Mar 2019
Cocoercivity, Smoothness and Bias in Variance-Reduced Stochastic Gradient Methods
Martin Morin
Pontus Giselsson
55
2
0
21 Mar 2019
Recovery Bounds on Class-Based Optimal Transport: A Sum-of-Norms Regularization Framework
Arman Rahbar
Ashkan Panahi
M. Chehreghani
Devdatt Dubhashi
Hamid Krim
135
0
0
09 Mar 2019
SGD without Replacement: Sharper Rates for General Smooth Convex Functions
Prateek Jain
Dheeraj M. Nagaraj
Praneeth Netrapalli
87
87
0
04 Mar 2019
Stochastic Conditional Gradient++
Hamed Hassani
Amin Karbasi
Aryan Mokhtari
Zebang Shen
70
23
0
19 Feb 2019
ProxSARAH: An Efficient Algorithmic Framework for Stochastic Composite Nonconvex Optimization
Nhan H. Pham
Lam M. Nguyen
Dzung Phan
Quoc Tran-Dinh
80
141
0
15 Feb 2019
Do Subsampled Newton Methods Work for High-Dimensional Data?
Xiang Li
Shusen Wang
Zhihua Zhang
68
13
0
13 Feb 2019
Efficient Primal-Dual Algorithms for Large-Scale Multiclass Classification
Dmitry Babichev
Dmitrii Ostrovskii
Francis R. Bach
VLM
51
3
0
11 Feb 2019
A Smoother Way to Train Structured Prediction Models
Krishna Pillutla
Vincent Roulet
Sham Kakade
Zaïd Harchaoui
77
20
0
08 Feb 2019
Momentum Schemes with Stochastic Variance Reduction for Nonconvex Composite Optimization
Yi Zhou
Zhe Wang
Kaiyi Ji
Yingbin Liang
Vahid Tarokh
ODL
82
14
0
07 Feb 2019
Stochastic first-order methods: non-asymptotic and computer-aided analyses via potential functions
Adrien B. Taylor
Francis R. Bach
79
64
0
03 Feb 2019
Stochastic Gradient Descent for Nonconvex Learning without Bounded Gradient Assumptions
Yunwen Lei
Ting Hu
Guiying Li
K. Tang
MLT
93
119
0
03 Feb 2019
Sharp Analysis for Nonconvex SGD Escaping from Saddle Points
Cong Fang
Zhouchen Lin
Tong Zhang
85
104
0
01 Feb 2019
Optimal mini-batch and step sizes for SAGA
Nidham Gazagnadou
Robert Mansel Gower
Joseph Salmon
90
35
0
31 Jan 2019
Quasi-Newton Methods for Machine Learning: Forget the Past, Just Sample
A. Berahas
Majid Jahani
Peter Richtárik
Martin Takávc
102
41
0
28 Jan 2019
Asynchronous Accelerated Proximal Stochastic Gradient for Strongly Convex Distributed Finite Sums
Hadrien Hendrikx
Francis R. Bach
Laurent Massoulié
FedML
67
26
0
28 Jan 2019
99% of Distributed Optimization is a Waste of Time: The Issue and How to Fix it
Konstantin Mishchenko
Filip Hanzely
Peter Richtárik
59
13
0
27 Jan 2019
Estimate Sequences for Stochastic Composite Optimization: Variance Reduction, Acceleration, and Robustness to Noise
A. Kulunchakov
Julien Mairal
88
45
0
25 Jan 2019
Don't Jump Through Hoops and Remove Those Loops: SVRG and Katyusha are Better Without the Outer Loop
D. Kovalev
Samuel Horváth
Peter Richtárik
122
156
0
24 Jan 2019
SAGA with Arbitrary Sampling
Xun Qian
Zheng Qu
Peter Richtárik
83
26
0
24 Jan 2019
Trajectory Normalized Gradients for Distributed Optimization
Jianqiao Wangni
Ke Li
Jianbo Shi
Jitendra Malik
44
2
0
24 Jan 2019
Finite-Sum Smooth Optimization with SARAH
Lam M. Nguyen
Marten van Dijk
Dzung Phan
Phuong Ha Nguyen
Tsui-Wei Weng
Jayant Kalagnanam
69
23
0
22 Jan 2019
DTN: A Learning Rate Scheme with Convergence Rate of
O
(
1
/
t
)
\mathcal{O}(1/t)
O
(
1/
t
)
for SGD
Lam M. Nguyen
Phuong Ha Nguyen
Dzung Phan
Jayant Kalagnanam
Marten van Dijk
41
0
0
22 Jan 2019
Quantized Epoch-SGD for Communication-Efficient Distributed Learning
Shen-Yi Zhao
Hao Gao
Wu-Jun Li
FedML
53
3
0
10 Jan 2019
The Lingering of Gradients: Theory and Applications
Zeyuan Allen-Zhu
D. Simchi-Levi
Xinshang Wang
102
4
0
09 Jan 2019
SGD Converges to Global Minimum in Deep Learning via Star-convex Path
Yi Zhou
Junjie Yang
Huishuai Zhang
Yingbin Liang
Vahid Tarokh
77
74
0
02 Jan 2019
A continuous-time analysis of distributed stochastic gradient
Nicholas M. Boffi
Jean-Jacques E. Slotine
46
15
0
28 Dec 2018
Stochastic Trust Region Inexact Newton Method for Large-scale Machine Learning
Vinod Kumar Chauhan
A. Sharma
Kalpana Dahiya
21
6
0
26 Dec 2018
Tight Analyses for Non-Smooth Stochastic Gradient Descent
Nicholas J. A. Harvey
Christopher Liaw
Y. Plan
Sikander Randhawa
79
138
0
13 Dec 2018
On the Ineffectiveness of Variance Reduced Optimization for Deep Learning
Aaron Defazio
Léon Bottou
UQCV
DRL
93
113
0
11 Dec 2018
Inexact SARAH Algorithm for Stochastic Optimization
Lam M. Nguyen
K. Scheinberg
Martin Takáč
88
51
0
25 Nov 2018
Asynchronous Stochastic Composition Optimization with Variance Reduction
Shuheng Shen
Linli Xu
Jingchang Liu
Junliang Guo
Qing Ling
64
2
0
15 Nov 2018
R-SPIDER: A Fast Riemannian Stochastic Optimization Algorithm with Curvature Independent Rate
J.N. Zhang
Hongyi Zhang
S. Sra
76
39
0
10 Nov 2018
Machine Learning Methods for Track Classification in the AT-TPC
M. Kuchera
R. Ramanujan
Jack Z. Taylor
R. Strauss
D. Bazin
J. Bradt
Ruiming Chen
47
33
0
21 Oct 2018
Multi-Agent Fully Decentralized Value Function Learning with Linear Convergence Rates
Lucas Cassano
Kun Yuan
Ali H. Sayed
81
40
0
17 Oct 2018
Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron
Sharan Vaswani
Francis R. Bach
Mark Schmidt
116
301
0
16 Oct 2018
Quasi-hyperbolic momentum and Adam for deep learning
Jerry Ma
Denis Yarats
ODL
159
130
0
16 Oct 2018
Real time expert system for anomaly detection of aerators based on computer vision technology and existing surveillance cameras
Yeqi Liu
Yingyi Chen
Huihui Yu
X. Fang
Chuanyang Gong
36
2
0
09 Oct 2018
Characterization of Convex Objective Functions and Optimal Expected Convergence Rates for SGD
Marten van Dijk
Lam M. Nguyen
Phuong Ha Nguyen
Dzung Phan
86
6
0
09 Oct 2018
ASVRG: Accelerated Proximal SVRG
Fanhua Shang
L. Jiao
Kaiwen Zhou
James Cheng
Yan Ren
Yufei Jin
ODL
96
31
0
07 Oct 2018
A fast quasi-Newton-type method for large-scale stochastic optimisation
A. Wills
Carl Jidling
Thomas B. Schon
ODL
57
7
0
29 Sep 2018
Sparsified SGD with Memory
Sebastian U. Stich
Jean-Baptiste Cordonnier
Martin Jaggi
106
753
0
20 Sep 2018
Quantum Algorithms for Structured Prediction
Behrooz Sepehry
E. Iranmanesh
M. Friedlander
Pooya Ronagh
32
2
0
11 Sep 2018
Compositional Stochastic Average Gradient for Machine Learning and Related Applications
Tsung-Yu Hsieh
Y. El-Manzalawy
Yiwei Sun
Vasant Honavar
44
1
0
04 Sep 2018
Ensemble Kalman Inversion: A Derivative-Free Technique For Machine Learning Tasks
Nikola B. Kovachki
Andrew M. Stuart
BDL
107
138
0
10 Aug 2018
Fast Variance Reduction Method with Stochastic Batch Size
Xuanqing Liu
Cho-Jui Hsieh
91
5
0
07 Aug 2018
Efficient Training on Very Large Corpora via Gramian Estimation
Walid Krichene
Nicolas Mayoraz
Steffen Rendle
Li Zhang
Xinyang Yi
Lichan Hong
Ed H. Chi
John R. Anderson
65
48
0
18 Jul 2018
On the Acceleration of L-BFGS with Second-Order Information and Stochastic Batches
Jie Liu
Yu Rong
Martin Takáč
Junzhou Huang
ODL
68
7
0
14 Jul 2018
Previous
1
2
3
...
5
6
7
...
9
10
11
Next