Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1511.05950
Cited By
Staleness-aware Async-SGD for Distributed Deep Learning
18 November 2015
Wei Zhang
Suyog Gupta
Xiangru Lian
Ji Liu
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Staleness-aware Async-SGD for Distributed Deep Learning"
13 / 13 papers shown
Title
ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates
Tingfeng Lan
Yusen Wu
Bin Ma
Zhaoyuan Su
Rui Yang
Tekin Bicer
Dong Li
Yue Cheng
43
0
0
18 May 2025
No Need to Talk: Asynchronous Mixture of Language Models
Anastasiia Filippova
Angelos Katharopoulos
David Grangier
Ronan Collobert
MoE
49
0
0
04 Oct 2024
Ordered Momentum for Asynchronous SGD
Chang-Wei Shi
Yi-Rui Yang
Wu-Jun Li
ODL
101
0
0
27 Jul 2024
Distributed Stochastic Gradient Descent with Staleness: A Stochastic Delay Differential Equation Based Framework
Siyuan Yu
Wei Chen
H. V. Poor
59
0
0
17 Jun 2024
Revolutionizing Wireless Networks with Federated Learning: A Comprehensive Review
Sajjad Emdadi Mahdimahalleh
AI4CE
48
0
0
01 Aug 2023
Model Accuracy and Runtime Tradeoff in Distributed Deep Learning:A Systematic Study
Suyog Gupta
Wei Zhang
Fei Wang
30
171
0
14 Sep 2015
Asynchronous Distributed Semi-Stochastic Gradient Optimization
Ruiliang Zhang
Shuai Zheng
James T. Kwok
ODL
26
22
0
07 Aug 2015
Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization
Xiangru Lian
Yijun Huang
Y. Li
Ji Liu
111
499
0
27 Jun 2015
Deep Image: Scaling up Image Recognition
Ren Wu
Shengen Yan
Yi Shan
Qingqing Dang
Gang Sun
VLM
43
373
0
13 Jan 2015
Example Selection For Dictionary Learning
Tomoki Tsuchida
G. Cottrell
26
47
0
18 Dec 2014
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky
Jia Deng
Hao Su
J. Krause
S. Satheesh
...
A. Karpathy
A. Khosla
Michael S. Bernstein
Alexander C. Berg
Li Fei-Fei
VLM
ObjD
544
39,383
0
01 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM
BDL
3DV
116
14,700
0
20 Jun 2014
Distributed Delayed Stochastic Optimization
Alekh Agarwal
John C. Duchi
74
626
0
28 Apr 2011
1