1805.07891
Parameter Hub: a Rack-Scale Parameter Server for Distributed Deep Neural Network Training
21 May 2018
Liang Luo
Jacob Nelson
Luis Ceze
Amar Phanishayee
Arvind Krishnamurthy
Papers citing
"Parameter Hub: a Rack-Scale Parameter Server for Distributed Deep Neural Network Training"
21 / 21 papers shown
Title
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
48
80
0
16 Mar 2018
Horovod: fast and easy distributed deep learning in TensorFlow
Alexander Sergeev
Mike Del Balso
59
1,218
0
15 Feb 2018
DeepConfig: Automating Data Center Network Topologies Management with Machine Learning
Christopher Streiffer
Huan Chen
Theophilus A. Benson
Asim Kadav
14
64
0
11 Dec 2017
Deep Gradient Compression: Reducing the Communication Bandwidth for Distributed Training
Yujun Lin
Song Han
Huizi Mao
Yu Wang
W. Dally
102
1,399
0
05 Dec 2017
Poseidon: An Efficient Communication Architecture for Distributed Deep Learning on GPU Clusters
Hao Zhang
Zeyu Zheng
Shizhen Xu
Wei-Ming Dai
Qirong Ho
Xiaodan Liang
Zhiting Hu
Jinliang Wei
P. Xie
Eric Xing
GNN
50
343
0
11 Jun 2017
Accurate, Large Minibatch SGD: Training ImageNet in 1 Hour
Priya Goyal
Piotr Dollár
Ross B. Girshick
P. Noordhuis
Lukasz Wesolowski
Aapo Kyrola
Andrew Tulloch
Yangqing Jia
Kaiming He
3DH
91
3,666
0
08 Jun 2017
Aggregated Residual Transformations for Deep Neural Networks
Saining Xie
Ross B. Girshick
Piotr Dollár
Zhuowen Tu
Kaiming He
430
10,281
0
16 Nov 2016
On Large-Batch Training for Deep Learning: Generalization Gap and Sharp Minima
N. Keskar
Dheevatsa Mudigere
J. Nocedal
M. Smelyanskiy
P. T. P. Tang
ODL
355
2,922
0
15 Sep 2016
Benchmarking State-of-the-Art Deep Learning Software Tools
Shaohuai Shi
Qiang Wang
Pengfei Xu
Xiaowen Chu
BDL
39
329
0
25 Aug 2016
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhifeng Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zheng
GNN
AI4CE
336
18,300
0
27 May 2016
Training Deep Nets with Sublinear Memory Cost
Tianqi Chen
Bing Xu
Chiyuan Zhang
Carlos Guestrin
84
1,156
0
21 Apr 2016
Revisiting Distributed Synchronous SGD
Jianmin Chen
Xinghao Pan
R. Monga
Samy Bengio
Rafal Jozefowicz
64
799
0
04 Apr 2016
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
294
14,196
0
23 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xiangyu Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
MXNet: A Flexible and Efficient Machine Learning Library for Heterogeneous Distributed Systems
Tianqi Chen
Mu Li
Yutian Li
Min Lin
Naiyan Wang
Minjie Wang
Tianjun Xiao
Bing Xu
Chiyuan Zhang
Zheng Zhang
117
2,243
0
03 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
478
27,231
0
02 Dec 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
50
302
0
31 Oct 2015
High-Performance Distributed ML at Scale through Parameter Server Consistency Models
Wei-Ming Dai
Abhimanu Kumar
Jinliang Wei
Qirong Ho
Garth A. Gibson
Eric Xing
37
120
0
29 Oct 2014
Distributed Machine Learning via Sufficient Factor Broadcasting
P. Xie
Jin Kyu Kim
Yi Zhou
Qirong Ho
Abhimanu Kumar
Yaoliang Yu
Eric Xing
21
1
0
19 Sep 2014
DimmWitted: A Study of Main-Memory Statistical Analytics
Ce Zhang
Christopher Ré
105
145
0
28 Mar 2014
HOGWILD!: A Lock-Free Approach to Parallelizing Stochastic Gradient Descent
Feng Niu
Benjamin Recht
Christopher Ré
Stephen J. Wright
137
2,272
0
28 Jun 2011