Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1802.09941
Cited By
v1
v2 (latest)
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
26 February 2018
Tal Ben-Nun
Torsten Hoefler
GNN
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis"
50 / 154 papers shown
Title
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
210
8,882
0
04 Feb 2016
Poseidon: A System Architecture for Efficient GPU-based Deep Learning on Multiple Machines
Huatian Zhang
Zhiting Hu
Jinliang Wei
P. Xie
Gunhee Kim
Qirong Ho
Eric Xing
GNN
32
48
0
19 Dec 2015
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
146
2,976
0
08 Dec 2015
Multi-Scale Context Aggregation by Dilated Convolutions
Feng Yu
V. Koltun
SSeg
278
8,464
0
23 Nov 2015
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Yong-Deok Kim
Eunhyeok Park
S. Yoo
Taelim Choi
Lu Yang
Dongjun Shin
124
895
0
20 Nov 2015
Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks
Stefan Lee
Senthil Purushwalkam
Michael Cogswell
David J. Crandall
Dhruv Batra
FedML
UQCV
110
316
0
19 Nov 2015
SparkNet: Training Deep Networks in Spark
Philipp Moritz
Robert Nishihara
Ion Stoica
Michael I. Jordan
113
171
0
19 Nov 2015
Staleness-aware Async-SGD for Distributed Deep Learning
Wei Zhang
Suyog Gupta
Xiangru Lian
Ji Liu
88
266
0
18 Nov 2015
8-Bit Approximations for Parallelism in Deep Learning
Tim Dettmers
81
176
0
14 Nov 2015
BinaryConnect: Training Deep Neural Networks with binary weights during propagations
Matthieu Courbariaux
Yoshua Bengio
J. David
MQ
225
2,994
0
02 Nov 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
86
302
0
31 Oct 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
263
8,864
0
01 Oct 2015
Fast Algorithms for Convolutional Neural Networks
Andrew Lavin
Scott Gray
74
879
0
30 Sep 2015
On the Expressive Power of Deep Learning: A Tensor Analysis
Nadav Cohen
Or Sharir
Amnon Shashua
83
472
0
16 Sep 2015
Model Accuracy and Runtime Tradeoff in Distributed Deep Learning:A Systematic Study
Suyog Gupta
Wei Zhang
Fei Wang
79
172
0
14 Sep 2015
A Linearly-Convergent Stochastic L-BFGS Algorithm
Philipp Moritz
Robert Nishihara
Michael I. Jordan
ODL
90
235
0
09 Aug 2015
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
165
2,269
0
05 Aug 2015
Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization
Xiangru Lian
Yijun Huang
Y. Li
Ji Liu
147
499
0
27 Jun 2015
Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms
Christopher De Sa
Ce Zhang
K. Olukotun
Christopher Ré
97
204
0
22 Jun 2015
Deep SimNets
Nadav Cohen
Or Sharir
Amnon Shashua
85
46
0
09 Jun 2015
Instant Learning: Parallel Deep Neural Networks and Convolutional Bootstrapping
Andrew J. R. Simpson
29
4
0
22 May 2015
Asynchronous Parallel Stochastic Gradient Descent - A Numeric Core for Scalable Distributed Machine Learning Algorithms
J. Keuper
Franz-Josef Pfreundt
65
38
0
19 May 2015
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
367
19,764
0
09 Mar 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
471
43,357
0
11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
352
10,091
0
10 Feb 2015
Deep Learning with Limited Numerical Precision
Suyog Gupta
A. Agrawal
K. Gopalakrishnan
P. Narayanan
HAI
209
2,049
0
09 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
355
18,661
0
06 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
69
347
0
24 Dec 2014
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
Deep learning with Elastic Averaging SGD
Sixin Zhang
A. Choromańska
Yann LeCun
FedML
100
611
0
20 Dec 2014
Parallel training of DNNs with Natural Gradient and Parameter Averaging
Daniel Povey
Xiaohui Zhang
Sanjeev Khudanpur
FedML
101
251
0
27 Oct 2014
cuDNN: Efficient Primitives for Deep Learning
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
144
1,849
0
03 Oct 2014
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
499
43,717
0
17 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.7K
100,575
0
04 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLM
BDL
3DV
298
14,717
0
20 Jun 2014
Generative Adversarial Networks
Ian Goodfellow
Jean Pouget-Abadie
M. Berk Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
GAN
148
2,198
0
10 Jun 2014
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,414
0
03 Jun 2014
Deep Learning in Neural Networks: An Overview
Jürgen Schmidhuber
HAI
252
16,405
0
30 Apr 2014
One weird trick for parallelizing convolutional neural networks
A. Krizhevsky
GNN
103
1,303
0
23 Apr 2014
A Stochastic Quasi-Newton Method for Large-Scale Optimization
R. Byrd
Samantha Hansen
J. Nocedal
Y. Singer
ODL
119
473
0
27 Jan 2014
GPU Asynchronous Stochastic Gradient Descent to Speed Up Neural Network Training
T. Paine
Hailin Jin
Jianchao Yang
Zhe Lin
Thomas Huang
113
98
0
21 Dec 2013
Do Deep Nets Really Need to be Deep?
Lei Jimmy Ba
R. Caruana
188
2,120
0
21 Dec 2013
Multi-GPU Training of ConvNets
Guillermo A. Castillo
Keith Adams
Yaniv Taigman
Ayonga Hereid
82
101
0
20 Dec 2013
Fast Training of Convolutional Networks through FFTs
Michaël Mathieu
Mikael Henaff
Yann LeCun
146
611
0
20 Dec 2013
Network In Network
Min Lin
Qiang Chen
Shuicheng Yan
306
6,290
0
16 Dec 2013
Deep Learning of Representations: Looking Forward
Yoshua Bengio
225
683
0
02 May 2013
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
218
5,361
0
21 Nov 2012
Practical Bayesian Optimization of Machine Learning Algorithms
Jasper Snoek
Hugo Larochelle
Ryan P. Adams
384
7,981
0
13 Jun 2012
Building high-level features using large scale unsupervised learning
Quoc V. Le
MarcÁurelio Ranzato
R. Monga
M. Devin
Kai Chen
G. Corrado
J. Dean
A. Ng
SSL
OffRL
CVBM
146
2,272
0
29 Dec 2011
Previous
1
2
3
4
Next