ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1802.09941
  4. Cited By
Demystifying Parallel and Distributed Deep Learning: An In-Depth
  Concurrency Analysis
v1v2 (latest)

Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis

26 February 2018
Tal Ben-Nun
Torsten Hoefler
    GNN
ArXiv (abs)PDFHTML

Papers citing "Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis"

50 / 154 papers shown
Title
Asynchronous Methods for Deep Reinforcement Learning
Asynchronous Methods for Deep Reinforcement Learning
Volodymyr Mnih
Adria Puigdomenech Badia
M. Berk Mirza
Alex Graves
Timothy Lillicrap
Tim Harley
David Silver
Koray Kavukcuoglu
210
8,882
0
04 Feb 2016
Poseidon: A System Architecture for Efficient GPU-based Deep Learning on
  Multiple Machines
Poseidon: A System Architecture for Efficient GPU-based Deep Learning on Multiple Machines
Huatian Zhang
Zhiting Hu
Jinliang Wei
P. Xie
Gunhee Kim
Qirong Ho
Eric Xing
GNN
32
48
0
19 Dec 2015
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.3K
194,641
0
10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
146
2,976
0
08 Dec 2015
Multi-Scale Context Aggregation by Dilated Convolutions
Multi-Scale Context Aggregation by Dilated Convolutions
Feng Yu
V. Koltun
SSeg
278
8,464
0
23 Nov 2015
Compression of Deep Convolutional Neural Networks for Fast and Low Power
  Mobile Applications
Compression of Deep Convolutional Neural Networks for Fast and Low Power Mobile Applications
Yong-Deok Kim
Eunhyeok Park
S. Yoo
Taelim Choi
Lu Yang
Dongjun Shin
124
895
0
20 Nov 2015
Why M Heads are Better than One: Training a Diverse Ensemble of Deep
  Networks
Why M Heads are Better than One: Training a Diverse Ensemble of Deep Networks
Stefan Lee
Senthil Purushwalkam
Michael Cogswell
David J. Crandall
Dhruv Batra
FedMLUQCV
110
316
0
19 Nov 2015
SparkNet: Training Deep Networks in Spark
SparkNet: Training Deep Networks in Spark
Philipp Moritz
Robert Nishihara
Ion Stoica
Michael I. Jordan
113
171
0
19 Nov 2015
Staleness-aware Async-SGD for Distributed Deep Learning
Staleness-aware Async-SGD for Distributed Deep Learning
Wei Zhang
Suyog Gupta
Xiangru Lian
Ji Liu
88
266
0
18 Nov 2015
8-Bit Approximations for Parallelism in Deep Learning
8-Bit Approximations for Parallelism in Deep Learning
Tim Dettmers
81
176
0
14 Nov 2015
BinaryConnect: Training Deep Neural Networks with binary weights during
  propagations
BinaryConnect: Training Deep Neural Networks with binary weights during propagations
Matthieu Courbariaux
Yoshua Bengio
J. David
MQ
225
2,994
0
02 Nov 2015
FireCaffe: near-linear acceleration of deep neural network training on
  compute clusters
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
86
302
0
31 Oct 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
263
8,864
0
01 Oct 2015
Fast Algorithms for Convolutional Neural Networks
Fast Algorithms for Convolutional Neural Networks
Andrew Lavin
Scott Gray
74
879
0
30 Sep 2015
On the Expressive Power of Deep Learning: A Tensor Analysis
On the Expressive Power of Deep Learning: A Tensor Analysis
Nadav Cohen
Or Sharir
Amnon Shashua
83
472
0
16 Sep 2015
Model Accuracy and Runtime Tradeoff in Distributed Deep Learning:A
  Systematic Study
Model Accuracy and Runtime Tradeoff in Distributed Deep Learning:A Systematic Study
Suyog Gupta
Wei Zhang
Fei Wang
79
172
0
14 Sep 2015
A Linearly-Convergent Stochastic L-BFGS Algorithm
A Linearly-Convergent Stochastic L-BFGS Algorithm
Philipp Moritz
Robert Nishihara
Michael I. Jordan
ODL
90
235
0
09 Aug 2015
Listen, Attend and Spell
Listen, Attend and Spell
William Chan
Navdeep Jaitly
Quoc V. Le
Oriol Vinyals
RALM
165
2,269
0
05 Aug 2015
Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization
Asynchronous Parallel Stochastic Gradient for Nonconvex Optimization
Xiangru Lian
Yijun Huang
Y. Li
Ji Liu
147
499
0
27 Jun 2015
Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms
Taming the Wild: A Unified Analysis of Hogwild!-Style Algorithms
Christopher De Sa
Ce Zhang
K. Olukotun
Christopher Ré
97
204
0
22 Jun 2015
Deep SimNets
Deep SimNets
Nadav Cohen
Or Sharir
Amnon Shashua
85
46
0
09 Jun 2015
Instant Learning: Parallel Deep Neural Networks and Convolutional
  Bootstrapping
Instant Learning: Parallel Deep Neural Networks and Convolutional Bootstrapping
Andrew J. R. Simpson
29
4
0
22 May 2015
Asynchronous Parallel Stochastic Gradient Descent - A Numeric Core for
  Scalable Distributed Machine Learning Algorithms
Asynchronous Parallel Stochastic Gradient Descent - A Numeric Core for Scalable Distributed Machine Learning Algorithms
J. Keuper
Franz-Josef Pfreundt
65
38
0
19 May 2015
Distilling the Knowledge in a Neural Network
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
367
19,764
0
09 Mar 2015
Batch Normalization: Accelerating Deep Network Training by Reducing
  Internal Covariate Shift
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
471
43,357
0
11 Feb 2015
Show, Attend and Tell: Neural Image Caption Generation with Visual
  Attention
Show, Attend and Tell: Neural Image Caption Generation with Visual Attention
Ke Xu
Jimmy Ba
Ryan Kiros
Kyunghyun Cho
Aaron Courville
Ruslan Salakhutdinov
R. Zemel
Yoshua Bengio
DiffM
352
10,091
0
10 Feb 2015
Deep Learning with Limited Numerical Precision
Deep Learning with Limited Numerical Precision
Suyog Gupta
A. Agrawal
K. Gopalakrishnan
P. Narayanan
HAI
209
2,049
0
09 Feb 2015
Delving Deep into Rectifiers: Surpassing Human-Level Performance on
  ImageNet Classification
Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
VLM
355
18,661
0
06 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
69
347
0
24 Dec 2014
Adam: A Method for Stochastic Optimization
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,433
0
22 Dec 2014
Deep learning with Elastic Averaging SGD
Deep learning with Elastic Averaging SGD
Sixin Zhang
A. Choromańska
Yann LeCun
FedML
100
611
0
20 Dec 2014
Parallel training of DNNs with Natural Gradient and Parameter Averaging
Parallel training of DNNs with Natural Gradient and Parameter Averaging
Daniel Povey
Xiaohui Zhang
Sanjeev Khudanpur
FedML
101
251
0
27 Oct 2014
cuDNN: Efficient Primitives for Deep Learning
cuDNN: Efficient Primitives for Deep Learning
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
144
1,849
0
03 Oct 2014
Going Deeper with Convolutions
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
499
43,717
0
17 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAttMDE
1.7K
100,575
0
04 Sep 2014
Caffe: Convolutional Architecture for Fast Feature Embedding
Caffe: Convolutional Architecture for Fast Feature Embedding
Yangqing Jia
Evan Shelhamer
Jeff Donahue
Sergey Karayev
Jonathan Long
Ross B. Girshick
S. Guadarrama
Trevor Darrell
VLMBDL3DV
298
14,717
0
20 Jun 2014
Generative Adversarial Networks
Generative Adversarial Networks
Ian Goodfellow
Jean Pouget-Abadie
M. Berk Mirza
Bing Xu
David Warde-Farley
Sherjil Ozair
Aaron Courville
Yoshua Bengio
GAN
148
2,198
0
10 Jun 2014
Learning Phrase Representations using RNN Encoder-Decoder for
  Statistical Machine Translation
Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation
Kyunghyun Cho
B. V. Merrienboer
Çağlar Gülçehre
Dzmitry Bahdanau
Fethi Bougares
Holger Schwenk
Yoshua Bengio
AIMat
1.1K
23,414
0
03 Jun 2014
Deep Learning in Neural Networks: An Overview
Deep Learning in Neural Networks: An Overview
Jürgen Schmidhuber
HAI
252
16,405
0
30 Apr 2014
One weird trick for parallelizing convolutional neural networks
One weird trick for parallelizing convolutional neural networks
A. Krizhevsky
GNN
103
1,303
0
23 Apr 2014
A Stochastic Quasi-Newton Method for Large-Scale Optimization
A Stochastic Quasi-Newton Method for Large-Scale Optimization
R. Byrd
Samantha Hansen
J. Nocedal
Y. Singer
ODL
119
473
0
27 Jan 2014
GPU Asynchronous Stochastic Gradient Descent to Speed Up Neural Network
  Training
GPU Asynchronous Stochastic Gradient Descent to Speed Up Neural Network Training
T. Paine
Hailin Jin
Jianchao Yang
Zhe Lin
Thomas Huang
113
98
0
21 Dec 2013
Do Deep Nets Really Need to be Deep?
Do Deep Nets Really Need to be Deep?
Lei Jimmy Ba
R. Caruana
188
2,120
0
21 Dec 2013
Multi-GPU Training of ConvNets
Multi-GPU Training of ConvNets
Guillermo A. Castillo
Keith Adams
Yaniv Taigman
Ayonga Hereid
82
101
0
20 Dec 2013
Fast Training of Convolutional Networks through FFTs
Fast Training of Convolutional Networks through FFTs
Michaël Mathieu
Mikael Henaff
Yann LeCun
146
611
0
20 Dec 2013
Network In Network
Network In Network
Min Lin
Qiang Chen
Shuicheng Yan
306
6,290
0
16 Dec 2013
Deep Learning of Representations: Looking Forward
Deep Learning of Representations: Looking Forward
Yoshua Bengio
225
683
0
02 May 2013
On the difficulty of training Recurrent Neural Networks
On the difficulty of training Recurrent Neural Networks
Razvan Pascanu
Tomas Mikolov
Yoshua Bengio
ODL
218
5,361
0
21 Nov 2012
Practical Bayesian Optimization of Machine Learning Algorithms
Practical Bayesian Optimization of Machine Learning Algorithms
Jasper Snoek
Hugo Larochelle
Ryan P. Adams
384
7,981
0
13 Jun 2012
Building high-level features using large scale unsupervised learning
Building high-level features using large scale unsupervised learning
Quoc V. Le
MarcÁurelio Ranzato
R. Monga
M. Devin
Kai Chen
G. Corrado
J. Dean
A. Ng
SSLOffRLCVBM
146
2,272
0
29 Dec 2011
Previous
1234
Next