Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1410.0759
Cited By
cuDNN: Efficient Primitives for Deep Learning
3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
Re-assign community
ArXiv
PDF
HTML
Papers citing
"cuDNN: Efficient Primitives for Deep Learning"
49 / 249 papers shown
Title
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks
Syed Shakib Sarwar
Priyadarshini Panda
Kaushik Roy
CVBM
24
100
0
12 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
33
176
0
03 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
Shaoshuai Shi
Xuming Hu
29
43
0
25 Apr 2017
CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data
Lukas Cavigelli
Philippe Degen
Luca Benini
BDL
25
51
0
14 Apr 2017
Parallel Multi Channel Convolution using General Matrix Multiplication
Aravind Vasudevan
Andrew Anderson
David Gregg
16
139
0
06 Apr 2017
Active Convolution: Learning the Shape of Convolution for Image Classification
Yunho Jeon
Junmo Kim
29
171
0
27 Mar 2017
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
Jiehan Zhu
Ying Shan
JC Mao
Dong Yu
Holakou Rahmanian
Yi Zhang
30
52
0
15 Mar 2017
Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification
Jan Deriu
Aurelien Lucchi
V. D. Luca
Aliaksei Severyn
Simon Müller
Mark Cieliebak
Thomas Hofmann
Martin Jaggi
17
133
0
07 Mar 2017
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating Deep Convolutional Neural Networks
Shihao Wang
Dajiang Zhou
Xushen Han
T. Yoshimura
3DV
19
51
0
04 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
39
37
0
02 Feb 2017
Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Wenjie Qu
Mohammad Pezeshki
Philemon Brakel
Saizheng Zhang
Yoshua Bengio
Aaron Courville
27
366
0
10 Jan 2017
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
Bichen Wu
Alvin Wan
F. Iandola
Peter H. Jin
Kurt Keutzer
44
512
0
04 Dec 2016
CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression
Lukas Cavigelli
P. Hager
Luca Benini
22
195
0
22 Nov 2016
Factorized Bilinear Models for Image Recognition
Yanghao Li
Naiyan Wang
Jiaying Liu
Xiaodi Hou
19
96
0
17 Nov 2016
How to scale distributed deep learning?
Peter H. Jin
Qiaochu Yuan
F. Iandola
Kurt Keutzer
3DH
27
136
0
14 Nov 2016
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
R. Dicecco
Griffin Lacey
Jasmina Vasiljevic
P. Chow
Graham W. Taylor
S. Areibi
23
92
0
30 Sep 2016
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
Vadim Kantorov
Maxime Oquab
Minsu Cho
Ivan Laptev
WSOL
25
305
0
14 Sep 2016
Benchmarking State-of-the-Art Deep Learning Software Tools
Shaoshuai Shi
Qiang-qiang Wang
Pengfei Xu
Xuming Hu
BDL
19
327
0
25 Aug 2016
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Min Wang
Baoyuan Liu
H. Foroosh
27
51
0
15 Aug 2016
Learning Structured Sparsity in Deep Neural Networks
W. Wen
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
47
2,323
0
12 Aug 2016
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Jonathan Tompson
Kristofer Schlachter
Pablo Sprechmann
Ken Perlin
58
530
0
13 Jul 2016
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Stefan Hadjis
Ce Zhang
Ioannis Mitliagkas
Dan Iter
Christopher Ré
20
65
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
38
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,059
0
07 Jun 2016
Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms
Matthew W. Moskewicz
F. Iandola
Kurt Keutzer
17
8
0
01 Jun 2016
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhehuai Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
140
18,268
0
27 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
19
1,165
0
24 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
29
127
0
20 May 2016
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
40
2,335
0
09 May 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
108
10,176
0
27 Mar 2016
TTC: A high-performance Compiler for Tensor Transpositions
P. Springer
J. Hammond
Paolo Bientinesi
30
17
0
07 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
30
425
0
03 Mar 2016
Automatic learning of gait signatures for people identification
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
22
96
0
03 Mar 2016
DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters
Hanjoo Kim
Jaehong Park
Jaehee Jang
Sungroh Yoon
BDL
32
37
0
26 Feb 2016
Deep Learning on FPGAs: Past, Present, and Future
Griffin Lacey
Graham W. Taylor
S. Areibi
GNN
29
180
0
13 Feb 2016
PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors
Vassileios Balntas
Edward Johns
Lilian Tang
K. Mikolajczyk
20
172
0
19 Jan 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
63
2,956
0
08 Dec 2015
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
30
302
0
31 Oct 2015
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Jure Zbontar
Yann LeCun
3DV
24
1,384
0
20 Oct 2015
Semantic Image Segmentation via Deep Parsing Network
Ziwei Liu
Xiaoxiao Li
Ping Luo
Chen Change Loy
Xiaoou Tang
25
659
0
09 Sep 2015
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation
Marijn F. Stollenga
Wonmin Byeon
Marcus Liwicki
Jürgen Schmidhuber
32
294
0
24 Jun 2015
Fast ConvNets Using Group-wise Brain Damage
V. Lebedev
Victor Lempitsky
AAML
44
447
0
08 Jun 2015
PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions
Mikhail Figurnov
Aizhan Ibraimova
Dmitry Vetrov
Pushmeet Kohli
32
137
0
30 Apr 2015
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Stefan Hadjis
Firas Abuzaid
Ce Zhang
Christopher Ré
BDL
23
71
0
16 Apr 2015
Learning to Compare Image Patches via Convolutional Neural Networks
Sergey Zagoruyko
N. Komodakis
SSL
36
1,434
0
14 Apr 2015
Automatic differentiation in machine learning: a survey
A. G. Baydin
Barak A. Pearlmutter
Alexey Radul
J. Siskind
PINN
AI4CE
ODL
75
2,754
0
20 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
34
346
0
24 Dec 2014
Deep Speech: Scaling up end-to-end speech recognition
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
...
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
113
2,110
0
17 Dec 2014
Previous
1
2
3
4
5