ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1410.0759
  4. Cited By
cuDNN: Efficient Primitives for Deep Learning

cuDNN: Efficient Primitives for Deep Learning

3 October 2014
Sharan Chetlur
Cliff Woolley
Philippe Vandermersch
Jonathan M. Cohen
J. Tran
Bryan Catanzaro
Evan Shelhamer
ArXivPDFHTML

Papers citing "cuDNN: Efficient Primitives for Deep Learning"

49 / 249 papers shown
Title
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional
  Neural Networks
Gabor Filter Assisted Energy Efficient Fast Learning Convolutional Neural Networks
Syed Shakib Sarwar
Priyadarshini Panda
Kaushik Roy
CVBM
24
100
0
12 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep
  Neural Networks
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
33
176
0
03 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of
  Rectifier Units
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
Shaoshuai Shi
Xuming Hu
29
43
0
25 Apr 2017
CBinfer: Change-Based Inference for Convolutional Neural Networks on
  Video Data
CBinfer: Change-Based Inference for Convolutional Neural Networks on Video Data
Lukas Cavigelli
Philippe Degen
Luca Benini
BDL
25
51
0
14 Apr 2017
Parallel Multi Channel Convolution using General Matrix Multiplication
Parallel Multi Channel Convolution using General Matrix Multiplication
Aravind Vasudevan
Andrew Anderson
David Gregg
16
139
0
06 Apr 2017
Active Convolution: Learning the Shape of Convolution for Image
  Classification
Active Convolution: Learning the Shape of Convolution for Image Classification
Yunho Jeon
Junmo Kim
29
171
0
27 Mar 2017
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
Deep Embedding Forest: Forest-based Serving with Deep Embedding Features
Jiehan Zhu
Ying Shan
JC Mao
Dong Yu
Holakou Rahmanian
Yi Zhang
30
52
0
15 Mar 2017
Leveraging Large Amounts of Weakly Supervised Data for Multi-Language
  Sentiment Classification
Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification
Jan Deriu
Aurelien Lucchi
V. D. Luca
Aliaksei Severyn
Simon Müller
Mark Cieliebak
Thomas Hofmann
Martin Jaggi
17
133
0
07 Mar 2017
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating
  Deep Convolutional Neural Networks
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating Deep Convolutional Neural Networks
Shihao Wang
Dajiang Zhou
Xushen Han
T. Yoshimura
3DV
19
51
0
04 Mar 2017
Symbolic, Distributed and Distributional Representations for Natural
  Language Processing in the Era of Deep Learning: a Survey
Symbolic, Distributed and Distributional Representations for Natural Language Processing in the Era of Deep Learning: a Survey
L. Ferrone
Fabio Massimo Zanzotto
39
37
0
02 Feb 2017
Towards End-to-End Speech Recognition with Deep Convolutional Neural
  Networks
Towards End-to-End Speech Recognition with Deep Convolutional Neural Networks
Wenjie Qu
Mohammad Pezeshki
Philemon Brakel
Saizheng Zhang
Yoshua Bengio
Aaron Courville
27
366
0
10 Jan 2017
Exploring the Design Space of Deep Convolutional Neural Networks at
  Large Scale
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
26
18
0
20 Dec 2016
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural
  Networks for Real-Time Object Detection for Autonomous Driving
SqueezeDet: Unified, Small, Low Power Fully Convolutional Neural Networks for Real-Time Object Detection for Autonomous Driving
Bichen Wu
Alvin Wan
F. Iandola
Peter H. Jin
Kurt Keutzer
44
512
0
04 Dec 2016
CAS-CNN: A Deep Convolutional Neural Network for Image Compression
  Artifact Suppression
CAS-CNN: A Deep Convolutional Neural Network for Image Compression Artifact Suppression
Lukas Cavigelli
P. Hager
Luca Benini
22
195
0
22 Nov 2016
Factorized Bilinear Models for Image Recognition
Factorized Bilinear Models for Image Recognition
Yanghao Li
Naiyan Wang
Jiaying Liu
Xiaodi Hou
19
96
0
17 Nov 2016
How to scale distributed deep learning?
How to scale distributed deep learning?
Peter H. Jin
Qiaochu Yuan
F. Iandola
Kurt Keutzer
3DH
27
136
0
14 Nov 2016
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
Caffeinated FPGAs: FPGA Framework For Convolutional Neural Networks
R. Dicecco
Griffin Lacey
Jasmina Vasiljevic
P. Chow
Graham W. Taylor
S. Areibi
23
92
0
30 Sep 2016
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised
  Localization
ContextLocNet: Context-Aware Deep Network Models for Weakly Supervised Localization
Vadim Kantorov
Maxime Oquab
Minsu Cho
Ivan Laptev
WSOL
25
305
0
14 Sep 2016
Benchmarking State-of-the-Art Deep Learning Software Tools
Benchmarking State-of-the-Art Deep Learning Software Tools
Shaoshuai Shi
Qiang-qiang Wang
Pengfei Xu
Xuming Hu
BDL
19
327
0
25 Aug 2016
Design of Efficient Convolutional Layers using Single Intra-channel
  Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Design of Efficient Convolutional Layers using Single Intra-channel Convolution, Topological Subdivisioning and Spatial "Bottleneck" Structure
Min Wang
Baoyuan Liu
H. Foroosh
27
51
0
15 Aug 2016
Learning Structured Sparsity in Deep Neural Networks
Learning Structured Sparsity in Deep Neural Networks
W. Wen
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
47
2,323
0
12 Aug 2016
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Accelerating Eulerian Fluid Simulation With Convolutional Networks
Jonathan Tompson
Kristofer Schlachter
Pablo Sprechmann
Ken Perlin
58
530
0
13 Jul 2016
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Omnivore: An Optimizer for Multi-device Deep Learning on CPUs and GPUs
Stefan Hadjis
Ce Zhang
Ioannis Mitliagkas
Dan Iter
Christopher Ré
20
65
0
14 Jun 2016
Structured Convolution Matrices for Energy-efficient Deep learning
Structured Convolution Matrices for Energy-efficient Deep learning
R. Appuswamy
T. Nayak
John V. Arthur
S. K. Esser
P. Merolla
J. McKinstry
T. Melano
M. Flickner
D. Modha
38
11
0
08 Jun 2016
ENet: A Deep Neural Network Architecture for Real-Time Semantic
  Segmentation
ENet: A Deep Neural Network Architecture for Real-Time Semantic Segmentation
Adam Paszke
Abhishek Chaurasia
Sangpil Kim
Eugenio Culurciello
SSeg
235
2,059
0
07 Jun 2016
Boda-RTC: Productive Generation of Portable, Efficient Code for
  Convolutional Neural Networks on Mobile Computing Platforms
Boda-RTC: Productive Generation of Portable, Efficient Code for Convolutional Neural Networks on Mobile Computing Platforms
Matthew W. Moskewicz
F. Iandola
Kurt Keutzer
17
8
0
01 Jun 2016
TensorFlow: A system for large-scale machine learning
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhehuai Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
140
18,268
0
27 May 2016
An Analysis of Deep Neural Network Models for Practical Applications
An Analysis of Deep Neural Network Models for Practical Applications
A. Canziani
Adam Paszke
Eugenio Culurciello
19
1,165
0
24 May 2016
Ristretto: Hardware-Oriented Approximation of Convolutional Neural
  Networks
Ristretto: Hardware-Oriented Approximation of Convolutional Neural Networks
Philipp Gysel
29
127
0
20 May 2016
Theano: A Python framework for fast computation of mathematical
  expressions
Theano: A Python framework for fast computation of mathematical expressions
The Theano Development Team
Rami Al-Rfou
Guillaume Alain
Amjad Almahairi
Christof Angermüller
...
Kelvin Xu
Lijun Xue
Li Yao
Saizheng Zhang
Ying Zhang
40
2,335
0
09 May 2016
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson
Alexandre Alahi
Li Fei-Fei
SupR
108
10,176
0
27 Mar 2016
TTC: A high-performance Compiler for Tensor Transpositions
TTC: A high-performance Compiler for Tensor Transpositions
P. Springer
J. Hammond
Paolo Bientinesi
30
17
0
07 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
30
425
0
03 Mar 2016
Automatic learning of gait signatures for people identification
Automatic learning of gait signatures for people identification
F. M. Castro
M. Marín-Jiménez
Nicolás Guil Mata
N. P. D. L. Blanca
CVBM
22
96
0
03 Mar 2016
DeepSpark: A Spark-Based Distributed Deep Learning Framework for
  Commodity Clusters
DeepSpark: A Spark-Based Distributed Deep Learning Framework for Commodity Clusters
Hanjoo Kim
Jaehong Park
Jaehee Jang
Sungroh Yoon
BDL
32
37
0
26 Feb 2016
Deep Learning on FPGAs: Past, Present, and Future
Deep Learning on FPGAs: Past, Present, and Future
Griffin Lacey
Graham W. Taylor
S. Areibi
GNN
29
180
0
13 Feb 2016
PN-Net: Conjoined Triple Deep Network for Learning Local Image
  Descriptors
PN-Net: Conjoined Triple Deep Network for Learning Local Image Descriptors
Vassileios Balntas
Edward Johns
Lilian Tang
K. Mikolajczyk
20
172
0
19 Jan 2016
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
63
2,956
0
08 Dec 2015
FireCaffe: near-linear acceleration of deep neural network training on
  compute clusters
FireCaffe: near-linear acceleration of deep neural network training on compute clusters
F. Iandola
Khalid Ashraf
Matthew W. Moskewicz
Kurt Keutzer
30
302
0
31 Oct 2015
Stereo Matching by Training a Convolutional Neural Network to Compare
  Image Patches
Stereo Matching by Training a Convolutional Neural Network to Compare Image Patches
Jure Zbontar
Yann LeCun
3DV
24
1,384
0
20 Oct 2015
Semantic Image Segmentation via Deep Parsing Network
Semantic Image Segmentation via Deep Parsing Network
Ziwei Liu
Xiaoxiao Li
Ping Luo
Chen Change Loy
Xiaoou Tang
25
659
0
09 Sep 2015
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical
  Volumetric Image Segmentation
Parallel Multi-Dimensional LSTM, With Application to Fast Biomedical Volumetric Image Segmentation
Marijn F. Stollenga
Wonmin Byeon
Marcus Liwicki
Jürgen Schmidhuber
32
294
0
24 Jun 2015
Fast ConvNets Using Group-wise Brain Damage
Fast ConvNets Using Group-wise Brain Damage
V. Lebedev
Victor Lempitsky
AAML
44
447
0
08 Jun 2015
PerforatedCNNs: Acceleration through Elimination of Redundant
  Convolutions
PerforatedCNNs: Acceleration through Elimination of Redundant Convolutions
Mikhail Figurnov
Aizhan Ibraimova
Dmitry Vetrov
Pushmeet Kohli
32
137
0
30 Apr 2015
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Caffe con Troll: Shallow Ideas to Speed Up Deep Learning
Stefan Hadjis
Firas Abuzaid
Ce Zhang
Christopher Ré
BDL
23
71
0
16 Apr 2015
Learning to Compare Image Patches via Convolutional Neural Networks
Learning to Compare Image Patches via Convolutional Neural Networks
Sergey Zagoruyko
N. Komodakis
SSL
36
1,434
0
14 Apr 2015
Automatic differentiation in machine learning: a survey
Automatic differentiation in machine learning: a survey
A. G. Baydin
Barak A. Pearlmutter
Alexey Radul
J. Siskind
PINN
AI4CE
ODL
75
2,754
0
20 Feb 2015
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Fast Convolutional Nets With fbfft: A GPU Performance Evaluation
Nicolas Vasilache
Jeff Johnson
Michaël Mathieu
Soumith Chintala
Serkan Piantino
Yann LeCun
34
346
0
24 Dec 2014
Deep Speech: Scaling up end-to-end speech recognition
Deep Speech: Scaling up end-to-end speech recognition
Awni Y. Hannun
Carl Case
Jared Casper
Bryan Catanzaro
G. Diamos
...
R. Prenger
S. Satheesh
Shubho Sengupta
Adam Coates
A. Ng
113
2,110
0
17 Dec 2014
Previous
12345