Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.09039
Cited By
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
27 March 2017
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Processing of Deep Neural Networks: A Tutorial and Survey"
31 / 231 papers shown
Title
Trading-off Accuracy and Energy of Deep Inference on Embedded Systems: A Co-Design Approach
Nitthilan Kanappan Jayakodi
Anwesha Chatterjee
Wonje Choi
J. Doppa
P. Pande
13
27
0
29 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
53
2,268
0
17 Jan 2019
Efficient Winograd Convolution via Integer Arithmetic
Lingchuan Meng
J. Brothers
11
29
0
07 Jan 2019
FPGA-based Accelerators of Deep Learning Networks for Learning and Classification: A Review
Ahmad Shawahna
S. M. Sait
A. El-Maleh
28
372
0
01 Jan 2019
Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning
Kursat Rasim Mestav
Jaime Luengo-Rozas
L. Tong
BDL
31
133
0
07 Nov 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
25
263
0
23 Oct 2018
MBS: Macroblock Scaling for CNN Model Reduction
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
MQ
16
4
0
18 Sep 2018
Normalization in Training U-Net for 2D Biomedical Semantic Segmentation
Xiao-Yun Zhou
Guang-Zhong Yang
18
77
0
11 Sep 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition
Zhan Yang
Osolo Ian Raymond
Chengyuan Zhang
Ying Wan
J. Long
CVBM
39
36
0
31 Jul 2018
2P-DNN : Privacy-Preserving Deep Neural Networks Based on Homomorphic Cryptosystem
Qiang Zhu
Xixiang Lv
22
16
0
23 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
11
72
0
11 Jul 2018
Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices
Yu-hsin Chen
Tien-Ju Yang
J. Emer
Vivienne Sze
MQ
16
70
0
10 Jul 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
48
993
0
21 Jun 2018
On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation
Behzad Salami
O. Unsal
A. Cristal
23
66
0
14 Jun 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
27
147
0
26 May 2018
EVA
2
^2
2
: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
39
75
0
16 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey
Chaoyun Zhang
P. Patras
Hamed Haddadi
45
1,304
0
12 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
24
19
0
05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
29
873
0
03 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
702
0
26 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
31
70
0
19 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
32
389
0
13 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
34
21
0
10 Feb 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
M. Abrishami
Massoud Pedram
FedML
34
247
0
25 Jan 2018
Design Automation for Binarized Neural Networks: A Quantum Leap Opportunity?
Manuele Rusci
Lukas Cavigelli
Luca Benini
MQ
23
20
0
21 Nov 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
36
35
0
31 Jul 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
27
52
0
07 Jun 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
23
479
0
24 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
S. Shi
Xiaowen Chu
15
43
0
25 Apr 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
337
1,049
0
10 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
104
1,503
0
25 Jan 2017
Previous
1
2
3
4
5