Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.09039
Cited By
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
27 March 2017
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Processing of Deep Neural Networks: A Tutorial and Survey"
50 / 231 papers shown
Title
Single Shot Structured Pruning Before Training
Joost R. van Amersfoort
Milad Alizadeh
Sebastian Farquhar
Nicholas D. Lane
Y. Gal
19
22
0
01 Jul 2020
Exploring Weight Importance and Hessian Bias in Model Pruning
Mingchen Li
Yahya Sattar
Christos Thrampoulidis
Samet Oymak
28
3
0
19 Jun 2020
Multi-Precision Policy Enforced Training (MuPPET): A precision-switching strategy for quantised fixed-point training of CNNs
A. Rajagopal
D. A. Vink
Stylianos I. Venieris
C. Bouganis
MQ
16
14
0
16 Jun 2020
Autonomous Driving with Deep Learning: A Survey of State-of-Art Technologies
Yu Huang
Yue Chen
3DPC
49
83
0
10 Jun 2020
STONNE: A Detailed Architectural Simulator for Flexible Neural Network Accelerators
Francisco Munoz-Martínez
José L. Abellán
M. Acacio
T. Krishna
27
11
0
10 Jun 2020
Making Convolutions Resilient via Algorithm-Based Error Detection Techniques
S. Hari
Michael B. Sullivan
Timothy Tsai
S. Keckler
29
66
0
08 Jun 2020
AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles
Sicong Liu
Junzhao Du
Kaiming Nan
Zimu Zhou
Zhangyang Wang
Yingyan Lin
32
30
0
08 Jun 2020
Deep Learning for LiDAR Point Clouds in Autonomous Driving: A Review
Ying Li
Lingfei Ma
Zilong Zhong
Fei Liu
Dongpu Cao
Jonathan Li
M. Chapman
3DPC
44
390
0
20 May 2020
Cross-filter compression for CNN inference acceleration
Fuyuan Lyu
Shien Zhu
Weichen Liu
MQ
12
0
0
18 May 2020
Pruning Algorithms to Accelerate Convolutional Neural Networks for Edge Applications: A Survey
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
3DPC
MedIm
30
52
0
08 May 2020
Data-Free Network Quantization With Adversarial Knowledge Distillation
Yoojin Choi
Jihwan P. Choi
Mostafa El-Khamy
Jungwon Lee
MQ
27
119
0
08 May 2020
TIMELY: Pushing Data Movements and Interfaces in PIM Accelerators Towards Local and in Time Domain
Weitao Li
Pengfei Xu
Yang Katie Zhao
Haitong Li
Yuan Xie
Yingyan Lin
9
68
0
03 May 2020
Lupulus: A Flexible Hardware Accelerator for Neural Networks
Andreas Toftegaard Kristensen
R. Giterman
Alexios Balatsoukas-Stimming
A. Burg
36
0
0
03 May 2020
Memristors -- from In-memory computing, Deep Learning Acceleration, Spiking Neural Networks, to the Future of Neuromorphic and Bio-inspired Computing
A. Mehonic
Abu Sebastian
Bipin Rajendran
Osvaldo Simeone
Eleni Vasilaki
A. Kenyon
16
200
0
30 Apr 2020
A Survey on Impact of Transient Faults on BNN Inference Accelerators
N. Khoshavi
Connor Broyles
Yu Bi
19
8
0
10 Apr 2020
PointAR: Efficient Lighting Estimation for Mobile Augmented Reality
Yiqin Zhao
Tian Guo
3DPC
22
29
0
30 Mar 2020
Ternary Compression for Communication-Efficient Federated Learning
Jinjin Xu
W. Du
Ran Cheng
Wangli He
Yaochu Jin
MQ
FedML
42
174
0
07 Mar 2020
Cluster Pruning: An Efficient Filter Pruning Method for Edge AI Vision Applications
Chinthaka Gamanayake
Lahiru Jayasinghe
Benny Kai Kiat Ng
Chau Yuen
VLM
23
45
0
05 Mar 2020
SELD-TCN: Sound Event Localization & Detection via Temporal Convolutional Networks
Karim Guirguis
Christoph Schorn
A. Guntoro
Sherif Abdulatif
Bin Yang
20
55
0
03 Mar 2020
Machine Learning based prediction of noncentrosymmetric crystal materials
Yuqi Song
Joseph Lindsay
Yong Zhao
Alireza Nasiri
Steph-Yves M. Louis
Jie Ling
Ming Hu
Jianjun Hu
14
21
0
26 Feb 2020
Sparse Optimization for Green Edge AI Inference
Xiangyu Yang
Sheng Hua
Yuanming Shi
Hao Wang
Jun Zhang
Khaled B. Letaief
19
14
0
24 Feb 2020
HarDNN: Feature Map Vulnerability Evaluation in CNNs
Abdulrahman Mahmoud
S. Hari
Christopher W. Fletcher
Sarita Adve
Charbel Sakr
Naresh R Shanbhag
Pavlo Molchanov
Michael B. Sullivan
Timothy Tsai
S. Keckler
19
38
0
22 Feb 2020
Communication-Efficient Edge AI: Algorithms and Systems
Yuanming Shi
Kai Yang
Tao Jiang
Jun Zhang
Khaled B. Letaief
GNN
20
326
0
22 Feb 2020
Noisy Machines: Understanding Noisy Neural Networks and Enhancing Robustness to Analog Hardware Errors Using Distillation
Chuteng Zhou
Prad Kadambi
Matthew Mattina
P. Whatmough
19
35
0
14 Jan 2020
Modeling of Pruning Techniques for Deep Neural Networks Simplification
Morteza Mousa Pasandi
M. Hajabdollahi
N. Karimi
S. Samavi
3DPC
14
18
0
13 Jan 2020
PANTHER: A Programmable Architecture for Neural Network Training Harnessing Energy-efficient ReRAM
Aayush Ankit
I. E. Hajj
S. R. Chalamalasetti
S. Agarwal
M. Marinella
M. Foltin
J. Strachan
D. Milojicic
Wen-mei W. Hwu
Kaushik Roy
21
65
0
24 Dec 2019
Design Considerations for Efficient Deep Neural Networks on Processing-in-Memory Accelerators
Tien-Ju Yang
Vivienne Sze
3DH
19
44
0
18 Dec 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
28
141
0
02 Nov 2019
Device-Circuit-Architecture Co-Exploration for Computing-in-Memory Neural Accelerators
Weiwen Jiang
Qiuwen Lou
Zheyu Yan
Lei Yang
Jiaxi Hu
X. S. Hu
Yiyu Shi
11
71
0
31 Oct 2019
Deep Learning and Control Algorithms of Direct Perception for Autonomous Driving
Der-Hau Lee
Kuan-Lin Chen
Kuan-Han Liou
Chang-Lun Liu
Jinn-Liang Liu
BDL
22
58
0
26 Oct 2019
Autonomous Navigation via Deep Reinforcement Learning for Resource Constraint Edge Nodes using Transfer Learning
Aqeel Anwar
A. Raychowdhury
21
73
0
12 Oct 2019
A Survey of Machine Learning Applied to Computer Architecture Design
Drew Penney
Lizhong Chen
AI4CE
24
28
0
26 Sep 2019
A Data-Center FPGA Acceleration Platform for Convolutional Neural Networks
Xiaoyu Yu
Yuwei Wang
Jie Miao
Ephrem Wu
Heng Zhang
Yu Meng
Bo Zhang
Biao Min
Dewei Chen
Jianlin Gao
14
21
0
17 Sep 2019
Complexity-Scalable Neural Network Based MIMO Detection With Learnable Weight Scaling
A. Mohammad
C. Masouros
Y. Andreopoulos
24
28
0
12 Sep 2019
Edge Intelligence: The Confluence of Edge Computing and Artificial Intelligence
Shuiguang Deng
Hailiang Zhao
Weijia Fang
Jianwei Yin
Schahram Dustdar
Albert Y. Zomaya
74
605
0
02 Sep 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019
Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference
Kai Yang
Yuanming Shi
Wei Yu
Z. Ding
16
42
0
29 Jul 2019
Recurrent Neural Networks: An Embedded Computing Perspective
Nesma M. Rezk
M. Purnaprajna
Tomas Nordstrom
Z. Ul-Abdin
35
81
0
23 Jul 2019
Learning Multimodal Fixed-Point Weights using Gradient Descent
Lukas Enderich
Fabian Timm
Lars Rosenbaum
Wolfram Burgard
MQ
17
9
0
16 Jul 2019
Parameterized Structured Pruning for Deep Neural Networks
Günther Schindler
Wolfgang Roth
Franz Pernkopf
Holger Froening
21
6
0
12 Jun 2019
Edge Intelligence: Paving the Last Mile of Artificial Intelligence with Edge Computing
Zhi Zhou
Xu Chen
En Li
Liekang Zeng
Ke Luo
Junshan Zhang
26
1,420
0
24 May 2019
DeepCABAC: Context-adaptive binary arithmetic coding for deep neural network compression
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
MQ
19
21
0
15 May 2019
NeuPart: Using Analytical Models to Drive Energy-Efficient Partitioning of CNN Computations on Cloud-Connected Mobile Clients
Susmita Dey Manasi
F. S. Snigdha
S. Sapatnekar
26
16
0
09 May 2019
Progressive Stochastic Binarization of Deep Networks
David Hartmann
Michael Wand
MQ
17
1
0
03 Apr 2019
Multi-vision Attention Networks for On-line Red Jujube Grading
Xiaoye Sun
Liyan Ma
Gongyang Li
9
9
0
31 Mar 2019
Automated Circuit Approximation Method Driven by Data Distribution
Z. Vašíček
Vojtěch Mrázek
Lukás Sekanina
7
17
0
11 Mar 2019
Efficient Winograd or Cook-Toom Convolution Kernel Implementation on Widely Used Mobile CPUs
Partha P. Maji
Andrew Mundy
Ganesh S. Dasika
Jesse G. Beu
Matthew Mattina
Robert D. Mullins
24
26
0
04 Mar 2019
Speeding up Deep Learning with Transient Servers
Shijian Li
R. Walls
Lijie Xu
Tian Guo
27
12
0
28 Feb 2019
Single-shot Channel Pruning Based on Alternating Direction Method of Multipliers
Chengcheng Li
Zehao Wang
Xiangyang Wang
Hairong Qi
16
5
0
18 Feb 2019
Optimally Scheduling CNN Convolutions for Efficient Memory Access
Arthur Stoutchinin
Francesco Conti
Luca Benini
36
43
0
04 Feb 2019
Previous
1
2
3
4
5
Next