Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1703.09039
Cited By
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
27 March 2017
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Efficient Processing of Deep Neural Networks: A Tutorial and Survey"
50 / 231 papers shown
Title
SECDA: Efficient Hardware/Software Co-Design of FPGA-based DNN Accelerators for Edge Inference
Jude Haris
Perry Gibson
José Cano
Nicolas Bohm Agostini
David Kaeli
41
19
0
01 Oct 2021
AI Accelerator Survey and Trends
Albert Reuther
Peter Michaleas
Michael Jones
V. Gadepally
S. Samsi
J. Kepner
48
79
0
18 Sep 2021
RAPID-RL: A Reconfigurable Architecture with Preemptive-Exits for Efficient Deep-Reinforcement Learning
Adarsh Kosta
Malik Aqeel Anwar
Priyadarshini Panda
A. Raychowdhury
Kaushik Roy
10
4
0
16 Sep 2021
Power-Based Attacks on Spatial DNN Accelerators
Ge Li
Mohit Tiwari
Michael Orshansky
30
8
0
28 Aug 2021
Learning from Images: Proactive Caching with Parallel Convolutional Neural Networks
Yantong Wang
Ye Hu
Zhaohui Yang
Walid Saad
Kai‐Kit Wong
V. Friderikos
31
4
0
15 Aug 2021
Artificial Intelligence-Driven Customized Manufacturing Factory: Key Technologies, Applications, and Challenges
J. Wan
Xiaomin Li
Hongning Dai
A. Kusiak
Miguel Martínez-García
Di Li
43
154
0
07 Aug 2021
A New Clustering-Based Technique for the Acceleration of Deep Convolutional Networks
Erion-Vasilis M. Pikoulis
C. Mavrokefalidis
Aris S. Lalos
31
10
0
19 Jul 2021
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
196
0
12 Jul 2021
Uncertainty Modeling of Emerging Device-based Computing-in-Memory Neural Accelerators with Application to Neural Architecture Search
Zheyu Yan
Da-Cheng Juan
X. S. Hu
Yiyu Shi
38
24
0
06 Jul 2021
A visual introduction to Gaussian Belief Propagation
Joseph Ortiz
Talfan Evans
Andrew J. Davison
15
32
0
05 Jul 2021
CarSNN: An Efficient Spiking Neural Network for Event-Based Autonomous Cars on the Loihi Neuromorphic Research Processor
Alberto Viale
Alberto Marchisio
Maurizio Martina
Guido Masera
Muhammad Shafique
37
45
0
01 Jul 2021
Efficient Deep Learning: A Survey on Making Deep Learning Models Smaller, Faster, and Better
Gaurav Menghani
VLM
MedIm
23
366
0
16 Jun 2021
OODIn: An Optimised On-Device Inference Framework for Heterogeneous Mobile Devices
Stylianos I. Venieris
Ioannis Panopoulos
I. Venieris
42
14
0
08 Jun 2021
ATRIA: A Bit-Parallel Stochastic Arithmetic Based Accelerator for In-DRAM CNN Processing
Supreeth Mysore Shivanandamurthy
Ishan G. Thakkar
S. A. Salehi
12
5
0
26 May 2021
Agatha: Smart Contract for DNN Computation
Zihan Zheng
Peichen Xie
Xian Zhang
Shuo Chen
Yang Chen
Xiaobing Guo
Guangzhong Sun
Guangyu Sun
Lidong Zhou
GNN
31
11
0
11 May 2021
Deep Neural Networks Based Weight Approximation and Computation Reuse for 2-D Image Classification
M. Tolba
H. Tesfai
H. Saleh
B. Mohammad
Mahmoud Al-Qutayri
21
4
0
28 Apr 2021
An optical neural network using less than 1 photon per multiplication
Tianyu Wang
Shifan Ma
Logan G. Wright
Tatsuhiro Onodera
Brian C. Richard
Peter L. McMahon
48
177
0
27 Apr 2021
DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device
Mario Almeida
Stefanos Laskaridis
Stylianos I. Venieris
Ilias Leontiadis
Nicholas D. Lane
17
36
0
20 Apr 2021
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators
David Stutz
Nandhini Chandramoorthy
Matthias Hein
Bernt Schiele
AAML
MQ
24
18
0
16 Apr 2021
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer
Phuoc Pham
J. Abraham
Jaeyong Chung
MQ
37
11
0
01 Apr 2021
SETGAN: Scale and Energy Trade-off GANs for Image Applications on Mobile Platforms
Nitthilan Kanappan Jayakodi
J. Doppa
P. Pande
GAN
28
4
0
23 Mar 2021
Compacting Deep Neural Networks for Internet of Things: Methods and Applications
Ke Zhang
Hanbo Ying
Hongning Dai
Lin Li
Yuangyuang Peng
Keyi Guo
Hongfang Yu
16
38
0
20 Mar 2021
FastNeRF: High-Fidelity Neural Rendering at 200FPS
Stephan J. Garbin
Marek Kowalski
Matthew W. Johnson
Jamie Shotton
Julien P. C. Valentin
13
629
0
18 Mar 2021
Recent Advances on Neural Network Pruning at Initialization
Huan Wang
Can Qin
Yue Bai
Yulun Zhang
Yun Fu
CVBM
33
64
0
11 Mar 2021
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
S. Min
Kun Wu
Sitao Huang
Mert Hidayetouglu
Jinjun Xiong
Eiman Ebrahimi
Deming Chen
Wen-mei W. Hwu
GNN
10
67
0
04 Mar 2021
A Little Energy Goes a Long Way: Build an Energy-Efficient, Accurate Spiking Neural Network from Convolutional Neural Network
Dengyu Wu
Xinping Yi
Xiaowei Huang
22
16
0
01 Mar 2021
Reduced-Order Neural Network Synthesis with Robustness Guarantees
R. Drummond
M. Turner
S. Duncan
19
9
0
18 Feb 2021
A Survey of Machine Learning for Computer Architecture and Systems
Nan Wu
Yuan Xie
AI4TS
AI4CE
20
145
0
16 Feb 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
NetCut: Real-Time DNN Inference Using Layer Removal
Mehrshad Zandigohar
Deniz Erdogmus
G. Schirner
17
5
0
13 Jan 2021
Robust Machine Learning Systems: Challenges, Current Trends, Perspectives, and the Road Ahead
Muhammad Shafique
Mahum Naseer
T. Theocharides
C. Kyrkou
O. Mutlu
Lois Orosa
Jungwook Choi
OOD
81
100
0
04 Jan 2021
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead
Maurizio Capra
Beatrice Bussolino
Alberto Marchisio
Guido Masera
Maurizio Martina
Muhammad Shafique
BDL
59
140
0
21 Dec 2020
FantastIC4: A Hardware-Software Co-Design Approach for Efficiently Running 4bit-Compact Multilayer Perceptrons
Simon Wiedemann
Suhas Shivapakash
P. Wiedemann
Daniel Becking
Wojciech Samek
F. Gerfers
Thomas Wiegand
MQ
23
7
0
17 Dec 2020
Provable Benefits of Overparameterization in Model Compression: From Double Descent to Pruning Neural Networks
Xiangyu Chang
Yingcong Li
Samet Oymak
Christos Thrampoulidis
35
50
0
16 Dec 2020
CoEdge: Cooperative DNN Inference with Adaptive Workload Partitioning over Heterogeneous Edge Devices
Liekang Zeng
Xu Chen
Zhi Zhou
Lei Yang
Junshan Zhang
39
201
0
06 Dec 2020
Bringing AI To Edge: From Deep Learning's Perspective
Di Liu
Hao Kong
Xiangzhong Luo
Weichen Liu
Ravi Subramaniam
52
116
0
25 Nov 2020
SegBlocks: Block-Based Dynamic Resolution Networks for Real-Time Segmentation
Thomas Verelst
Tinne Tuytelaars
SSeg
10
16
0
24 Nov 2020
DARE: AI-based Diver Action Recognition System using Multi-Channel CNNs for AUV Supervision
Jing Yang
James P. Wilson
Shalabh Gupta
16
6
0
16 Nov 2020
Power Side-Channel Attacks on BNN Accelerators in Remote FPGAs
Shayan Moini
Shanquan Tian
Jakub Szefer
Daniel E. Holcomb
R. Tessier
21
39
0
15 Nov 2020
Augmenting Organizational Decision-Making with Deep Learning Algorithms: Principles, Promises, and Challenges
Yash Raj Shrestha
Vaibhav Krishna
G. Krogh
37
165
0
02 Nov 2020
DistPrivacy: Privacy-Aware Distributed Deep Neural Networks in IoT surveillance systems
Emna Baccour
A. Erbad
Amr M. Mohamed
Mounir Hamdi
Mohsen Guizani
6
19
0
25 Oct 2020
Lightweight End-to-End Speech Recognition from Raw Audio Data Using Sinc-Convolutions
Ludwig Kurzinger
Nicolas Lindae
Palle Klewitz
Gerhard Rigoll
24
5
0
15 Oct 2020
Computing Graph Neural Networks: A Survey from Algorithms to Accelerators
S. Abadal
Akshay Jain
Robert Guirado
Jorge López-Alonso
Eduard Alarcón
GNN
30
225
0
30 Sep 2020
Kernel Based Progressive Distillation for Adder Neural Networks
Yixing Xu
Chang Xu
Xinghao Chen
Wei Zhang
Chunjing Xu
Yunhe Wang
41
47
0
28 Sep 2020
MARS: Mixed Virtual and Real Wearable Sensors for Human Activity Recognition with Multi-Domain Deep Learning Model
Ling Pei
Songpengcheng Xia
Lei Chu
Fanyi Xiao
Qi Wu
Wenxian Yu
Zixuan Zhang
37
30
0
20 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
29
69
0
02 Sep 2020
Classification of Diabetic Retinopathy Using Unlabeled Data and Knowledge Distillation
Sajjad Abbasi
M. Hajabdollahi
P. Khadivi
N. Karimi
Roshank Roshandel
S. Shirani
S. Samavi
14
18
0
01 Sep 2020
HAPI: Hardware-Aware Progressive Inference
Stefanos Laskaridis
Stylianos I. Venieris
Hyeji Kim
Nicholas D. Lane
22
45
0
10 Aug 2020
Always-On 674uW @ 4GOP/s Error Resilient Binary Neural Networks with Aggressive SRAM Voltage Scaling on a 22nm IoT End-Node
Alfio Di Mauro
Francesco Conti
Pasquale Davide Schiavone
D. Rossi
Luca Benini
19
9
0
17 Jul 2020
Hardware Acceleration of Sparse and Irregular Tensor Computations of ML Models: A Survey and Insights
Shail Dave
Riyadh Baghdadi
Tony Nowatzki
Sasikanth Avancha
Aviral Shrivastava
Baoxin Li
59
82
0
02 Jul 2020
Previous
1
2
3
4
5
Next