Efficient Processing of Deep Neural Networks: A Tutorial and Survey

27 March 2017

Papers citing "Efficient Processing of Deep Neural Networks: A Tutorial and Survey"

31 / 231 papers shown

Title
Trading-off Accuracy and Energy of Deep Inference on Embedded Systems: A Co-Design Approach Nitthilan Kanappan Jayakodi Anwesha Chatterjee Wonje Choi J. Doppa P. Pande 13 27 0 29 Jan 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks Asifullah Khan A. Sohail Umme Zahoora Aqsa Saeed Qureshi OOD 53 2,268 0 17 Jan 2019
Efficient Winograd Convolution via Integer Arithmetic Lingchuan Meng J. Brothers 11 29 0 07 Jan 2019
FPGA-based Accelerators of Deep Learning Networks for Learning and Classification: A Review Ahmad Shawahna S. M. Sait A. El-Maleh 28 372 0 01 Jan 2019
Bayesian State Estimation for Unobservable Distribution Systems via Deep Learning Kursat Rasim Mestav Jaime Luengo-Rozas L. Tong BDL 31 133 0 07 Nov 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision Biyi Fang Xiao Zeng Mi Zhang 3DH 25 263 0 23 Oct 2018
MBS: Macroblock Scaling for CNN Model Reduction Yu-Hsun Lin Chun-Nan Chou Edward Y. Chang MQ 16 4 0 18 Sep 2018
Normalization in Training U-Net for 2D Biomedical Semantic Segmentation Xiao-Yun Zhou Guang-Zhong Yang 18 77 0 11 Sep 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition Zhan Yang Osolo Ian Raymond Chengyuan Zhang Ying Wan J. Long CVBM 39 36 0 31 Jul 2018
2P-DNN : Privacy-Preserving Deep Neural Networks Based on Homomorphic Cryptosystem Qiang Zhu Xixiang Lv 22 16 0 23 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs Vladimir Rybalkin Alessandro Pappalardo M. M. Ghaffar Giulio Gambardella Norbert Wehn Michaela Blott 11 72 0 11 Jul 2018
Eyeriss v2: A Flexible Accelerator for Emerging Deep Neural Networks on Mobile Devices Yu-hsin Chen Tien-Ju Yang J. Emer Vivienne Sze MQ 16 70 0 10 Jul 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper Raghuraman Krishnamoorthi MQ 48 993 0 21 Jun 2018
On the Resilience of RTL NN Accelerators: Fault Characterization and Mitigation Behzad Salami O. Unsal A. Cristal 23 66 0 14 Jun 2018
Accelerating CNN inference on FPGAs: A Survey K. Abdelouahab Maxime Pelcat Jocelyn Serot F. Berry AI4CE 27 147 0 26 May 2018
EVA $^2$ : Exploiting Temporal Redundancy in Live Computer Vision Mark Buckler Philip Bedoukian Suren Jayasuriya Adrian Sampson 39 75 0 16 Mar 2018
Deep Learning in Mobile and Wireless Networking: A Survey Chaoyun Zhang P. Patras Hamed Haddadi 45 1,304 0 12 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine Renzo Andri Lukas Cavigelli D. Rossi Luca Benini MQ 24 19 0 05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 29 873 0 03 Mar 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis Tal Ben-Nun Torsten Hoefler GNN 33 702 0 26 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets Fabian Schuiki Michael Schaffner Frank K. Gürkaynak Luca Benini 31 70 0 19 Feb 2018
Training and Inference with Integers in Deep Neural Networks Shuang Wu Guoqi Li F. Chen Luping Shi MQ 32 389 0 13 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks Yukun Ding Jinglan Liu Jinjun Xiong Yiyu Shi MQ 34 21 0 10 Feb 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services Amir Erfan Eshratifar M. Abrishami Massoud Pedram FedML 34 247 0 25 Jan 2018
Design Automation for Binarized Neural Networks: A Quantum Leap Opportunity? Manuele Rusci Lukas Cavigelli Luca Benini MQ 23 20 0 21 Nov 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform Chaim Baskin Natan Liss Evgenii Zheltonozhskii A. Bronstein A. Mendelson GNN MQ 36 35 0 31 Jul 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks Denis A. Gudovskiy Luca Rigazio MQ 27 52 0 07 Jun 2017
Bayesian Compression for Deep Learning Christos Louizos Karen Ullrich Max Welling UQCV BDL 23 479 0 24 May 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units S. Shi Xiaowen Chu 15 43 0 25 Apr 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights Aojun Zhou Anbang Yao Yiwen Guo Lin Xu Yurong Chen MQ 337 1,049 0 10 Feb 2017
Deep Reinforcement Learning: An Overview Yuxi Li OffRL VLM 104 1,503 0 25 Jan 2017