EIE: Efficient Inference Engine on Compressed Deep Neural Network

4 February 2016

Song Han

Papers citing "EIE: Efficient Inference Engine on Compressed Deep Neural Network"

50 / 325 papers shown

Title
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark Cody Coleman Daniel Kang Deepak Narayanan Luigi Nardi Tian Zhao Jian Zhang Peter Bailis K. Olukotun Christopher Ré Matei A. Zaharia 13 117 0 04 Jun 2018
Channel Gating Neural Networks Weizhe Hua Yuan Zhou Christopher De Sa Zhiru Zhang G. E. Suh 15 180 0 29 May 2018
Compact and Computationally Efficient Representation of Deep Neural Networks Simon Wiedemann K. Müller Wojciech Samek MQ 42 67 0 27 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training Bojian Zheng Abhishek Tiwari Nandita Vijaykumar Gennady Pekhimenko 27 44 0 22 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks Amir Yazdanbakhsh Hajar Falahati Philip J. Wolfe K. Samadi Nam Sung Kim H. Esmaeilzadeh 30 71 0 10 May 2018
Quantization Mimic: Towards Very Tiny CNN for Object Detection Yi Wei Xinyu Pan Hongwei Qin Wanli Ouyang Junjie Yan ObjD 22 88 0 06 May 2018
SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning Titus Cieslewski Konstantinos G. Derpanis Davide Scaramuzza 28 7 0 03 May 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip Feiwen Zhu Jeff Pool M. Andersch J. Appleyard Fung Xie 22 29 0 26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks Hyeong-Ju Kang 13 88 0 26 Apr 2018
Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications K. Kwon Alon Amid A. Gholami Bichen Wu Krste Asanović Kurt Keutzer 3DV OOD 27 22 0 20 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition Kartik Hegde Jiyong Yu R. Agrawal Mengjia Yan Michael Pellauer Christopher W. Fletcher 22 165 0 18 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics Javier Mauricio Duarte Song Han Philip C. Harris S. Jindariani E. Kreinar ... J. Ngadiuba M. Pierini R. Rivera N. Tran Zhenbin Wu AI4CE 88 389 0 16 Apr 2018
Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision Yuhao Zhu A. Samajdar Matthew Mattina P. Whatmough 32 87 0 29 Mar 2018
A Survey on Deep Learning Methods for Robot Vision Javier Ruiz-del-Solar P. Loncomilla Naiomi Soto 31 60 0 28 Mar 2018
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow Yuechao Gao Nianhong Liu Shenmin Zhang 19 1 0 28 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision Mark Buckler Philip Bedoukian Suren Jayasuriya Adrian Sampson 44 75 0 16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training Hongyu Zhu Mohamed Akrout Bojian Zheng Andrew Pelegris Amar Phanishayee Bianca Schroeder Gennady Pekhimenko 31 80 0 16 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning Maurice Yang Mahmoud Faraj Assem Hussein V. Gaudet CVBM 22 12 0 15 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions Stylianos I. Venieris Alexandros Kouris C. Bouganis 19 184 0 15 Mar 2018
DeepN-JPEG: A Deep Neural Network Favorable JPEG-based Image Compression Framework Zihao Liu Tao Liu Wujie Wen Lei Jiang Jie Xu Yanzhi Wang Gang Quan 29 97 0 14 Mar 2018
Newton: Gravitating Towards the Physical Limits of Crossbar Acceleration Anirban Nag Ali Shafiee R. Balasubramonian Vivek Srikumar N. Muralimanohar 16 37 0 10 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization Yuhui Xu Yongzhuang Wang Aojun Zhou Weiyao Lin H. Xiong MQ 20 114 0 06 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine Renzo Andri Lukas Cavigelli D. Rossi Luca Benini MQ 26 19 0 05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 34 875 0 03 Mar 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs Xuhao Chen 21 25 0 28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos Bowen Pan Wuwei Lin Xiaolin Fang Chaoqin Huang Bolei Zhou Cewu Lu ObjD 28 33 0 27 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets Fabian Schuiki Michael Schaffner Frank K. Gürkaynak Luca Benini 31 70 0 19 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks Qi Liu Tao Liu Zihao Liu Yanzhi Wang Yier Jin Wujie Wen AAML 35 48 0 14 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms J. Ko Taesik Na M. Amir Saibal Mukhopadhyay 24 148 0 11 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices Yihui He Ji Lin Zhijian Liu Hanrui Wang Li Li Song Han 35 1,343 0 10 Feb 2018
Digital Watermarking for Deep Neural Networks Yuki Nagai Yusuke Uchida S. Sakazawa Shin'ichi Satoh WIGM 31 144 0 06 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks R. Cai Ao Ren Ning Liu Caiwen Ding Luhao Wang Xuehai Qian Massoud Pedram Yanzhi Wang BDL 51 87 0 02 Feb 2018
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks Yuechao Gao Nianhong Liu Shenmin Zhang 21 0 0 23 Jan 2018
Learning to Prune Filters in Convolutional Neural Networks Qiangui Huang S. Kevin Zhou Suya You Ulrich Neumann VLM 28 177 0 23 Jan 2018
SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks Linnan Wang Jinmian Ye Yiyang Zhao Wei Wu Ang Li Shuaiwen Leon Song Zenglin Xu Tim Kraska 3DH 54 264 0 13 Jan 2018
Automated Pruning for Deep Neural Network Compression Franco Manessi A. Rozza Simone Bianco Paolo Napoletano Raimondo Schettini 44 56 0 05 Dec 2017
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks Hardik Sharma Jongse Park Naveen Suda Liangzhen Lai Benson Chau Joo-Young Kim Vikas Chandra H. Esmaeilzadeh MQ 32 488 0 05 Dec 2017
DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning Mengwei Xu Feng Qian Mengze Zhu Feifan Huang Saumay Pushp Xuanzhe Liu 36 23 0 01 Dec 2017
DeepCache: Principled Cache for Mobile Deep Vision Mengwei Xu Mengze Zhu Yunxin Liu F. Lin Xuanzhe Liu VLM 42 201 0 01 Dec 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation Moritz B. Milde Daniel Neil Alessandro Aimar T. Delbruck Giacomo Indiveri MQ 42 9 0 13 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks Sanchari Sen Shubham Jain Swagath Venkataramani A. Raghunathan 24 30 0 07 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing Sourya Dey Yinan Shao K. Chugg Peter A. Beerel 38 16 0 03 Nov 2017
Minimum Energy Quantized Neural Networks Bert Moons Koen Goetschalckx Nick Van Berckelaer Marian Verhelst MQ 33 123 0 01 Nov 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices Dawei Li Xiaolong Wang Deguang Kong 31 98 0 16 Aug 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization Frederick Tung S. Muralidharan Greg Mori 35 35 0 28 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks Yihui He Xiangyu Zhang Jian Sun 128 2,508 0 19 Jul 2017
A Reconfigurable Streaming Deep Convolutional Neural Network Accelerator for Internet of Things Li Du Yuan Du Yilei Li Mau-Chung Frank Chang 30 168 0 08 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations Yoonho Boo Wonyong Sung MQ 30 11 0 01 Jul 2017
NullHop: A Flexible Convolutional Neural Network Accelerator Based on Sparse Representations of Feature Maps Alessandro Aimar Hesham Mostafa Enrico Calabrese A. Rios-Navarro Ricardo Tapiador-Morales ... Moritz B. Milde Federico Corradi A. Linares-Barranco Shih-Chii Liu T. Delbruck 93 243 0 05 Jun 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks Huizi Mao Song Han Jeff Pool Wenshuo Li Xingyu Liu Yu Wang W. Dally 27 241 0 24 May 2017