EIE: Efficient Inference Engine on Compressed Deep Neural Network

4 February 2016

Song Han

Papers citing "EIE: Efficient Inference Engine on Compressed Deep Neural Network"

50 / 325 papers shown

Title
Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing Guang Li Ren Togo Takahiro Ogawa Miki Haseyama DD 32 41 0 29 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud Geraldo F. Oliveira Juan Gómez Luna Saugata Ghose Amirali Boroumand O. Mutlu 34 24 0 19 Sep 2022
Efficient Quantized Sparse Matrix Operations on Tensor Cores Shigang Li Kazuki Osawa Torsten Hoefler 82 31 0 14 Sep 2022
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU Genghan Zhang Yuetong Zhao Yanting Tao Zhongming Yu Guohao Dai Sitao Huang Yuanyuan Wen Pavlos Petoumenos Yu Wang 49 4 0 07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU Jian-He Liao Mingzhen Li Qingxiao Sun Jiwei Hao F. Yu ... Ye Tao Zicheng Zhang Hailong Yang Zhongzhi Luan D. Qian 28 4 0 06 Sep 2022
Complexity-Driven CNN Compression for Resource-constrained Edge AI Muhammad Zawish Steven Davy L. Abraham 43 16 0 26 Aug 2022
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification Yihe Lu Maoguo Gong Wei Zhao Kaiyuan Feng Hao Li VLM 29 0 0 09 Aug 2022
Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For Gesture And Speech Recognition V. Viswanatha Ramachandra A.C R. Prasanna Prem Chowdary Kakarla PJ VivekaSimha Nishanth Mohan 17 15 0 23 Jul 2022
Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing Aditya Desai K. Zhou Anshumali Shrivastava 22 1 0 21 Jul 2022
Associative Memory Based Experience Replay for Deep Reinforcement Learning Mengyuan Li Arman Kazemi Ann Franchesca Laguna Sharon Hu VLM 21 8 0 16 Jul 2022
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance Qingru Zhang Simiao Zuo Chen Liang Alexander Bukharin Pengcheng He Weizhu Chen T. Zhao 35 78 0 25 Jun 2022
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework Jani Boutellier Bo Tan J. Nurmi 26 2 0 16 Jun 2022
Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training Charbel Sakr Steve Dai Rangharajan Venkatesan B. Zimmer W. Dally Brucek Khailany MQ 27 41 0 13 Jun 2022
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality A. Inci Siri Garudanagiri Virupaksha Aman Jain Venkata Vivek Thallam Ruizhou Ding Diana Marculescu MQ 38 2 0 20 May 2022
Sharp asymptotics on the compression of two-layer neural networks Mohammad Hossein Amani Simone Bombari Marco Mondelli Rattana Pukdee Stefano Rini MLT 27 0 0 17 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards Youngeun Kwon Minsoo Rhu 31 27 0 10 May 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator Miao Yu Tingting Xiang Venkata Pavan Kumar Miriyala Trevor E. Carlson 23 1 0 20 Apr 2022
Receding Neuron Importances for Structured Pruning Mihai Suteu Yike Guo 29 1 0 13 Apr 2022
Accelerating Attention through Gradient-Based Learned Runtime Pruning Zheng Li Soroush Ghodrati Amir Yazdanbakhsh H. Esmaeilzadeh Mingu Kang 32 17 0 07 Apr 2022
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation Yu Tang Chenyu Wang Yufan Zhang Yuliang Liu Xingcheng Zhang Linbo Qiao Zhiquan Lai Dongsheng Li 26 4 0 30 Mar 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation Ahmad Shawahna S. M. Sait A. El-Maleh Irfan Ahmad MQ 25 7 0 22 Mar 2022
Energy-Latency Attacks via Sponge Poisoning Antonio Emanuele Cinà Ambra Demontis Battista Biggio Fabio Roli Marcello Pelillo SILM 60 29 0 14 Mar 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks Ranggi Hwang M. Kang Jiwon Lee D. Kam Youngjoo Lee Minsoo Rhu GNN 18 22 0 01 Mar 2022
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators Lois Orosa Skanda Koppula Yaman Umuroglu Konstantinos Kanellopoulos Juan Gómez Luna Michaela Blott K. Vissers O. Mutlu 48 4 0 04 Feb 2022
Real-Time Gaze Tracking with Event-Driven Eye Segmentation Yu Feng Nathan Goulding Asif Khan Hans Reyserhove Yuhao Zhu 33 39 0 19 Jan 2022
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection Chris Rohlfs 36 4 0 05 Jan 2022
Speedup deep learning models on GPU by taking advantage of efficient unstructured pruning and bit-width reduction Marcin Pietroñ Dominik Zurek 30 13 0 28 Dec 2021
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting Minghai Qin Tianyun Zhang Fei Sun Yen-kuang Chen M. Fardad Yanzhi Wang Yuan Xie 51 0 0 21 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End Xuanyi Dong D. Kedziora Katarzyna Musial Bogdan Gabrys 39 26 0 16 Dec 2021
Synapse Compression for Event-Based Convolutional-Neural-Network Accelerators Lennart Bamberg Arash Pourtaherian Luc Waeijen A. Chahar Orlando Moreira 27 4 0 13 Dec 2021
Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks Ángel López García-Arias Masanori Hashimoto Masato Motomura Jaehoon Yu 39 5 0 24 Nov 2021
A compact butterfly-style silicon photonic-electronic neural chip for hardware-efficient deep learning Chenghao Feng Jiaqi Gu Hanqing Zhu Zhoufeng Ying Zheng Zhao David Z. Pan Ray T. Chen 32 40 0 11 Nov 2021
SPA-GCN: Efficient and Flexible GCN Accelerator with an Application for Graph Similarity Computation Atefeh Sohrabizadeh Yuze Chi Jason Cong GNN 29 1 0 10 Nov 2021
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks Mahmood Azhar Qureshi Arslan Munir 35 0 0 09 Nov 2021
Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks Hassan Dbouk Naresh R Shanbhag AAML 21 7 0 28 Oct 2021
Demystifying and Generalizing BinaryConnect Abhishek Sharma Yaoliang Yu Eyyub Sari Mahdi Zolnouri V. Nia MQ 24 8 0 25 Oct 2021
Bandwidth Utilization Side-Channel on ML Inference Accelerators Sarbartha Banerjee Shijia Wei Prakash Ramrakhyani Mohit Tiwari 31 3 0 14 Oct 2021
Memory-Efficient CNN Accelerator Based on Interlayer Feature Map Compression Zhuang Shao Xiaoliang Chen Li Du Lei Chen Yuan Du Weihao Zhuang Huadong Wei Chenjia Xie Zhongfeng Wang 15 26 0 12 Oct 2021
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices Ali Almadan A. Rattani CVBM 26 9 0 11 Oct 2021
Prune Your Model Before Distill It Jinhyuk Park Albert No VLM 54 27 0 30 Sep 2021
AI Accelerator Survey and Trends Albert Reuther Peter Michaleas Michael Jones V. Gadepally S. Samsi J. Kepner 53 79 0 18 Sep 2021
On the Accuracy of Analog Neural Network Inference Accelerators T. Xiao Ben Feinberg C. Bennett V. Prabhakar Prashant Saxena V. Agrawal S. Agarwal M. Marinella 30 34 0 03 Sep 2021
Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge Vinod Ganesan Pratyush Kumar 45 2 0 25 Aug 2021
Differentiable Subset Pruning of Transformer Heads Jiaoda Li Ryan Cotterell Mrinmaya Sachan 45 54 0 10 Aug 2021
MFAGAN: A Compression Framework for Memory-Efficient On-Device Super-Resolution GAN Wenlong Cheng Mingbo Zhao Zhiling Ye Shuhang Gu 24 22 0 27 Jul 2021
Developing efficient transfer learning strategies for robust scene recognition in mobile robotics using pre-trained convolutional neural networks H. Baumgartl Ricardo Buettner 3DPC 54 3 0 23 Jul 2021
A High-Performance Adaptive Quantization Approach for Edge CNN Applications Hsu-Hsun Chin R. Tsay Hsin-I Wu MQ 24 5 0 18 Jul 2021
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration Zhi-Gang Liu P. Whatmough Yuhao Zhu Matthew Mattina MQ 35 75 0 16 Jul 2021
Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion Mingbao Lin Bohong Chen Rongrong Ji Rongrong Ji VLM 35 23 0 14 Jul 2021
Trustworthy AI: A Computational Perspective Haochen Liu Yiqi Wang Wenqi Fan Xiaorui Liu Yaxin Li Shaili Jain Yunhao Liu Anil K. Jain Jiliang Tang FaML 104 197 0 12 Jul 2021