ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1602.01528
  4. Cited By
EIE: Efficient Inference Engine on Compressed Deep Neural Network

EIE: Efficient Inference Engine on Compressed Deep Neural Network

4 February 2016
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
ArXivPDFHTML

Papers citing "EIE: Efficient Inference Engine on Compressed Deep Neural Network"

50 / 325 papers shown
Title
Compressed Gastric Image Generation Based on Soft-Label Dataset
  Distillation for Medical Data Sharing
Compressed Gastric Image Generation Based on Soft-Label Dataset Distillation for Medical Data Sharing
Guang Li
Ren Togo
Takahiro Ogawa
Miki Haseyama
DD
32
41
0
29 Sep 2022
Accelerating Neural Network Inference with Processing-in-DRAM: From the
  Edge to the Cloud
Accelerating Neural Network Inference with Processing-in-DRAM: From the Edge to the Cloud
Geraldo F. Oliveira
Juan Gómez Luna
Saugata Ghose
Amirali Boroumand
O. Mutlu
34
24
0
19 Sep 2022
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Efficient Quantized Sparse Matrix Operations on Tensor Cores
Shigang Li
Kazuki Osawa
Torsten Hoefler
82
31
0
14 Sep 2022
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Sgap: Towards Efficient Sparse Tensor Algebra Compilation for GPU
Genghan Zhang
Yuetong Zhao
Yanting Tao
Zhongming Yu
Guohao Dai
Sitao Huang
Yuanyuan Wen
Pavlos Petoumenos
Yu Wang
49
4
0
07 Sep 2022
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on
  GPU
Mimose: An Input-Aware Checkpointing Planner for Efficient Training on GPU
Jian-He Liao
Mingzhen Li
Qingxiao Sun
Jiwei Hao
F. Yu
...
Ye Tao
Zicheng Zhang
Hailong Yang
Zhongzhi Luan
D. Qian
28
4
0
06 Sep 2022
Complexity-Driven CNN Compression for Resource-constrained Edge AI
Complexity-Driven CNN Compression for Resource-constrained Edge AI
Muhammad Zawish
Steven Davy
L. Abraham
43
16
0
26 Aug 2022
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural
  Network On Image Classification
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification
Yihe Lu
Maoguo Gong
Wei Zhao
Kaiyuan Feng
Hao Li
VLM
29
0
0
09 Aug 2022
Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For
  Gesture And Speech Recognition
Implementation Of Tiny Machine Learning Models On Arduino 33 BLE For Gesture And Speech Recognition
V. Viswanatha
Ramachandra A.C
R. Prasanna
Prem Chowdary Kakarla
PJ VivekaSimha
Nishanth Mohan
17
15
0
23 Jul 2022
Efficient model compression with Random Operation Access Specific Tile
  (ROAST) hashing
Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Aditya Desai
K. Zhou
Anshumali Shrivastava
22
1
0
21 Jul 2022
Associative Memory Based Experience Replay for Deep Reinforcement
  Learning
Associative Memory Based Experience Replay for Deep Reinforcement Learning
Mengyuan Li
Arman Kazemi
Ann Franchesca Laguna
Sharon Hu
VLM
21
8
0
16 Jul 2022
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of
  Weight Importance
PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance
Qingru Zhang
Simiao Zuo
Chen Liang
Alexander Bukharin
Pengcheng He
Weizhu Chen
T. Zhao
35
78
0
25 Jun 2022
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework
Fault-Tolerant Collaborative Inference through the Edge-PRUNE Framework
Jani Boutellier
Bo Tan
J. Nurmi
26
2
0
16 Jun 2022
Optimal Clipping and Magnitude-aware Differentiation for Improved
  Quantization-aware Training
Optimal Clipping and Magnitude-aware Differentiation for Improved Quantization-aware Training
Charbel Sakr
Steve Dai
Rangharajan Venkatesan
B. Zimmer
W. Dally
Brucek Khailany
MQ
27
41
0
13 Jun 2022
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
38
2
0
20 May 2022
Sharp asymptotics on the compression of two-layer neural networks
Sharp asymptotics on the compression of two-layer neural networks
Mohammad Hossein Amani
Simone Bombari
Marco Mondelli
Rattana Pukdee
Stefano Rini
MLT
27
0
0
17 May 2022
Training Personalized Recommendation Systems from (GPU) Scratch: Look
  Forward not Backwards
Training Personalized Recommendation Systems from (GPU) Scratch: Look Forward not Backwards
Youngeun Kwon
Minsoo Rhu
31
27
0
10 May 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network
  Accelerator
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator
Miao Yu
Tingting Xiang
Venkata Pavan Kumar Miriyala
Trevor E. Carlson
23
1
0
20 Apr 2022
Receding Neuron Importances for Structured Pruning
Receding Neuron Importances for Structured Pruning
Mihai Suteu
Yike Guo
29
1
0
13 Apr 2022
Accelerating Attention through Gradient-Based Learned Runtime Pruning
Accelerating Attention through Gradient-Based Learned Runtime Pruning
Zheng Li
Soroush Ghodrati
Amir Yazdanbakhsh
H. Esmaeilzadeh
Mingu Kang
32
17
0
07 Apr 2022
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
DELTA: Dynamically Optimizing GPU Memory beyond Tensor Recomputation
Yu Tang
Chenyu Wang
Yufan Zhang
Yuliang Liu
Xingcheng Zhang
Linbo Qiao
Zhiquan Lai
Dongsheng Li
26
4
0
30 Mar 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed
  Low-Precision DNNs with Dynamic Fixed-Point Representation
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
25
7
0
22 Mar 2022
Energy-Latency Attacks via Sponge Poisoning
Energy-Latency Attacks via Sponge Poisoning
Antonio Emanuele Cinà
Ambra Demontis
Battista Biggio
Fabio Roli
Marcello Pelillo
SILM
60
29
0
14 Mar 2022
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for
  Memory-Efficient Graph Convolutional Neural Networks
GROW: A Row-Stationary Sparse-Dense GEMM Accelerator for Memory-Efficient Graph Convolutional Neural Networks
Ranggi Hwang
M. Kang
Jiwon Lee
D. Kam
Youngjoo Lee
Minsoo Rhu
GNN
18
22
0
01 Mar 2022
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network
  Accelerators
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators
Lois Orosa
Skanda Koppula
Yaman Umuroglu
Konstantinos Kanellopoulos
Juan Gómez Luna
Michaela Blott
K. Vissers
O. Mutlu
48
4
0
04 Feb 2022
Real-Time Gaze Tracking with Event-Driven Eye Segmentation
Real-Time Gaze Tracking with Event-Driven Eye Segmentation
Yu Feng
Nathan Goulding
Asif Khan
Hans Reyserhove
Yuhao Zhu
33
39
0
19 Jan 2022
Problem-dependent attention and effort in neural networks with
  applications to image resolution and model selection
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
Chris Rohlfs
36
4
0
05 Jan 2022
Speedup deep learning models on GPU by taking advantage of efficient
  unstructured pruning and bit-width reduction
Speedup deep learning models on GPU by taking advantage of efficient unstructured pruning and bit-width reduction
Marcin Pietroñ
Dominik Zurek
30
13
0
28 Dec 2021
Compact Multi-level Sparse Neural Networks with Input Independent
  Dynamic Rerouting
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
51
0
0
21 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
39
26
0
16 Dec 2021
Synapse Compression for Event-Based Convolutional-Neural-Network
  Accelerators
Synapse Compression for Event-Based Convolutional-Neural-Network Accelerators
Lennart Bamberg
Arash Pourtaherian
Luc Waeijen
A. Chahar
Orlando Moreira
27
4
0
13 Dec 2021
Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks
Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks
Ángel López García-Arias
Masanori Hashimoto
Masato Motomura
Jaehoon Yu
39
5
0
24 Nov 2021
A compact butterfly-style silicon photonic-electronic neural chip for
  hardware-efficient deep learning
A compact butterfly-style silicon photonic-electronic neural chip for hardware-efficient deep learning
Chenghao Feng
Jiaqi Gu
Hanqing Zhu
Zhoufeng Ying
Zheng Zhao
David Z. Pan
Ray T. Chen
32
40
0
11 Nov 2021
SPA-GCN: Efficient and Flexible GCN Accelerator with an Application for
  Graph Similarity Computation
SPA-GCN: Efficient and Flexible GCN Accelerator with an Application for Graph Similarity Computation
Atefeh Sohrabizadeh
Yuze Chi
Jason Cong
GNN
29
1
0
10 Nov 2021
Phantom: A High-Performance Computational Core for Sparse Convolutional
  Neural Networks
Phantom: A High-Performance Computational Core for Sparse Convolutional Neural Networks
Mahmood Azhar Qureshi
Arslan Munir
35
0
0
09 Nov 2021
Generalized Depthwise-Separable Convolutions for Adversarially Robust
  and Efficient Neural Networks
Generalized Depthwise-Separable Convolutions for Adversarially Robust and Efficient Neural Networks
Hassan Dbouk
Naresh R Shanbhag
AAML
21
7
0
28 Oct 2021
Demystifying and Generalizing BinaryConnect
Demystifying and Generalizing BinaryConnect
Abhishek Sharma
Yaoliang Yu
Eyyub Sari
Mahdi Zolnouri
V. Nia
MQ
24
8
0
25 Oct 2021
Bandwidth Utilization Side-Channel on ML Inference Accelerators
Bandwidth Utilization Side-Channel on ML Inference Accelerators
Sarbartha Banerjee
Shijia Wei
Prakash Ramrakhyani
Mohit Tiwari
31
3
0
14 Oct 2021
Memory-Efficient CNN Accelerator Based on Interlayer Feature Map
  Compression
Memory-Efficient CNN Accelerator Based on Interlayer Feature Map Compression
Zhuang Shao
Xiaoliang Chen
Li Du
Lei Chen
Yuan Du
Weihao Zhuang
Huadong Wei
Chenjia Xie
Zhongfeng Wang
15
26
0
12 Oct 2021
Compact CNN Models for On-device Ocular-based User Recognition in Mobile
  Devices
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices
Ali Almadan
A. Rattani
CVBM
26
9
0
11 Oct 2021
Prune Your Model Before Distill It
Prune Your Model Before Distill It
Jinhyuk Park
Albert No
VLM
54
27
0
30 Sep 2021
AI Accelerator Survey and Trends
AI Accelerator Survey and Trends
Albert Reuther
Peter Michaleas
Michael Jones
V. Gadepally
S. Samsi
J. Kepner
53
79
0
18 Sep 2021
On the Accuracy of Analog Neural Network Inference Accelerators
On the Accuracy of Analog Neural Network Inference Accelerators
T. Xiao
Ben Feinberg
C. Bennett
V. Prabhakar
Prashant Saxena
V. Agrawal
S. Agarwal
M. Marinella
30
34
0
03 Sep 2021
Design and Scaffolded Training of an Efficient DNN Operator for Computer
  Vision on the Edge
Design and Scaffolded Training of an Efficient DNN Operator for Computer Vision on the Edge
Vinod Ganesan
Pratyush Kumar
45
2
0
25 Aug 2021
Differentiable Subset Pruning of Transformer Heads
Differentiable Subset Pruning of Transformer Heads
Jiaoda Li
Ryan Cotterell
Mrinmaya Sachan
45
54
0
10 Aug 2021
MFAGAN: A Compression Framework for Memory-Efficient On-Device
  Super-Resolution GAN
MFAGAN: A Compression Framework for Memory-Efficient On-Device Super-Resolution GAN
Wenlong Cheng
Mingbo Zhao
Zhiling Ye
Shuhang Gu
24
22
0
27 Jul 2021
Developing efficient transfer learning strategies for robust scene
  recognition in mobile robotics using pre-trained convolutional neural
  networks
Developing efficient transfer learning strategies for robust scene recognition in mobile robotics using pre-trained convolutional neural networks
H. Baumgartl
Ricardo Buettner
3DPC
54
3
0
23 Jul 2021
A High-Performance Adaptive Quantization Approach for Edge CNN
  Applications
A High-Performance Adaptive Quantization Approach for Edge CNN Applications
Hsu-Hsun Chin
R. Tsay
Hsin-I Wu
MQ
24
5
0
18 Jul 2021
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN
  Acceleration
S2TA: Exploiting Structured Sparsity for Energy-Efficient Mobile CNN Acceleration
Zhi-Gang Liu
P. Whatmough
Yuhao Zhu
Matthew Mattina
MQ
35
75
0
16 Jul 2021
Training Compact CNNs for Image Classification using Dynamic-coded
  Filter Fusion
Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion
Mingbao Lin
Bohong Chen
Rongrong Ji
Rongrong Ji
VLM
35
23
0
14 Jul 2021
Trustworthy AI: A Computational Perspective
Trustworthy AI: A Computational Perspective
Haochen Liu
Yiqi Wang
Wenqi Fan
Xiaorui Liu
Yaxin Li
Shaili Jain
Yunhao Liu
Anil K. Jain
Jiliang Tang
FaML
104
197
0
12 Jul 2021
Previous
1234567
Next