Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1602.01528
Cited By
EIE: Efficient Inference Engine on Compressed Deep Neural Network
4 February 2016
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
Re-assign community
ArXiv
PDF
HTML
Papers citing
"EIE: Efficient Inference Engine on Compressed Deep Neural Network"
50 / 325 papers shown
Title
Event-Based Eye Tracking. 2025 Event-based Vision Workshop
Qinyu Chen
Chang Gao
Min Liu
Daniele Perrone
Yan Ru Pei
...
Hoang M. Truong
Vinh-Thuan Ly
Huy G. Tran
Thuan-Phat Nguyen
Tram T. Doan
46
1
0
25 Apr 2025
A 71.2-
μ
μ
μ
W Speech Recognition Accelerator with Recurrent Spiking Neural Network
Chih-Chyau Yang
Tian-Sheuan Chang
73
1
0
27 Mar 2025
Reservoir Network with Structural Plasticity for Human Activity Recognition
Abdullah M. Zyarah
Alaa M. Abdul-Hadi
Dhireesha Kudithipudi
31
3
0
01 Mar 2025
Advancing Weight and Channel Sparsification with Enhanced Saliency
Xinglong Sun
Maying Shen
Hongxu Yin
Lei Mao
Pavlo Molchanov
Jose M. Alvarez
60
1
0
05 Feb 2025
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator
Guoyu Li
Shengyu Ye
Chong Chen
Yang Wang
Fan Yang
Ting Cao
Cheng Liu
Mohamed M. Sabry
Mao Yang
MQ
204
0
0
18 Jan 2025
DPD-NeuralEngine: A 22-nm 6.6-TOPS/W/mm
2
^2
2
Recurrent Neural Network Accelerator for Wideband Power Amplifier Digital Pre-Distortion
Ang Li
Haolin Wu
Yizhuo Wu
Qinyu Chen
Leo C. N. de Vreede
Chang Gao
31
0
0
15 Oct 2024
Accumulator-Aware Post-Training Quantization
Ian Colbert
Fabian Grob
Giuseppe Franco
Jinjie Zhang
Rayan Saab
MQ
37
4
0
25 Sep 2024
Structured Pruning for Efficient Visual Place Recognition
Oliver Grainge
Michael Milford
Indu Bodala
Sarvapali D. Ramchurn
Shoaib Ehsan
51
1
0
12 Sep 2024
Quality Scalable Quantization Methodology for Deep Learning on Edge
S. Khaliq
Rehan Hafiz
MQ
53
1
0
15 Jul 2024
Learning-Based Heavy Hitters and Flow Frequency Estimation in Streams
Rana Shahout
Michael Mitzenmacher
25
2
0
24 Jun 2024
A Generic Layer Pruning Method for Signal Modulation Recognition Deep Learning Models
Yao Lu
Yutao Zhu
Yuqi Li
Dongwei Xu
Yun Lin
Qi Xuan
Xiaoniu Yang
39
5
0
12 Jun 2024
ReDistill: Residual Encoded Distillation for Peak Memory Reduction of CNNs
Fang Chen
Gourav Datta
Mujahid Al Rafi
Hyeran Jeon
Meng Tang
93
1
0
06 Jun 2024
Dual sparse training framework: inducing activation map sparsity via Transformed
ℓ
1
\ell1
ℓ
1
regularization
Xiaolong Yu
Cong Tian
56
0
0
30 May 2024
Neural Network Compression for Reinforcement Learning Tasks
Dmitry A. Ivanov
D. Larionov
Oleg V. Maslennikov
V. Voevodin
OffRL
AI4CE
55
0
0
13 May 2024
Iterative Filter Pruning for Concatenation-based CNN Architectures
Svetlana Pavlitska
Oliver Bagge
Federico Nicolás Peccia
Toghrul Mammadov
J. Marius Zöllner
VLM
3DPC
48
2
0
04 May 2024
Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization
Bailey J. Eccles
Leon Wong
Blesson Varghese
43
2
0
22 Apr 2024
A 1.6-mW Sparse Deep Learning Accelerator for Speech Separation
Chih-Chyau Yang
Tian-Sheuan Chang
31
0
0
15 Dec 2023
Accelerating Convolutional Neural Network Pruning via Spatial Aura Entropy
Bogdan Musat
Razvan Andonie
26
0
0
08 Dec 2023
The Road to On-board Change Detection: A Lightweight Patch-Level Change Detection Network via Exploring the Potential of Pruning and Pooling
Lihui Xue
Zhihao Wang
Xueqian Wang
Gang Li
50
1
0
16 Oct 2023
YFlows: Systematic Dataflow Exploration and Code Generation for Efficient Neural Network Inference using SIMD Architectures on CPUs
Cyrus Zhou
Zack Hassman
Ruize Xu
Dhirpal Shah
Vaughn Richard
Yanjing Li
37
1
0
01 Oct 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
44
3
0
20 Jul 2023
Minimizing Energy Consumption of Deep Learning Models by Energy-Aware Training
Dario Lazzaro
Antonio Emanuele Cinà
Maura Pintor
Ambra Demontis
Battista Biggio
Fabio Roli
Marcello Pelillo
43
7
0
01 Jul 2023
Filter Pruning for Efficient CNNs via Knowledge-driven Differential Filter Sampler
Shaohui Lin
Wenxuan Huang
Jiao Xie
Baochang Zhang
Yunhang Shen
Zhou Yu
Jungong Han
David Doermann
30
2
0
01 Jul 2023
Group channel pruning and spatial attention distilling for object detection
Yun Chu
Pu Li
Yong Bai
Zhuhua Hu
Yongqing Chen
Jiafeng Lu
VLM
31
13
0
02 Jun 2023
SPADE: Sparse Pillar-based 3D Object Detection Accelerator for Autonomous Driving
Minjae Lee
Seongmin Park
Hyung-Se Kim
Minyong Yoon
Jangwhan Lee
Junwon Choi
Nam Sung Kim
Mingu Kang
Jungwook Choi
3DPC
26
5
0
12 May 2023
Towards Carbon-Neutral Edge Computing: Greening Edge AI by Harnessing Spot and Future Carbon Markets
Huirong Ma
Zhi Zhou
Xiaoxi Zhang
Xu Chen
18
12
0
22 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
35
5
0
06 Apr 2023
Competitive plasticity to reduce the energetic costs of learning
Mark C. W. van Rossum
21
2
0
04 Apr 2023
Physics-aware Roughness Optimization for Diffractive Optical Neural Networks
Shangli Zhou
Yingjie Li
Minhan Lou
Weilu Gao
Zhijie Shi
Cunxi Yu
Caiwen Ding
38
2
0
04 Apr 2023
Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training
Xinwei Ou
Zhangxin Chen
Ce Zhu
Yipeng Liu
41
4
0
22 Mar 2023
SR-init: An interpretable layer pruning method
Hui Tang
Yao Lu
Qi Xuan
20
9
0
14 Mar 2023
Full Stack Optimization of Transformer Inference: a Survey
Sehoon Kim
Coleman Hooper
Thanakul Wattanawong
Minwoo Kang
Ruohan Yan
...
Qijing Huang
Kurt Keutzer
Michael W. Mahoney
Y. Shao
A. Gholami
MQ
41
102
0
27 Feb 2023
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Ruokai Yin
Youngeun Kim
Yuhang Li
Abhishek Moitra
Nitin Satpute
Anna Hambitzer
Priyadarshini Panda
44
20
0
13 Feb 2023
SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks
Mahdi Nikdan
Tommaso Pegolotti
Eugenia Iofinova
Eldar Kurtic
Dan Alistarh
31
11
0
09 Feb 2023
A
2
Q
\rm A^2Q
A
2
Q
: Aggregation-Aware Quantization for Graph Neural Networks
Zeyu Zhu
Fanrong Li
Zitao Mo
Qinghao Hu
Gang Li
Zejian Liu
Xiaoyao Liang
Jian Cheng
GNN
MQ
42
4
0
01 Feb 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
19
1
0
26 Jan 2023
Slice-and-Forge: Making Better Use of Caches for Graph Convolutional Network Accelerators
Min-hee Yoo
Jaeyong Song
Hyeyoon Lee
Jounghoo Lee
Namhyung Kim
Youngsok Kim
Jinho Lee
GNN
50
5
0
24 Jan 2023
A Theory of I/O-Efficient Sparse Neural Network Inference
Niels Gleinig
Tal Ben-Nun
Torsten Hoefler
33
0
0
03 Jan 2023
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
28
3
0
17 Dec 2022
Algorithm and Hardware Co-Design of Energy-Efficient LSTM Networks for Video Recognition with Hierarchical Tucker Tensor Decomposition
Yu Gong
Miao Yin
Lingyi Huang
Chunhua Deng
Yang Sui
Bo Yuan
26
6
0
05 Dec 2022
CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness
Huy Phan
Miao Yin
Yang Sui
Bo Yuan
S. Zonouz
AAML
GNN
38
8
0
04 Dec 2022
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
23
1
0
25 Nov 2022
Pruning Very Deep Neural Network Channels for Efficient Inference
Yihui He
35
1
0
14 Nov 2022
LearningGroup: A Real-Time Sparse Training on FPGA via Learnable Weight Grouping for Multi-Agent Reinforcement Learning
Jenny Yang
Jaeuk Kim
Joo-Young Kim
31
2
0
29 Oct 2022
Improved Projection Learning for Lower Dimensional Feature Maps
Ilan Price
Jared Tanner
26
3
0
27 Oct 2022
Gradient-based Weight Density Balancing for Robust Dynamic Sparse Training
Mathias Parger
Alexander Ertl
Paul Eibensteiner
J. H. Mueller
Martin Winter
M. Steinberger
34
0
0
25 Oct 2022
Weight Fixing Networks
Christopher Subia-Waud
S. Dasmahapatra
MQ
32
2
0
24 Oct 2022
RSC: Accelerating Graph Neural Networks Training via Randomized Sparse Computations
Zirui Liu
Sheng-Wei Chen
Kaixiong Zhou
Daochen Zha
Xiao Huang
Xia Hu
42
15
0
19 Oct 2022
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos
Mathias Parger
Chengcheng Tang
Thomas Neff
Christopher D. Twigg
Cem Keskin
Robert Y. Wang
M. Steinberger
32
6
0
18 Oct 2022
Approximate Computing and the Efficient Machine Learning Expedition
J. Henkel
Hai Helen Li
A. Raghunathan
M. Tahoori
Swagath Venkataramani
Xiaoxuan Yang
Georgios Zervakis
28
17
0
02 Oct 2022
1
2
3
4
5
6
7
Next