1602.01528
EIE: Efficient Inference Engine on Compressed Deep Neural Network
4 February 2016
Song Han
Xingyu Liu
Huizi Mao
Jing Pu
A. Pedram
M. Horowitz
W. Dally
Papers citing
"EIE: Efficient Inference Engine on Compressed Deep Neural Network"
50 / 325 papers shown
Title
Analysis of DAWNBench, a Time-to-Accuracy Machine Learning Performance Benchmark
Cody Coleman
Daniel Kang
Deepak Narayanan
Luigi Nardi
Tian Zhao
Jian Zhang
Peter Bailis
K. Olukotun
Christopher Ré
Matei A. Zaharia
13
117
0
04 Jun 2018
Channel Gating Neural Networks
Weizhe Hua
Yuan Zhou
Christopher De Sa
Zhiru Zhang
G. E. Suh
15
180
0
29 May 2018
Compact and Computationally Efficient Representation of Deep Neural Networks
Simon Wiedemann
K. Müller
Wojciech Samek
MQ
42
67
0
27 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
Nam Sung Kim
H. Esmaeilzadeh
30
71
0
10 May 2018
Quantization Mimic: Towards Very Tiny CNN for Object Detection
Yi Wei
Xinyu Pan
Hongwei Qin
Wanli Ouyang
Junjie Yan
ObjD
22
88
0
06 May 2018
SIPs: Succinct Interest Points from Unsupervised Inlierness Probability Learning
Titus Cieslewski
Konstantinos G. Derpanis
Davide Scaramuzza
28
7
0
03 May 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Feiwen Zhu
Jeff Pool
M. Andersch
J. Appleyard
Fung Xie
22
29
0
26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks
Hyeong-Ju Kang
13
88
0
26 Apr 2018
Co-Design of Deep Neural Nets and Neural Net Accelerators for Embedded Vision Applications
K. Kwon
Alon Amid
A. Gholami
Bichen Wu
Krste Asanović
Kurt Keutzer
3DV
OOD
27
22
0
20 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
22
165
0
18 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics
Javier Mauricio Duarte
Song Han
Philip C. Harris
S. Jindariani
E. Kreinar
...
J. Ngadiuba
M. Pierini
R. Rivera
N. Tran
Zhenbin Wu
AI4CE
88
389
0
16 Apr 2018
Euphrates: Algorithm-SoC Co-Design for Low-Power Mobile Continuous Vision
Yuhao Zhu
A. Samajdar
Matthew Mattina
P. Whatmough
32
87
0
29 Mar 2018
A Survey on Deep Learning Methods for Robot Vision
Javier Ruiz-del-Solar
P. Loncomilla
Naiomi Soto
31
60
0
28 Mar 2018
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow
Yuechao Gao
Nianhong Liu
Shenmin Zhang
19
1
0
28 Mar 2018
EVA²: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
44
75
0
16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
31
80
0
16 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning
Maurice Yang
Mahmoud Faraj
Assem Hussein
V. Gaudet
CVBM
22
12
0
15 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
19
184
0
15 Mar 2018
DeepN-JPEG: A Deep Neural Network Favorable JPEG-based Image Compression Framework
Zihao Liu
Tao Liu
Wujie Wen
Lei Jiang
Jie Xu
Yanzhi Wang
Gang Quan
29
97
0
14 Mar 2018
Newton: Gravitating Towards the Physical Limits of Crossbar Acceleration
Anirban Nag
Ali Shafiee
R. Balasubramonian
Vivek Srikumar
N. Muralimanohar
16
37
0
10 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
20
114
0
06 Mar 2018
Hyperdrive: A Multi-Chip Systolically Scalable Binary-Weight CNN Inference Engine
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
MQ
26
19
0
05 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
34
875
0
03 Mar 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
21
25
0
28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
28
33
0
27 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
31
70
0
19 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
35
48
0
14 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
24
148
0
11 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
35
1,343
0
10 Feb 2018
Digital Watermarking for Deep Neural Networks
Yuki Nagai
Yusuke Uchida
S. Sakazawa
Shin'ichi Satoh
WIGM
31
144
0
06 Feb 2018
VIBNN: Hardware Acceleration of Bayesian Neural Networks
R. Cai
Ao Ren
Ning Liu
Caiwen Ding
Luhao Wang
Xuehai Qian
Massoud Pedram
Yanzhi Wang
BDL
51
87
0
02 Feb 2018
Stacked Filters Stationary Flow For Hardware-Oriented Acceleration Of Deep Convolutional Neural Networks
Yuechao Gao
Nianhong Liu
Shenmin Zhang
21
0
0
23 Jan 2018
Learning to Prune Filters in Convolutional Neural Networks
Qiangui Huang
S. Kevin Zhou
Suya You
Ulrich Neumann
VLM
28
177
0
23 Jan 2018
SuperNeurons: Dynamic GPU Memory Management for Training Deep Neural Networks
Linnan Wang
Jinmian Ye
Yiyang Zhao
Wei Wu
Ang Li
Shuaiwen Leon Song
Zenglin Xu
Tim Kraska
3DH
54
264
0
13 Jan 2018
Automated Pruning for Deep Neural Network Compression
Franco Manessi
A. Rozza
Simone Bianco
Paolo Napoletano
Raimondo Schettini
44
56
0
05 Dec 2017
Bit Fusion: Bit-Level Dynamically Composable Architecture for Accelerating Deep Neural Networks
Hardik Sharma
Jongse Park
Naveen Suda
Liangzhen Lai
Benson Chau
Joo-Young Kim
Vikas Chandra
H. Esmaeilzadeh
MQ
32
488
0
05 Dec 2017
DeepWear: Adaptive Local Offloading for On-Wearable Deep Learning
Mengwei Xu
Feng Qian
Mengze Zhu
Feifan Huang
Saumay Pushp
Xuanzhe Liu
36
23
0
01 Dec 2017
DeepCache: Principled Cache for Mobile Deep Vision
Mengwei Xu
Mengze Zhu
Yunxin Liu
F. Lin
Xuanzhe Liu
VLM
42
201
0
01 Dec 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
42
9
0
13 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
24
30
0
07 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Sourya Dey
Yinan Shao
K. Chugg
Peter A. Beerel
38
16
0
03 Nov 2017
Minimum Energy Quantized Neural Networks
Bert Moons
Koen Goetschalckx
Nick Van Berckelaer
Marian Verhelst
MQ
33
123
0
01 Nov 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
31
98
0
16 Aug 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
35
35
0
28 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
128
2,508
0
19 Jul 2017
A Reconfigurable Streaming Deep Convolutional Neural Network Accelerator for Internet of Things
Li Du
Yuan Du
Yilei Li
Mau-Chung Frank Chang
30
168
0
08 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
30
11
0
01 Jul 2017
NullHop: A Flexible Convolutional Neural Network Accelerator Based on Sparse Representations of Feature Maps
Alessandro Aimar
Hesham Mostafa
Enrico Calabrese
A. Rios-Navarro
Ricardo Tapiador-Morales
...
Moritz B. Milde
Federico Corradi
A. Linares-Barranco
Shih-Chii Liu
T. Delbruck
93
243
0
05 Jun 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
27
241
0
24 May 2017