Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
89
44
0
22 May 2018
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
56
24
0
22 May 2018
Parsimonious Bayesian deep networks
Mingyuan Zhou
BDL
43
8
0
22 May 2018
AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference
Xin He
Liu Ke
Wenyan Lu
Guihai Yan
Xuan Zhang
54
34
0
21 May 2018
Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
40
6
0
21 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
81
27
0
21 May 2018
Neural Network Compression using Transform Coding and Clustering
Thorsten Laude
Yannick Richter
Jörn Ostermann
27
4
0
18 May 2018
RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks
Xiuyuan Cheng
Qiang Qiu
Robert Calderbank
Guillermo Sapiro
58
43
0
17 May 2018
Object detection at 200 Frames Per Second
Rakesh Mehta
Cemalettin Öztürk
ObjD
74
61
0
16 May 2018
Lightweight Pyramid Networks for Image Deraining
Xueyang Fu
Borong Liang
Yue Huang
Xinghao Ding
John Paisley
64
327
0
16 May 2018
Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks
Wenshuo Li
Jincheng Yu
Xuefei Ning
Pengjun Wang
Qi Wei
Yu Wang
Huazhong Yang
AAML
93
63
0
14 May 2018
Unifying and Merging Well-trained Deep Neural Networks for Inference Stage
Yi-Min Chou
Yi-Ming Chan
Jia-Hong Lee
Chih-Yi Chiu
Chu-Song Chen
MoMe
71
34
0
14 May 2018
ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Rudra P. K. Poudel
Ujwal D. Bonde
Stephan Liwicki
Christopher Zach
SSeg
83
232
0
11 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
Nam Sung Kim
H. Esmaeilzadeh
93
74
0
10 May 2018
Boosting up Scene Text Detectors with Guided CNN
Xiaoyu Yue
Zhanghui Kuang
Zhaoyang Zhang
Zhenfang Chen
Pan He
Yu Qiao
Wayne Zhang
33
8
0
10 May 2018
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Charles Eckert
Xiaowei Wang
Jingcheng Wang
Arun K. Subramaniyan
R. Iyer
D. Sylvester
D. Blaauw
R. Das
MQ
53
341
0
09 May 2018
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data Quantization-Aware Deep Networks
Fuqiang Liu
Chenchen Liu
32
5
0
08 May 2018
A Hierarchical Matcher using Local Classifier Chains
Lingfeng Zhang
I. Kakadiaris
16
0
0
07 May 2018
Enhancing the Regularization Effect of Weight Pruning in Artificial Neural Networks
Brian Bartoldson
Adrian Barbu
G. Erlebacher
45
5
0
04 May 2018
Power Law in Sparsified Deep Neural Networks
Lu Hou
James T. Kwok
52
3
0
04 May 2018
Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Shu Kong
Charless C. Fowlkes
99
40
0
03 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
104
45
0
29 Apr 2018
Precise Box Score: Extract More Information from Datasets to Improve the Performance of Face Detection
Ce Qi
Xiaoping Chen
Pingyu Wang
Fei Su
CVBM
26
1
0
28 Apr 2018
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
51
3
0
28 Apr 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Feiwen Zhu
Jeff Pool
M. Andersch
J. Appleyard
Fung Xie
53
30
0
26 Apr 2018
Profile-guided memory optimization for deep neural networks
Taro Sekiyama
T. Imamichi
Haruki Imai
Raymond H. Putra
99
22
0
26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks
Hyeong-Ju Kang
102
90
0
26 Apr 2018
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
245
503
0
24 Apr 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
95
418
0
24 Apr 2018
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server
Guoxin Cui
Jun Xu
Wei Zeng
Yanyan Lan
Jiafeng Guo
Xueqi Cheng
MQ
33
13
0
22 Apr 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
133
566
0
20 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression
Shihui Yin
Gaurav Srivastava
S. Venkataramanaiah
C. Chakrabarti
Visar Berisha
Jae-sun Seo
32
8
0
19 Apr 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
R. Wang
Xiang Li
Charles X. Ling
ObjD
87
460
0
18 Apr 2018
Deep Face Recognition: A Survey
Mei Wang
Weihong Deng
NoLa
158
1,243
0
18 Apr 2018
UCNN: Exploiting Computational Reuse in Deep Neural Networks via Weight Repetition
Kartik Hegde
Jiyong Yu
R. Agrawal
Mengjia Yan
Michael Pellauer
Christopher W. Fletcher
79
166
0
18 Apr 2018
Training a Binary Weight Object Detector by Knowledge Transfer for Autonomous Driving
Jiaolong Xu
Peng Wang
Hengzhang Yang
Antonio M. López
MQ
62
23
0
17 Apr 2018
IGCV
2
2
2
: Interleaved Structured Sparse Convolutional Neural Networks
Guotian Xie
Jingdong Wang
Ting Zhang
Jianhuang Lai
Richang Hong
Guo-Jun Qi
120
106
0
17 Apr 2018
Non-Vacuous Generalization Bounds at the ImageNet Scale: A PAC-Bayesian Compression Approach
Wenda Zhou
Victor Veitch
Morgane Austern
Ryan P. Adams
Peter Orbanz
100
215
0
16 Apr 2018
Fast inference of deep neural networks in FPGAs for particle physics
Javier Mauricio Duarte
Song Han
Philip C. Harris
S. Jindariani
E. Kreinar
...
J. Ngadiuba
M. Pierini
R. Rivera
N. Tran
Zhenbin Wu
AI4CE
164
396
0
16 Apr 2018
Data-Dependent Coresets for Compressing Neural Networks with Applications to Generalization Bounds
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
95
79
0
15 Apr 2018
Select, Attend, and Transfer: Light, Learnable Skip Connections
Saeid Asgari Taghanaki
A. Bentaieb
Anmol Sharma
S. Kevin Zhou
Yefeng Zheng
...
Puneet Sharma
Sasa Grbic
Zhoubing Xu
Dorin Comaniciu
Ghassan Hamarneh
94
20
0
14 Apr 2018
Pieces of Eight: 8-bit Neural Machine Translation
Jerry Quinn
Miguel Ballesteros
MQ
96
25
0
13 Apr 2018
The unreasonable effectiveness of the forget gate
J. Westhuizen
Joan Lasenby
90
89
0
13 Apr 2018
A Compact Network Learning Model for Distribution Regression
C. Kou
H. Lee
Teck Khim Ng
64
10
0
13 Apr 2018
Hybrid Binary Networks: Optimizing for Accuracy, Efficiency and Memory
Ameya Prabhu
Vishal Batchu
Rohit Gajawada
Sri Aurobindo Munagala
A. Namboodiri
MQ
55
18
0
11 Apr 2018
Crafting a Toolchain for Image Restoration by Deep Reinforcement Learning
K. Yu
Chao Dong
Liang Lin
Chen Change Loy
CLL
OffRL
100
177
0
10 Apr 2018
A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Kaiqi Zhang
Jian Tang
Wujie Wen
M. Fardad
Yanzhi Wang
98
439
0
10 Apr 2018
Distribution-Aware Binarization of Neural Networks for Sketch Recognition
Ameya Prabhu
Vishal Batchu
Sri Aurobindo Munagala
Rohit Gajawada
A. Namboodiri
MQ
109
5
0
09 Apr 2018
Universal and Succinct Source Coding of Deep Neural Networks
Sourya Basu
Lav Varshney
BDL
21
3
0
09 Apr 2018
Estimating Depth from RGB and Sparse Sensing
Zhao Chen
Vijay Badrinarayanan
Gilad Drozdov
Andrew Rabinovich
MDE
3DV
76
109
0
08 Apr 2018
Previous
1
2
3
...
61
62
63
...
68
69
70
Next