1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
Rong Jin
MQ
76
288
0
24 Jul 2017
Neuron Pruning for Compressing Deep Networks using Maxout Architectures
Fernando Moya Rueda
René Grzeszick
G. Fink
CVBM
66
17
0
21 Jul 2017
ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression
Jian-Hao Luo
Jianxin Wu
Weiyao Lin
68
1,764
0
20 Jul 2017
Channel Pruning for Accelerating Very Deep Neural Networks
Yihui He
Xiangyu Zhang
Jian Sun
338
2,541
0
19 Jul 2017
Pruning Convolutional Neural Networks for Image Instance Retrieval
Gaurav Manek
Jie Lin
V. Chandrasekhar
Ling-Yu Duan
Sateesh Giduthuri
Xiaoli Li
T. Poggio
47
2
0
18 Jul 2017
Fast and Accurate Image Super Resolution by Deep CNN with Skip Connection and Network in Network
Jin Yamanaka
S. Kuwashima
Takio Kurita
SupR
88
216
0
18 Jul 2017
Ternary Residual Networks
Abhisek Kundu
K. Banerjee
Naveen Mellempudi
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
79
8
0
15 Jul 2017
Interleaved Group Convolutions for Deep Neural Networks
Ting Zhang
Guo-Jun Qi
Bin Xiao
Jingdong Wang
131
82
0
10 Jul 2017
An Embedded Deep Learning based Word Prediction
Seunghak Yu
Nilesh Kulkarni
Haejun Lee
J. Kim
51
0
0
06 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part I: general framework
Miguel Á. Carreira-Perpiñán
MQ
46
32
0
05 Jul 2017
ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices
Xiangyu Zhang
Xinyu Zhou
Mengxiao Lin
Jian Sun
AI4TS
377
6,929
0
04 Jul 2017
Structured Sparse Ternary Weight Coding of Deep Neural Networks for Efficient Hardware Implementations
Yoonho Boo
Wonyong Sung
MQ
65
11
0
01 Jul 2017
Irregular Convolutional Neural Networks
Jiabin Ma
Wei Wang
Liang Wang
50
12
0
24 Jun 2017
Balanced Quantization: An Effective and Efficient Approach to Quantized Neural Networks
Shuchang Zhou
Yuzhi Wang
He Wen
Qinyao He
Yuheng Zou
MQ
105
111
0
22 Jun 2017
MEC: Memory-efficient Convolution for Deep Neural Network
Minsik Cho
D. Brand
59
86
0
21 Jun 2017
Using Convolutional Neural Networks in Robots with Limited Computational Resources: Detecting NAO Robots while Playing Soccer
Nicolás Cruz
Kenzo Lobos-Tsunekawa
Javier Ruiz-del-Solar
60
35
0
20 Jun 2017
An Entropy-based Pruning Method for CNN Compression
Jian-Hao Luo
Jianxin Wu
48
180
0
19 Jun 2017
Sobolev Training for Neural Networks
Wojciech M. Czarnecki
Simon Osindero
Max Jaderberg
G. Swirszcz
Razvan Pascanu
93
248
0
15 Jun 2017
LinkNet: Exploiting Encoder Representations for Efficient Semantic Segmentation
Abhishek Chaurasia
Eugenio Culurciello
SSeg
89
1,394
0
14 Jun 2017
Getting deep recommenders fit: Bloom embeddings for sparse binary input/output networks
Joan Serrà
Alexandros Karatzoglou
85
53
0
13 Jun 2017
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
85
12
0
13 Jun 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
114
54
0
07 Jun 2017
DeepIoT: Compressing Deep Neural Network Structures for Sensing Systems with a Compressor-Critic Framework
Shuochao Yao
Yiran Zhao
Aston Zhang
Lu Su
Tarek Abdelzaher
92
187
0
05 Jun 2017
IDK Cascades: Fast Deep Learning by Learning not to Overthink
Xin Wang
Yujia Luo
D. Crankshaw
Alexey Tumanov
Fisher Yu
Joseph E. Gonzalez
85
108
0
03 Jun 2017
MobiRNN: Efficient Recurrent Neural Network Execution on Mobile GPU
Qingqing Cao
Niranjan Balasubramanian
A. Balasubramanian
72
63
0
03 Jun 2017
Tensor Contraction Layers for Parsimonious Deep Nets
Jean Kossaifi
Aran Khanna
Zachary Chase Lipton
Tommaso Furlanello
Anima Anandkumar
80
60
0
01 Jun 2017
Deep Mutual Learning
Ying Zhang
Tao Xiang
Timothy M. Hospedales
Huchuan Lu
FedML
172
1,662
0
01 Jun 2017
Learning Time/Memory-Efficient Deep Architectures with Budgeted Super Networks
Tom Véniat
Ludovic Denoyer
130
21
0
31 May 2017
Computation-Performance Optimization of Convolutional Neural Networks with Redundant Kernel Removal
Chih-Ting Liu
Yi-Heng Wu
Yu-Sheng Lin
Shao-Yi Chien
SupR
28
6
0
30 May 2017
Iterative Machine Teaching
Weiyang Liu
Bo Dai
Ahmad Humayun
C. Tay
Chen Yu
Linda B. Smith
James M. Rehg
Le Song
132
143
0
30 May 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
99
20
0
25 May 2017
Exploring the Regularity of Sparse Structure in Convolutional Neural Networks
Huizi Mao
Song Han
Jeff Pool
Wenshuo Li
Xingyu Liu
Yu Wang
W. Dally
136
244
0
24 May 2017
Bayesian Compression for Deep Learning
Christos Louizos
Karen Ullrich
Max Welling
UQCV
BDL
210
481
0
24 May 2017
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
88
1,135
0
23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
238
994
0
22 May 2017
Structural Compression of Convolutional Neural Networks
R. Abbasi-Asl
Bin Yu
63
16
0
20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise
Kirill Neklyudov
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
152
189
0
20 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
91
76
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
40
13
0
19 May 2017
Building effective deep neural network architectures one feature at a time
Martin Mundt
Tobias Weis
K. Konda
Visvanathan Ramesh
34
1
0
18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
Xuefeng Xiao
Yafeng Yang
Tasweer Ahmad
Lianwen Jin
Tianhai Chang
63
21
0
15 May 2017
Incremental Learning Through Deep Adaptation
Amir Rosenfeld
John K. Tsotsos
CLL
83
280
0
11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
78
36
0
04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
94
179
0
03 May 2017
Image reconstruction by domain transform manifold learning
Bo Zhu
Jeremiah Zhe Liu
Bruce Rosen
Matthew S. Rosen
101
1,541
0
28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
104
1,416
0
27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
Shaoshuai Shi
Xiaowen Chu
129
44
0
25 Apr 2017
Accurate Optical Flow via Direct Cost Volume Processing
Jia Xu
René Ranftl
V. Koltun
131
240
0
24 Apr 2017
A Review on Deep Learning Techniques Applied to Semantic Segmentation
Alberto Garcia-Garcia
Sergio Orts
Sergiu Oprea
Victor Villena-Martinez
Jose Garcia-Rodriguez
3DV
SSeg
136
1,278
0
22 Apr 2017
Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
Erich Elsen
G. Diamos
Shubho Sengupta
82
313
0
17 Apr 2017