Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
42
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
110
234
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
56
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
62
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
Mohammad Shoeybi
67
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
179
1,101
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
90
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
100
273
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
57
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
83
114
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
68
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
110
209
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
77
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
213
1,289
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
Moncef Gabbouj
60
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
73
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
77
522
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
57
42
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
117
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQ
UQCV
62
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris Kitani
98
171
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
31
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
36
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
141
142
0
15 Sep 2017
Supervising Unsupervised Learning
Vikas Garg
Adam Kalai
SSL
FedML
51
30
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
26
5
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
72
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
87
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
42
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
34
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
121
1,153
0
06 Sep 2017
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
82
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
83
323
0
02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
21
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
91
107
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
Xinyu Lin
Bo Yuan
GNN
73
260
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
37
27
0
28 Aug 2017
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
46
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
229
2,431
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
72
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
20
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
77
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
81
99
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
98
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
Xuzhao Li
Changsong Liu
CVBM
20
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
75
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNN
MQ
139
35
0
31 Jul 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
72
36
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
86
70
0
25 Jul 2017
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
63
14
0
25 Jul 2017
Previous
1
2
3
...
65
66
67
68
69
70
Next