1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,448 papers shown
Deep Rewiring: Training very sparse deep networks
G. Bellec
David Kappel
Wolfgang Maass
Robert Legenstein
BDL
29
275
0
14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
42
9
0
13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression
Brandon Reagen
Udit Gupta
Bob Adolf
Michael Mitzenmacher
Alexander M. Rush
Gu-Yeon Wei
David Brooks
27
38
0
13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution
Haoyu Ren
Mostafa El-Khamy
Jungwon Lee
SupR
33
27
0
11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations
Ting Chen
Martin Renqiang Min
Yizhou Sun
19
10
0
08 Nov 2017
Revealing structure components of the retina by deep learning networks
Qianyu Yan
Zhaofei Yu
Feng Chen
Jian K. Liu
FAtt
16
7
0
08 Nov 2017
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
19
136
0
08 Nov 2017
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
21
172
0
07 Nov 2017
Moonshine: Distilling with Cheap Convolutions
Elliot J. Crowley
Gavia Gray
Amos Storkey
33
120
0
07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression
R. Abbasi-Asl
Bin-Xia Yu
FAtt
19
21
0
07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
24
30
0
07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
24
11
0
06 Nov 2017
Neural Speed Reading via Skim-RNN
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
42
79
0
06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Sourya Dey
Yinan Shao
K. Chugg
Peter A. Beerel
38
16
0
03 Nov 2017
ReBNet: Residual Binarized Neural Network
M. Ghasemzadeh
Mohammad Samragh
F. Koushanfar
MQ
30
4
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
23
36
0
03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
40
129
0
03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
24
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
39
232
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
25
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
38
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
Mohammad Shoeybi
40
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
40
1,087
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
21
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
25
270
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
33
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
27
112
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
22
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
25
208
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
31
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
60
1,253
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
Moncef Gabbouj
18
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
24
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
36
505
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
31
41
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
35
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQ
UQCV
29
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris Kitani
26
170
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
24
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
25
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
35
140
0
15 Sep 2017
Supervising Unsupervised Learning
Vikas K. Garg
Adam Kalai
SSL
FedML
26
29
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
17
4
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
36
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
17
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
31
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
32
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
33
1,118
0
06 Sep 2017
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
44
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
17
322
0
02 Sep 2017