Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Practical Lossless Compression with Latent Variables using Bits Back Coding
James Townsend
Thomas Bird
David Barber
DRL
108
142
0
15 Jan 2019
How Compact?: Assessing Compactness of Representations through Layer-Wise Pruning
Hyun-Joo Jung
Jaedeok Kim
Yoonsuck Choe
35
1
0
09 Jan 2019
Collaborative Execution of Deep Neural Networks on Internet of Things Devices
Ramyad Hadidi
Jiashen Cao
Michael S. Ryoo
Hyesoon Kim
50
19
0
08 Jan 2019
FastGRNN: A Fast, Accurate, Stable and Tiny Kilobyte Sized Gated Recurrent Neural Network
Aditya Kusupati
Manish Singh
Kush S. Bhatia
A. Kumar
Prateek Jain
Manik Varma
92
190
0
08 Jan 2019
A Comprehensive guide to Bayesian Convolutional Neural Network with Variational Inference
Kumar Shridhar
F. Laumann
Marcus Liwicki
BDL
UQCV
110
176
0
08 Jan 2019
Guidelines and Benchmarks for Deployment of Deep Learning Models on Smartphones as Real-Time Apps
Abhishek Sehgal
N. Kehtarnavaz
46
38
0
08 Jan 2019
Spatial-Winograd Pruning Enabling Sparse Winograd Convolution
Jiecao Yu
Jongsoo Park
Maxim Naumov
21
7
0
08 Jan 2019
Efficient Winograd Convolution via Integer Arithmetic
Lingchuan Meng
J. Brothers
72
29
0
07 Jan 2019
GASL: Guided Attention for Sparsity Learning in Deep Neural Networks
A. Torfi
Rouzbeh A. Shirvani
Sobhan Soleymani
Nasser M. Nasrabadi
94
8
0
07 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
50
68
0
07 Jan 2019
CC-Net: Image Complexity Guided Network Compression for Biomedical Image Segmentation
Suraj Mishra
Peixian Liang
A. Czajka
Benlin Liu
X. S. Hu
MedIm
55
27
0
06 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
35
6
0
04 Jan 2019
HG-Caffe: Mobile and Embedded Neural Network GPU (OpenCL) Inference Engine with FP16 Supporting
Zhuoran Ji
BDL
28
5
0
03 Jan 2019
A Theoretical Analysis of Deep Q-Learning
Jianqing Fan
Zhuoran Yang
Yuchen Xie
Zhaoran Wang
222
613
0
01 Jan 2019
FPGA-based Accelerators of Deep Learning Networks for Learning and Classification: A Review
Ahmad Shawahna
S. M. Sait
A. El-Maleh
85
380
0
01 Jan 2019
A Noise-Sensitivity-Analysis-Based Test Prioritization Technique for Deep Neural Networks
Long Zhang
Xuechao Sun
Yong Li
Zhenyu Zhang
AAML
53
22
0
01 Jan 2019
Regularized Binary Network Training
Sajad Darabi
Mouloud Belbahri
Matthieu Courbariaux
V. Nia
MQ
65
32
0
31 Dec 2018
Federated Learning via Over-the-Air Computation
Kai Yang
Tao Jiang
Yuanming Shi
Z. Ding
FedML
111
887
0
31 Dec 2018
Per-Tensor Fixed-Point Quantization of the Back-Propagation Algorithm
Charbel Sakr
Naresh R Shanbhag
MQ
84
43
0
31 Dec 2018
Deep Residual Learning in the JPEG Transform Domain
Max Ehrlich
L. Davis
99
125
0
31 Dec 2018
ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers
Ao Ren
Tianyun Zhang
Shaokai Ye
Jiayu Li
Wenyao Xu
Xuehai Qian
Xinyu Lin
Yanzhi Wang
MQ
113
161
0
31 Dec 2018
Quantized Guided Pruning for Efficient Hardware Implementations of Convolutional Neural Networks
G. B. Hacene
Vincent Gripon
M. Arzel
Nicolas Farrugia
Yoshua Bengio
MQ
51
14
0
29 Dec 2018
On the Benefit of Width for Neural Networks: Disappearance of Bad Basins
Dawei Li
Tian Ding
Ruoyu Sun
150
39
0
28 Dec 2018
Exploring Weight Symmetry in Deep Neural Networks
S. Hu
Sergey Zagoruyko
N. Komodakis
59
33
0
28 Dec 2018
Towards a Theoretical Understanding of Hashing-Based Neural Nets
Yibo Lin
Zhao Song
Lin F. Yang
43
5
0
26 Dec 2018
Studying the Plasticity in Deep Convolutional Neural Networks using Random Pruning
Deepak Mittal
S. Bhardwaj
Mitesh M. Khapra
Balaraman Ravindran
3DPC
88
32
0
26 Dec 2018
JALAD: Joint Accuracy- and Latency-Aware Deep Structure Decoupling for Edge-Cloud Execution
Hongshan Li
Chenghao Hu
Jingyan Jiang
Zhi Wang
Yonggang Wen
Wenwu Zhu
110
136
0
25 Dec 2018
A Survey of FPGA Based Deep Learning Accelerators: Challenges and Opportunities
Teng Wang
Chao Wang
Xuehai Zhou
Hua-ping Chen
64
34
0
25 Dec 2018
Dynamic Runtime Feature Map Pruning
Tailin Liang
Lei Wang
Shaobo Shi
C. Glossner
3DPC
44
8
0
24 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
158
12
0
24 Dec 2018
Artificial neural networks condensation: A strategy to facilitate adaption of machine learning in medical settings by reducing computational burden
Dianbo Liu
N. Sepulveda
Ming Zheng
68
7
0
23 Dec 2018
Cascaded Coarse-to-Fine Deep Kernel Networks for Efficient Satellite Image Change Detection
H. Sahbi
38
0
0
21 Dec 2018
COSINE: Compressive Network Embedding on Large-scale Information Networks
Zhengyan Zhang
Cheng Yang
Zhiyuan Liu
Maosong Sun
Zhichong Fang
Bo Zhang
Leyu Lin
GNN
55
7
0
21 Dec 2018
ChamNet: Towards Efficient Network Design through Platform-Aware Model Adaptation
Xiaoliang Dai
Peizhao Zhang
Bichen Wu
Hongxu Yin
Fei Sun
...
Yiming Wu
Yangqing Jia
Peter Vajda
M. Uyttendaele
N. Jha
104
275
0
21 Dec 2018
Slimmable Neural Networks
Jiahui Yu
L. Yang
N. Xu
Jianchao Yang
Thomas Huang
102
559
0
21 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
61
8
0
20 Dec 2018
Adam Induces Implicit Weight Sparsity in Rectifier Neural Networks
A. Yaguchi
Taiji Suzuki
Wataru Asano
Shuhei Nitta
Y. Sakata
A. Tanizawa
36
18
0
19 Dec 2018
Entropy-Constrained Training of Deep Neural Networks
Simon Wiedemann
Arturo Marbán
K. Müller
Wojciech Samek
84
29
0
18 Dec 2018
Expanding the Reach of Federated Learning by Reducing Client Resource Requirements
S. Caldas
Jakub Konecný
H. B. McMahan
Ameet Talwalkar
127
451
0
18 Dec 2018
A Layer Decomposition-Recomposition Framework for Neuron Pruning towards Accurate Lightweight Networks
Weijie Chen
Yuan Zhang
Di Xie
Shiliang Pu
39
13
0
17 Dec 2018
Learning Student Networks via Feature Embedding
Hanting Chen
Yunhe Wang
Chang Xu
Chao Xu
Dacheng Tao
71
96
0
17 Dec 2018
Distill-Net: Application-Specific Distillation of Deep Convolutional Neural Networks for Resource-Constrained IoT Platforms
Mohammad Motamedi
Felix Portillo
Daniel D. Fong
S. Ghiasi
35
3
0
16 Dec 2018
Resource-Scalable CNN Synthesis for IoT Applications
Mohammad Motamedi
Felix Portillo
Mahya Saffarpour
Daniel D. Fong
S. Ghiasi
26
5
0
16 Dec 2018
A Low Effort Approach to Structured CNN Design Using PCA
Isha Garg
Priyadarshini Panda
Kaushik Roy
68
62
0
15 Dec 2018
E-RNN: Design Optimization for Efficient Recurrent Neural Networks in FPGAs
Zhe Li
Caiwen Ding
Siyue Wang
Wujie Wen
Youwei Zhuo
...
Qinru Qiu
Wenyao Xu
Xinyu Lin
Xuehai Qian
Yanzhi Wang
MQ
63
65
0
12 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
Rongrong Ji
82
130
0
11 Dec 2018
Channel selection using Gumbel Softmax
Charles Herrmann
Richard Strong Bowen
Ramin Zabih
58
3
0
11 Dec 2018
Accelerating Convolutional Neural Networks via Activation Map Compression
Georgios Georgiadis
88
76
0
10 Dec 2018
Reliable Identification of Redundant Kernels for Convolutional Neural Network Compression
Wei Wang
Liqiang Zhu
CVBM
61
14
0
10 Dec 2018
No Peek: A Survey of private distributed deep learning
Praneeth Vepakomma
Tristan Swedish
Ramesh Raskar
O. Gupta
Abhimanyu Dubey
SyDa
FedML
86
100
0
08 Dec 2018
Previous
1
2
3
...
55
56
57
...
68
69
70
Next