Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Towards Compact and Robust Deep Neural Networks
Vikash Sehwag
Shiqi Wang
Prateek Mittal
Suman Jana
AAML
82
40
0
14 Jun 2019
Effectiveness of Distillation Attack and Countermeasure on Neural Network Watermarking
Ziqi Yang
Hung Dang
E. Chang
AAML
116
34
0
14 Jun 2019
Run-Time Efficient RNN Compression for Inference on Edge Devices
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Ganesh S. Dasika
Matthew Mattina
90
19
0
12 Jun 2019
Table-Based Neural Units: Fully Quantizing Networks for Multiply-Free Inference
Michele Covell
David Marwood
S. Baluja
Nick Johnston
MQ
54
7
0
11 Jun 2019
BasisConv: A method for compressed representation and learning in CNNs
M. Tayyab
Abhijit Mahalanobis
3DPC
SSL
38
6
0
11 Jun 2019
BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget
Jack Turner
Elliot J. Crowley
Michael F. P. O'Boyle
Amos Storkey
Gavia Gray
86
38
0
10 Jun 2019
Quantification and Analysis of Layer-wise and Pixel-wise Information Discarding
Haotian Ma
Hao Zhang
Fan Zhou
Yinqing Zhang
Quanshi Zhang
FAtt
34
0
0
10 Jun 2019
Making CNNs for Video Parsing Accessible
Zijin Luo
Matthew J. Guzdial
Mark O. Riedl
54
9
0
10 Jun 2019
The Generalization-Stability Tradeoff In Neural Network Pruning
Brian Bartoldson
Ari S. Morcos
Adrian Barbu
G. Erlebacher
113
76
0
09 Jun 2019
Redundancy-Free Computation Graphs for Graph Neural Networks
Zhihao Jia
Sina Lin
Rex Ying
Jiaxuan You
J. Leskovec
Alexander Aiken
GNN
52
11
0
09 Jun 2019
Distilling Object Detectors with Fine-grained Feature Imitation
Tao Wang
Li-xin Yuan
Xiaopeng Zhang
Jiashi Feng
ObjD
72
385
0
09 Jun 2019
DiCENet: Dimension-wise Convolutions for Efficient Networks
Sachin Mehta
Hannaneh Hajishirzi
Mohammad Rastegari
101
43
0
08 Jun 2019
Fighting Quantization Bias With Bias
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
84
57
0
07 Jun 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Chu Zhou
Igor Fedorov
Ganesh S. Dasika
Matthew Mattina
121
36
0
07 Jun 2019
Uncertainty-guided Continual Learning with Bayesian Neural Networks
Sayna Ebrahimi
Mohamed Elhoseiny
Trevor Darrell
Marcus Rohrbach
CLL
BDL
70
197
0
06 Jun 2019
The Architectural Implications of Facebook's DNN-based Personalized Recommendation
Udit Gupta
Carole-Jean Wu
Xiaodong Wang
Maxim Naumov
Brandon Reagen
...
Andrey Malevich
Dheevatsa Mudigere
M. Smelyanskiy
Liang Xiong
Xuan Zhang
GNN
127
292
0
06 Jun 2019
Butterfly Transform: An Efficient FFT Based Neural Architecture Design
Keivan Alizadeh-Vahid
Anish K. Prabhu
Ali Farhadi
Mohammad Rastegari
138
50
0
05 Jun 2019
Visual Confusion Label Tree For Image Classification
Yuntao Liu
Y. Dou
Ruochun Jin
Rongchun Li
46
5
0
05 Jun 2019
OpenEI: An Open Framework for Edge Intelligence
Xingzhou Zhang
Yifan Wang
Sidi Lu
Liangkai Liu
Lanyu Xu
Weisong Shi
82
101
0
05 Jun 2019
Evaluating Scalable Bayesian Deep Learning Methods for Robust Computer Vision
Fredrik K. Gustafsson
Martin Danelljan
Thomas B. Schon
OOD
UQCV
BDL
102
302
0
04 Jun 2019
NodeDrop: A Condition for Reducing Network Size without Effect on Output
Louis Jensen
Jacob A. Harer
S. Chin
25
0
0
03 Jun 2019
Terminal Brain Damage: Exposing the Graceless Degradation in Deep Neural Networks Under Hardware Fault Attacks
Sanghyun Hong
Pietro Frigo
Yigitcan Kaya
Cristiano Giuffrida
Tudor Dumitras
AAML
77
214
0
03 Jun 2019
SpikeGrad: An ANN-equivalent Computation Model for Implementing Backpropagation with Spikes
Johannes C. Thiele
O. Bichler
A. Dupret
67
33
0
03 Jun 2019
Dimensionality compression and expansion in Deep Neural Networks
Stefano Recanatesi
M. Farrell
Madhu S. Advani
Timothy Moore
Guillaume Lajoie
E. Shea-Brown
77
74
0
02 Jun 2019
SHE: A Fast and Accurate Deep Neural Network for Encrypted Data
Qian Lou
Lei Jiang
110
124
0
01 Jun 2019
The Pupil Has Become the Master: Teacher-Student Model-Based Word Embedding Distillation with Ensemble Learning
Bonggun Shin
Hao Yang
Jinho Choi
20
12
0
31 May 2019
Learning Sparse Networks Using Targeted Dropout
Aidan Gomez
Ivan Zhang
Siddhartha Rao Kamalakara
Divyam Madaan
Kevin Swersky
Y. Gal
Geoffrey E. Hinton
115
98
0
31 May 2019
Increasing Compactness Of Deep Learning Based Speech Enhancement Models With Parameter Pruning And Quantization Techniques
Jyun-Yi Wu
Cheng Yu
Szu-Wei Fu
Chih-Ting Liu
Shao-Yi Chien
Yu Tsao
33
24
0
31 May 2019
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
138
102
0
30 May 2019
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On Microcontrollers
Manuele Rusci
Alessandro Capotondi
Luca Benini
MQ
102
75
0
30 May 2019
Robust Sparse Regularization: Simultaneously Optimizing Neural Network Robustness and Compactness
Adnan Siraj Rakin
Zhezhi He
Li Yang
Yanzhi Wang
Liqiang Wang
Deliang Fan
AAML
98
21
0
30 May 2019
Quantization Loss Re-Learning Method
Kunping Li
MQ
31
1
0
30 May 2019
Rethinking Full Connectivity in Recurrent Neural Networks
Matthijs Van Keirsbilck
A. Keller
Xiaodong Yang
LRM
41
14
0
29 May 2019
A Study of BFLOAT16 for Deep Learning Training
Dhiraj D. Kalamkar
Dheevatsa Mudigere
Naveen Mellempudi
Dipankar Das
K. Banerjee
...
Sudarshan Srinivasan
Abhisek Kundu
M. Smelyanskiy
Bharat Kaul
Pradeep Dubey
MQ
119
351
0
29 May 2019
Attention Based Pruning for Shift Networks
G. B. Hacene
Carlos Lassance
Vincent Gripon
Matthieu Courbariaux
Yoshua Bengio
103
25
0
29 May 2019
Instant Quantization of Neural Networks using Monte Carlo Methods
Gonçalo Mordido
Matthijs Van Keirsbilck
A. Keller
MQ
51
9
0
29 May 2019
Graph DNA: Deep Neighborhood Aware Graph Encoding for Collaborative Filtering
Liwei Wu
Hsiang-Fu Yu
Nikhil S. Rao
James Sharpnack
Cho-Jui Hsieh
GNN
36
10
0
29 May 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
320
18,360
0
28 May 2019
CompactNet: Platform-Aware Automatic Optimization for Convolutional Neural Networks
Weicheng Li
Rui Wang
Zhongzhi Luan
Di Huang
Zidong Du
Yunji Chen
D. Qian
27
1
0
28 May 2019
OICSR: Out-In-Channel Sparsity Regularization for Compact Deep Neural Networks
Jiashi Li
Q. Qi
Jingyu Wang
Ce Ge
Yujian Betterest Li
Zhangzhang Yue
Haifeng Sun
BDL
CML
101
53
0
28 May 2019
Brain-inspired reverse adversarial examples
Shaokai Ye
S. Tan
Kaidi Xu
Yanzhi Wang
Chenglong Bao
Kaisheng Ma
AAML
33
5
0
28 May 2019
Inference with Hybrid Bio-hardware Neural Networks
Yuan Zeng
Zubayer Ibne Ferdous
Weixian Zhang
Mufan Xu
Anlan Yu
Drew Patel
Xiaochen Guo
Y. Berdichevsky
Zhiyuan Yan
17
4
0
28 May 2019
Mixed Precision DNNs: All you need is a good parametrization
Stefan Uhlich
Lukas Mauch
Fabien Cardinaux
K. Yoshiyama
Javier Alonso García
Stephen Tiedemann
Thomas Kemp
Akira Nakamura
MQ
113
39
0
27 May 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
55
71
0
27 May 2019
Incremental Learning Using a Grow-and-Prune Paradigm with Efficient Neural Networks
Xiaoliang Dai
Hongxu Yin
N. Jha
88
32
0
27 May 2019
Shredder: Learning Noise Distributions to Protect Inference Privacy
Fatemehsadat Mireshghallah
Mohammadkazem Taram
Prakash Ramrakhyani
Dean Tullsen
H. Esmaeilzadeh
79
11
0
26 May 2019
Feature Map Transform Coding for Energy-Efficient CNN Inference
Brian Chmiel
Chaim Baskin
Ron Banner
Evgenii Zheltonozhskii
Yevgeny Yermolin
Alex Karbachevsky
A. Bronstein
A. Mendelson
101
26
0
26 May 2019
HadaNets: Flexible Quantization Strategies for Neural Networks
Yash Akhauri
MQ
43
7
0
26 May 2019
ShrinkTeaNet: Million-scale Lightweight Face Recognition via Shrinking Teacher-Student Networks
C. Duong
Khoa Luu
Kha Gia Quach
Ngan Le
CVBM
70
39
0
25 May 2019
Bayesian Tensorized Neural Networks with Automatic Rank Selection
Cole Hawkins
Zheng Zhang
BDL
63
54
0
24 May 2019
Previous
1
2
3
...
51
52
53
...
68
69
70
Next