1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,448 papers shown
Learning SMaLL Predictors
Vikas K. Garg
O. Dekel
Lin Xiao
8
3
0
06 Mar 2018
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning
Huan Yang
Baoyuan Wang
Noranart Vesdapunt
Minyi Guo
S. B. Kang
32
22
0
06 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
20
114
0
06 Mar 2018
Stochastic Activation Pruning for Robust Adversarial Defense
Guneet Singh Dhillon
Kamyar Azizzadenesheli
Zachary Chase Lipton
Jeremy Bernstein
Jean Kossaifi
Aran Khanna
Anima Anandkumar
AAML
33
545
0
05 Mar 2018
An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks
Qianxiao Li
Shuji Hao
32
75
0
04 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
34
875
0
03 Mar 2018
Scalar Quantization as Sparse Least Square Optimization
Chen Wang
Xiaomei Yang
Shaomin Fei
Kai Zhou
Xiaofeng Gong
Miao Du
Ruisen Luo
MQ
20
3
0
01 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Yichi Zhang
Zhijian Ou
27
0
0
01 Mar 2018
Compressing Neural Networks using the Variational Information Bottleneck
Bin Dai
Chen Zhu
David Wipf
MLT
28
179
0
28 Feb 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
21
25
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
24
12
0
28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
28
33
0
27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
704
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
41
4
0
26 Feb 2018
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
13
167
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
35
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D. McDonnell
MQ
41
71
0
23 Feb 2018
Approximation Algorithms for Cascading Prediction Models
Matthew J. Streeter
TPM
13
19
0
21 Feb 2018
Building Efficient ConvNets using Redundant Feature Pruning
B. Ayinde
J. Zurada
VLM
3DPC
29
47
0
21 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
21
70
0
21 Feb 2018
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
32
97
0
20 Feb 2018
DeepThin: A Self-Compressing Library for Deep Neural Networks
Matthew Sotoudeh
Sara S. Baghsorkhi
18
4
0
20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern
Jayesh K. Gupta
Mykel Kochenderfer
40
1
0
20 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
31
70
0
19 Feb 2018
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Yanzhi Wang
Caiwen Ding
Zhe Li
Geng Yuan
Siyu Liao
...
Bo Yuan
Xuehai Qian
Jian Tang
Qinru Qiu
Xinyu Lin
31
33
0
18 Feb 2018
Efficient Sparse-Winograd Convolutional Neural Networks
Xingyu Liu
Jeff Pool
Song Han
W. Dally
13
122
0
18 Feb 2018
Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet
S. H. HasanPour
Mohammad Rouhani
Mohsen Fayyaz
Mohammad Sabokrou
Ehsan Adeli
52
45
0
17 Feb 2018
Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Yipeng Zhang
Yanzhi Wang
M. Fardad
22
21
0
15 Feb 2018
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
48
718
0
15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
35
48
0
14 Feb 2018
Paraphrasing Complex Network: Network Compression via Factor Transfer
Jangho Kim
Seonguk Park
Nojun Kwak
32
543
0
14 Feb 2018
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
Haoyu Zhang
Logan Stafman
Andrew Or
M. Freedman
38
140
0
13 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
41
389
0
13 Feb 2018
Attention-Based Guided Structured Sparsity of Deep Neural Networks
A. Torfi
Rouzbeh A. Shirvani
Sobhan Soleymani
Nasser M. Nasrabadi
29
23
0
13 Feb 2018
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
Qiang Qiu
Xiuyuan Cheng
Robert Calderbank
Guillermo Sapiro
41
69
0
12 Feb 2018
ClosNets: a Priori Sparse Topologies for Faster DNN Training
Mihailo Isakov
Michel A. Kinsy
CVBM
36
0
0
12 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
24
148
0
11 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators
Jeff Zhang
Kartheek Rangineni
Zahra Ghodsi
S. Garg
36
118
0
11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator
Jeff Zhang
Tianyu Gu
K. Basu
S. Garg
14
134
0
11 Feb 2018
The Need for Speed of AI Applications: Performance Comparison of Native vs. Browser-based Algorithm Implementations
Bernd Malle
Nicola Giuliani
Peter Kieseberg
Andreas Holzinger
13
8
0
11 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
37
21
0
10 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
35
1,343
0
10 Feb 2018
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence
A. Chung
Paul Fieguth
A. Wong
24
1
0
09 Feb 2018
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Abhronil Sengupta
Yuting Ye
Robert Y. Wang
Chiao Liu
Kaushik Roy
38
989
0
07 Feb 2018
Effective Quantization Approaches for Recurrent Neural Networks
Md. Zahangir Alom
A. Moody
N. Maruyama
B. Van Essen
T. Taha
MQ
8
33
0
07 Feb 2018
CryptoRec: Privacy-preserving Recommendation as a Service
Jun Wang
Afonso Arriaga
Qiang Tang
Peter Y. A. Ryan
13
3
0
07 Feb 2018
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
86
86
0
07 Feb 2018
Digital Watermarking for Deep Neural Networks
Yuki Nagai
Yusuke Uchida
S. Sakazawa
Shin'ichi Satoh
WIGM
31
144
0
06 Feb 2018
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Ramyad Hadidi
Jiashen Cao
M. Woodward
Michael S. Ryoo
Hyesoon Kim
22
34
0
05 Feb 2018
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
46
39
0
05 Feb 2018