Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,448 papers shown
Title
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
35
1,115
0
23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
52
984
0
22 May 2017
Structural Compression of Convolutional Neural Networks
R. Abbasi-Asl
Bin-Xia Yu
33
16
0
20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise
Kirill Neklyudov
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
22
188
0
20 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
35
75
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
21
13
0
19 May 2017
Building effective deep neural network architectures one feature at a time
Martin Mundt
Tobias Weis
K. Konda
Visvanathan Ramesh
27
1
0
18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
Xuefeng Xiao
Yafeng Yang
Tasweer Ahmad
Lianwen Jin
Tianhai Chang
34
21
0
15 May 2017
Incremental Learning Through Deep Adaptation
Amir Rosenfeld
John K. Tsotsos
CLL
19
276
0
11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
29
36
0
04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
33
176
0
03 May 2017
Image reconstruction by domain transform manifold learning
Bo Zhu
Jeremiah Zhe Liu
Bruce Rosen
Matthew S. Rosen
34
1,518
0
28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
34
1,403
0
27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
Shaoshuai Shi
Xuming Hu
29
43
0
25 Apr 2017
Accurate Optical Flow via Direct Cost Volume Processing
Jia Xu
René Ranftl
V. Koltun
21
238
0
24 Apr 2017
A Review on Deep Learning Techniques Applied to Semantic Segmentation
Alberto Garcia-Garcia
Sergio Orts
Sergiu Oprea
Victor Villena-Martinez
Jose Garcia-Rodriguez
3DV
SSeg
34
1,270
0
22 Apr 2017
Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
Erich Elsen
G. Diamos
Shubho Sengupta
21
308
0
17 Apr 2017
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,613
0
17 Apr 2017
Enabling Embedded Inference Engine with ARM Compute Library: A Case Study
Dawei Sun
Shaoshan Liu
J. Gaudiot
8
13
0
12 Apr 2017
DyVEDeep: Dynamic Variable Effort Deep Neural Networks
Sanjay Ganapathy
Swagath Venkataramani
Balaraman Ravindran
A. Raghunathan
27
8
0
04 Apr 2017
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
E. Agustsson
Fabian Mentzer
Michael Tschannen
Lukas Cavigelli
Radu Timofte
Luca Benini
Luc Van Gool
MQ
24
480
0
03 Apr 2017
Multi-Scale Dense Networks for Resource Efficient Image Classification
Gao Huang
Danlu Chen
Tianhong Li
Felix Wu
Laurens van der Maaten
Kilian Q. Weinberger
VLM
24
137
0
29 Mar 2017
Coordinating Filters for Faster Deep Neural Networks
W. Wen
Cong Xu
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
16
138
0
28 Mar 2017
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
59
2,996
0
27 Mar 2017
More is Less: A More Complicated Network with Less Inference Complexity
Xuanyi Dong
Junshi Huang
Yi Yang
Shuicheng Yan
26
288
0
25 Mar 2017
Quality Resilient Deep Neural Networks
Samuel F. Dodge
Lina Karam
OOD
13
46
0
23 Mar 2017
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
Chunpeng Wu
W. Wen
Tariq Afzal
Yongmei Zhang
Yiran Chen
Hai Helen Li
32
46
0
12 Mar 2017
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations
Liangzhen Lai
Naveen Suda
Vikas Chandra
MQ
33
85
0
08 Mar 2017
NoScope: Optimizing Neural Network Queries over Video at Scale
Daniel Kang
John Emmons
Firas Abuzaid
Peter Bailis
Matei A. Zaharia
29
205
0
07 Mar 2017
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
Liang Zhao
Siyu Liao
Yanzhi Wang
Zhe Li
Jian Tang
Victor Pan
Bo Yuan
33
61
0
01 Mar 2017
ShaResNet: reducing residual network parameter number by sharing weights
Alexandre Boulch
32
26
0
28 Feb 2017
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
93
3,650
0
28 Feb 2017
Memory-Efficient Global Refinement of Decision-Tree Ensembles and its Application to Face Alignment
Nenad Markuš
Ivan Gogić
Igor S. Pandzic
Jörgen Ahlberg
CVBM
26
1
0
27 Feb 2017
Adaptive Ensemble Prediction for Deep Neural Networks based on Confidence Level
H. Inoue
UQCV
FedML
16
1
0
27 Feb 2017
Low-Precision Batch-Normalized Activations
Benjamin Graham
MQ
27
9
0
27 Feb 2017
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
32
34
0
27 Feb 2017
Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition
Xuefeng Xiao
Lianwen Jin
Yafeng Yang
Weixin Yang
Jun Sun
Tianhai Chang
21
153
0
26 Feb 2017
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent
Fengan Li
Lingjiao Chen
Yijing Zeng
Arun Kumar
Jeffrey F. Naughton
J. Patel
Xi Wu
26
19
0
22 Feb 2017
The Power of Sparsity in Convolutional Neural Networks
Soravit Changpinyo
Mark Sandler
A. Zhmoginov
22
132
0
21 Feb 2017
Soft Weight-Sharing for Neural Network Compression
Karen Ullrich
Edward Meeds
Max Welling
37
412
0
13 Feb 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
337
1,049
0
10 Feb 2017
Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
Ye Zhang
Matthew Lease
Byron C. Wallace
21
15
0
08 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
50
503
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
106
1,505
0
25 Jan 2017
Variational Dropout Sparsifies Deep Neural Networks
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
17
820
0
19 Jan 2017
Compression of Deep Neural Networks for Image Instance Retrieval
V. Chandrasekhar
Jie Lin
Q. Liao
Olivier Morère
D. Shapiro
Lingyu Duan
Tomaso Poggio
33
25
0
18 Jan 2017
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning
Aditya Sharma
Nikolas Wolfe
Bhiksha Raj
18
18
0
16 Jan 2017
Embedding Watermarks into Deep Neural Networks
Yusuke Uchida
Yuki Nagai
S. Sakazawa
Shiníchi Satoh
62
598
0
15 Jan 2017
Scaling Binarized Neural Networks on Reconfigurable Logic
Nicholas J. Fraser
Yaman Umuroglu
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
20
57
0
12 Jan 2017
QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures
Tapabrata Ghosh
19
6
0
09 Jan 2017
Previous
1
2
3
...
66
67
68
69
Next