Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
The Loss Surface of XOR Artificial Neural Networks
D. Mehta
Xiaojun Zhao
Edgar A. Bernal
D. Wales
167
19
0
06 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
118
149
0
06 Apr 2018
Building Efficient CNN Architecture for Offline Handwritten Chinese Character Recognition
Zhiyuan Li
Nanjun Teng
Min Jin
Huaxiang Lu
39
52
0
04 Apr 2018
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models
B. Rouhani
Huili Chen
F. Koushanfar
130
48
0
02 Apr 2018
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs
Caiwen Ding
Ao Ren
Geng Yuan
Xiaolong Ma
Jiayu Li
Ning Liu
Bo Yuan
Yanzhi Wang
77
23
0
28 Mar 2018
Adversarial Network Compression
Vasileios Belagiannis
Azade Farshad
Fabio Galasso
GAN
AAML
69
58
0
28 Mar 2018
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow
Yuechao Gao
Nianhong Liu
Shenmin Zhang
98
1
0
28 Mar 2018
Incremental Training of Deep Convolutional Neural Networks
R. Istrate
A. Malossi
C. Bekas
Dimitrios S. Nikolopoulos
CLL
68
21
0
27 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Zheng Qin
Zhaoning Zhang
Dongsheng Li
Yiming Zhang
Yuxing Peng
62
28
0
27 Mar 2018
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
Chuanhao Zhuge
Xinheng Liu
Xiaofan Zhang
S. Gummadi
Jinjun Xiong
Deming Chen
CVBM
64
36
0
23 Mar 2018
Iterative Low-Rank Approximation for CNN Compression
Maksym Kholiavchenko
36
9
0
23 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
69
300
0
23 Mar 2018
Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network
Namhyuk Ahn
Byungkon Kang
Kyung-ah Sohn
SupR
123
1,131
0
23 Mar 2018
Design Principles for Sparse Matrix Multiplication on the GPU
Carl Yang
A. Buluç
John Douglas Owens
61
109
0
22 Mar 2018
Task dependent Deep LDA pruning of neural networks
Qing Tian
Tal Arbel
James J. Clark
31
0
0
21 Mar 2018
Efficient Recurrent Neural Networks using Structured Matrices in FPGAs
Zhe Li
Shuo Wang
Caiwen Ding
Qinru Qiu
Yanzhi Wang
Yun Liang
GNN
41
21
0
20 Mar 2018
Local Binary Pattern Networks
Jeng-Hau Lin
Yunfan Yang
Rajesh K. Gupta
Zhuowen Tu
MQ
50
13
0
19 Mar 2018
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
Sachin Mehta
Mohammad Rastegari
A. Caspi
Linda G. Shapiro
Hannaneh Hajishirzi
SSeg
156
784
0
19 Mar 2018
Constrained Deep Learning using Conditional Gradient and Applications in Computer Vision
Sathya Ravi
Tuan Dinh
Vishnu Suresh Lokhande
Vikas Singh
AI4CE
71
22
0
17 Mar 2018
EVA
2
^2
2
: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
126
79
0
16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
103
81
0
16 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning
Maurice Yang
Mahmoud Faraj
Assem Hussein
V. Gaudet
CVBM
69
12
0
15 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
76
185
0
15 Mar 2018
Exploring Linear Relationship in Feature Map Subspace for ConvNets Compression
Dong Wang
Lei Zhou
Xueni Zhang
Xiao Bai
Jun Zhou
80
47
0
15 Mar 2018
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs
Shuo Wang
Zhe Li
Caiwen Ding
Bo Yuan
Yanzhi Wang
Qinru Qiu
Yun Liang
61
197
0
14 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
86
97
0
13 Mar 2018
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu
Q. Lu
Yu Hu
Lin Yang
X. S. Hu
Benlin Liu
Yiyu Shi
MedIm
84
85
0
13 Mar 2018
FeTa: A DCA Pruning Algorithm with Generalization Error Guarantees
Konstantinos Pitas
Mike Davies
P. Vandergheynst
27
2
0
12 Mar 2018
ShuffleSeg: Real-time Semantic Segmentation Network
M. Gamal
Mennatullah Siam
Moemen Abdel-Razek
SSeg
68
60
0
10 Mar 2018
Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How
A. Delmas
Patrick Judd
Dylan Malone Stuart
Zissis Poulos
Mostafa Mahmoud
Sayeh Sharify
Milos Nikolic
Andreas Moshovos
59
24
0
09 Mar 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle
Michael Carbin
445
3,497
0
09 Mar 2018
High-Accuracy Low-Precision Training
Christopher De Sa
Megan Leszczynski
Jian Zhang
Alana Marzoev
Christopher R. Aberger
K. Olukotun
Christopher Ré
94
109
0
09 Mar 2018
Exponential Discriminative Metric Embedding in Deep Learning
Bowen Wu
Zhangling Chen
Jun Wang
Hua-Ming Wu
73
11
0
07 Mar 2018
Learning SMaLL Predictors
Vikas Garg
O. Dekel
Lin Xiao
76
3
0
06 Mar 2018
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning
Huan Yang
Baoyuan Wang
Noranart Vesdapunt
Minyi Guo
S. B. Kang
62
22
0
06 Mar 2018
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
70
115
0
06 Mar 2018
Stochastic Activation Pruning for Robust Adversarial Defense
Guneet Singh Dhillon
Kamyar Azizzadenesheli
Zachary Chase Lipton
Jeremy Bernstein
Jean Kossaifi
Aran Khanna
Anima Anandkumar
AAML
107
548
0
05 Mar 2018
An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks
Qianxiao Li
Shuji Hao
103
76
0
04 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
Scalar Quantization as Sparse Least Square Optimization
Chen Wang
Xiaomei Yang
Shaomin Fei
Kai Zhou
Xiaofeng Gong
Miao Du
Ruisen Luo
MQ
40
3
0
01 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Yichi Zhang
Zhijian Ou
64
0
0
01 Mar 2018
Compressing Neural Networks using the Variational Information Bottleneck
Bin Dai
Chen Zhu
David Wipf
MLT
72
182
0
28 Feb 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
109
25
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
107
12
0
28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
94
34
0
27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
91
713
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
103
4
0
26 Feb 2018
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
69
171
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
111
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
96
71
0
23 Feb 2018
Previous
1
2
3
...
62
63
64
...
68
69
70
Next