Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,448 papers shown
Title
IGCV3: Interleaved Low-Rank Group Convolutions for Efficient Deep Neural Networks
Ke Sun
Mingjie Li
Dong Liu
Jingdong Wang
48
126
0
01 Jun 2018
A Highly Parallel FPGA Implementation of Sparse Neural Network Training
Sourya Dey
Diandian Chen
Zongyang Li
Souvik Kundu
Kuan-Wen Huang
K. Chugg
Peter A. Beerel
17
11
0
31 May 2018
Fine-Pruning: Defending Against Backdooring Attacks on Deep Neural Networks
Kang Liu
Brendan Dolan-Gavitt
S. Garg
AAML
26
1,022
0
30 May 2018
Channel Gating Neural Networks
Weizhe Hua
Yuan Zhou
Christopher De Sa
Zhiru Zhang
G. E. Suh
15
180
0
29 May 2018
A novel channel pruning method for deep neural network compression
Yiming Hu
Siyang Sun
Jianquan Li
Xingang Wang
Qingyi Gu
20
67
0
29 May 2018
Retraining-Based Iterative Weight Quantization for Deep Neural Networks
Dongsoo Lee
Byeongwook Kim
MQ
36
16
0
29 May 2018
Adaptive Network Sparsification with Dependent Variational Beta-Bernoulli Dropout
Juho Lee
Saehoon Kim
Jaehong Yoon
Haebeom Lee
Eunho Yang
Sung Ju Hwang
14
12
0
28 May 2018
Constructing Fast Network through Deconstruction of Convolution
Yunho Jeon
Junmo Kim
22
71
0
28 May 2018
Compact and Computationally Efficient Representation of Deep Neural Networks
Simon Wiedemann
K. Müller
Wojciech Samek
MQ
42
67
0
27 May 2018
Accelerating CNN inference on FPGAs: A Survey
K. Abdelouahab
Maxime Pelcat
Jocelyn Serot
F. Berry
AI4CE
30
147
0
26 May 2018
Heterogeneous Bitwidth Binarization in Convolutional Neural Networks
Josh Fromm
Shwetak N. Patel
Matthai Philipose
MQ
19
27
0
25 May 2018
Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression
Jiahao Su
Jingling Li
Bobby Bhattacharjee
Furong Huang
16
20
0
25 May 2018
Scalable Methods for 8-bit Training of Neural Networks
Ron Banner
Itay Hubara
Elad Hoffer
Daniel Soudry
MQ
54
332
0
25 May 2018
Multi-Task Zipping via Layer-wise Neuron Sharing
Xiaoxi He
Zimu Zhou
Lothar Thiele
MoMe
13
61
0
24 May 2018
Learning towards Minimum Hyperspherical Energy
Weiyang Liu
Rongmei Lin
Ziqiang Liu
Lixin Liu
Zhiding Yu
Bo Dai
Le Song
30
146
0
23 May 2018
AutoPruner: An End-to-End Trainable Filter Pruning Method for Efficient Deep Model Inference
Jian-Hao Luo
Jianxin Wu
23
207
0
23 May 2018
Approximate Random Dropout
Zhuoran Song
Ru Wang
Dongyu Ru
Hongru Huang
Zhenghao Peng
Hai Zhao
Xiaoyao Liang
Li Jiang
BDL
30
9
0
23 May 2018
Echo: Compiler-based GPU Memory Footprint Reduction for LSTM RNN Training
Bojian Zheng
Abhishek Tiwari
Nandita Vijaykumar
Gennady Pekhimenko
27
44
0
22 May 2018
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
30
24
0
22 May 2018
Parsimonious Bayesian deep networks
Mingyuan Zhou
BDL
20
8
0
22 May 2018
AxTrain: Hardware-Oriented Neural Network Training for Approximate Inference
Xin He
Liu Ke
Wenyan Lu
Guihai Yan
Xuan Zhang
27
34
0
21 May 2018
Compression of Deep Convolutional Neural Networks under Joint Sparsity Constraints
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
11
6
0
21 May 2018
Faster Neural Network Training with Approximate Tensor Operations
Menachem Adelman
Kfir Y. Levy
Ido Hakimi
M. Silberstein
36
26
0
21 May 2018
Neural Network Compression using Transform Coding and Clustering
Thorsten Laude
Yannick Richter
Jörn Ostermann
18
4
0
18 May 2018
RotDCF: Decomposition of Convolutional Filters for Rotation-Equivariant Deep Networks
Xiuyuan Cheng
Qiang Qiu
Robert Calderbank
Guillermo Sapiro
30
43
0
17 May 2018
Object detection at 200 Frames Per Second
Rakesh Mehta
Cemalettin Öztürk
ObjD
37
61
0
16 May 2018
Lightweight Pyramid Networks for Image Deraining
Xueyang Fu
Borong Liang
Yue Huang
Xinghao Ding
John Paisley
18
323
0
16 May 2018
Hu-Fu: Hardware and Software Collaborative Attack Framework against Neural Networks
Wenshuo Li
Jincheng Yu
Xuefei Ning
Pengjun Wang
Qi Wei
Yu Wang
Huazhong Yang
AAML
39
61
0
14 May 2018
Unifying and Merging Well-trained Deep Neural Networks for Inference Stage
Yi-Min Chou
Yi-Ming Chan
Jia-Hong Lee
Chih-Yi Chiu
Chu-Song Chen
MoMe
35
34
0
14 May 2018
ContextNet: Exploring Context and Detail for Semantic Segmentation in Real-time
Rudra P. K. Poudel
Ujwal D. Bonde
Stephan Liwicki
Christopher Zach
SSeg
38
230
0
11 May 2018
GANAX: A Unified MIMD-SIMD Acceleration for Generative Adversarial Networks
Amir Yazdanbakhsh
Hajar Falahati
Philip J. Wolfe
K. Samadi
Nam Sung Kim
H. Esmaeilzadeh
30
71
0
10 May 2018
Boosting up Scene Text Detectors with Guided CNN
Xiaoyu Yue
Zhanghui Kuang
Zhaoyang Zhang
Zhenfang Chen
Pan He
Yu Qiao
Wayne Zhang
17
8
0
10 May 2018
Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
Charles Eckert
Xiaowei Wang
Jingcheng Wang
Arun K. Subramaniyan
R. Iyer
D. Sylvester
D. Blaauw
R. Das
MQ
13
334
0
09 May 2018
Towards Accurate and High-Speed Spiking Neuromorphic Systems with Data Quantization-Aware Deep Networks
Fuqiang Liu
Chenchen Liu
16
5
0
08 May 2018
A Hierarchical Matcher using Local Classifier Chains
Lingfeng Zhang
I. Kakadiaris
9
0
0
07 May 2018
Enhancing the Regularization Effect of Weight Pruning in Artificial Neural Networks
Brian Bartoldson
Adrian Barbu
G. Erlebacher
14
5
0
04 May 2018
Power Law in Sparsified Deep Neural Networks
Lu Hou
James T. Kwok
29
3
0
04 May 2018
Pixel-wise Attentional Gating for Parsimonious Pixel Labeling
Shu Kong
Charless C. Fowlkes
57
40
0
03 May 2018
UNIQ: Uniform Noise Injection for Non-Uniform Quantization of Neural Networks
Chaim Baskin
Eli Schwartz
Evgenii Zheltonozhskii
Natan Liss
Raja Giryes
A. Bronstein
A. Mendelson
MQ
27
45
0
29 Apr 2018
Precise Box Score: Extract More Information from Datasets to Improve the Performance of Face Detection
Ce Qi
Xiaoping Chen
Pingyu Wang
Fei Su
CVBM
16
1
0
28 Apr 2018
Low-memory convolutional neural networks through incremental depth-first processing
Jonathan Binas
Yoshua Bengio
SupR
30
3
0
28 Apr 2018
Sparse Persistent RNNs: Squeezing Large Recurrent Networks On-Chip
Feiwen Zhu
Jeff Pool
M. Andersch
J. Appleyard
Fung Xie
22
29
0
26 Apr 2018
Profile-guided memory optimization for deep neural networks
Taro Sekiyama
T. Imamichi
Haruki Imai
Raymond H. Putra
39
22
0
26 Apr 2018
Accelerator-Aware Pruning for Convolutional Neural Networks
Hyeong-Ju Kang
13
88
0
26 Apr 2018
Efficient Multi-objective Neural Architecture Search via Lamarckian Evolution
T. Elsken
J. H. Metzen
Frank Hutter
131
499
0
24 Apr 2018
Measuring the Intrinsic Dimension of Objective Landscapes
Chunyuan Li
Heerad Farkhoor
Rosanne Liu
J. Yosinski
43
401
0
24 Apr 2018
MQGrad: Reinforcement Learning of Gradient Quantization in Parameter Server
Guoxin Cui
Jun Xu
Wei Zeng
Yanyan Lan
Jiafeng Guo
Xueqi Cheng
MQ
16
13
0
22 Apr 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
30
558
0
20 Apr 2018
Minimizing Area and Energy of Deep Learning Hardware Design Using Collective Low Precision and Structured Compression
Shihui Yin
Gaurav Srivastava
S. Venkataramanaiah
C. Chakrabarti
Visar Berisha
Jae-sun Seo
25
8
0
19 Apr 2018
Pelee: A Real-Time Object Detection System on Mobile Devices
R. Wang
Xiang Li
Charles X. Ling
ObjD
30
454
0
18 Apr 2018
Previous
1
2
3
...
60
61
62
...
67
68
69
Next