1510.00149
Cited By
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
MoBiNet: A Mobile Binary Network for Image Classification
Hai T. Phan
Dang T. Huynh
Yihui He
Marios Savvides
Zhiqiang Shen
MQ
85
49
0
29 Jul 2019
Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference
Kai Yang
Yuanming Shi
Wei Yu
Z. Ding
46
42
0
29 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
111
98
0
27 Jul 2019
Learning Instance-wise Sparsity for Accelerating Deep Models
Chuanjian Liu
Yunhe Wang
Kai Han
Chunjing Xu
Chang Xu
77
27
0
27 Jul 2019
Memory- and Communication-Aware Model Compression for Distributed Deep Learning Inference on IoT
Kartikeya Bhardwaj
Chingyi Lin
A. L. Sartor
R. Marculescu
GNN
66
54
0
26 Jul 2019
Exploiting the Redundancy in Convolutional Filters for Parameter Reduction
Kumara Kahatapitiya
Ranga Rodrigo
25
0
0
26 Jul 2019
Co-Evolutionary Compression for Unpaired Image Translation
Han Shu
Yunhe Wang
Xu Jia
Kai Han
Hanting Chen
Chunjing Xu
Qi Tian
Chang Xu
52
87
0
25 Jul 2019
Benchmarking TPU, GPU, and CPU Platforms for Deep Learning
Y. Wang
Gu-Yeon Wei
David Brooks
ELM
VLM
104
278
0
24 Jul 2019
Distilled Siamese Networks for Visual Tracking
Jianbing Shen
Yuanpei Liu
Xingping Dong
Xiankai Lu
Fahad Shahbaz Khan
Guosheng Lin
110
104
0
24 Jul 2019
Recurrent Neural Networks: An Embedded Computing Perspective
Nesma M. Rezk
M. Purnaprajna
Tomas Nordstrom
Z. Ul-Abdin
129
82
0
23 Jul 2019
RRNet: Repetition-Reduction Network for Energy Efficient Decoder of Depth Estimation
Sangyun Oh
Hye-Jin S. Kim
Jongeun Lee
Junmo Kim
3DV
OffRL
84
1
0
23 Jul 2019
Similarity-Preserving Knowledge Distillation
Frederick Tung
Greg Mori
156
987
0
23 Jul 2019
Highlight Every Step: Knowledge Distillation via Collaborative Teaching
Haoran Zhao
Changrui Chen
Junyu Dong
Xin Sun
Zihe Dong
82
59
0
23 Jul 2019
MemNet: Memory-Efficiency Guided Neural Architecture Search with Augment-Trim learning
Peiye Liu
Bo Wu
Huadong Ma
Mingoo Seok
76
6
0
22 Jul 2019
EnSyth: A Pruning Approach to Synthesis of Deep Learning Ensembles
Besher Alhalabi
M. Gaber
S. Basurra
26
3
0
22 Jul 2019
Open DNN Box by Power Side-Channel Attack
Yun Xiang
Zhuangzhi Chen
Zuohui Chen
Zebin Fang
Haiyang Hao
Jinyin Chen
Yi Liu
Zhefu Wu
Qi Xuan
Xiaoniu Yang
AAML
72
90
0
21 Jul 2019
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Xiaofei Wang
Yiwen Han
Victor C. M. Leung
Dusit Niyato
Xueqiang Yan
Xu Chen
113
1,006
0
19 Jul 2019
Light Multi-segment Activation for Model Compression
Zhenhui Xu
Guolin Ke
Jia Zhang
Jiang Bian
Tie-Yan Liu
28
2
0
16 Jul 2019
An Inter-Layer Weight Prediction and Quantization for Deep Neural Networks based on a Smoothly Varying Weight Hypothesis
Kang-Ho Lee
Joonhyun Jeong
Sung-Ho Bae
72
4
0
16 Jul 2019
What does it mean to understand a neural network?
Timothy Lillicrap
Konrad Paul Kording
60
43
0
15 Jul 2019
Bringing Giant Neural Networks Down to Earth with Unlabeled Data
Yehui Tang
Shan You
Chang Xu
Boxin Shi
Chao Xu
85
11
0
13 Jul 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
137
149
0
12 Jul 2019
Neural Epitome Search for Architecture-Agnostic Network Compression
Daquan Zhou
Xiaojie Jin
Qibin Hou
Kaixin Wang
Jianchao Yang
Jiashi Feng
109
13
0
12 Jul 2019
Reinforcement Learning with Chromatic Networks for Compact Architecture Search
Xingyou Song
K. Choromanski
Jack Parker-Holder
Yunhao Tang
Wenbo Gao
Aldo Pacchiano
Tamás Sarlós
Deepali Jain
Yuxiang Yang
50
1
0
10 Jul 2019
Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference
Yue Wang
Jianghao Shen
Ting-Kuei Hu
Pengfei Xu
T. Nguyen
Richard Baraniuk
Zhangyang Wang
Yingyan Lin
91
77
0
10 Jul 2019
Preferences Prediction using a Gallery of Mobile Device based on Scene Recognition and Object Detection
Andrey V. Savchenko
K. Demochkin
I. Grechikhin
54
23
0
10 Jul 2019
Data-Independent Neural Pruning via Coresets
Ben Mussay
Margarita Osadchy
Vladimir Braverman
Samson Zhou
Dan Feldman
104
60
0
09 Jul 2019
A Targeted Acceleration and Compression Framework for Low bit Neural Networks
Biao Qian
Yang Wang
MQ
68
0
0
09 Jul 2019
Point-Voxel CNN for Efficient 3D Deep Learning
Zhijian Liu
Haotian Tang
Chengyue Wu
Song Han
3DPC
201
678
0
08 Jul 2019
Exploiting Prunability for Person Re-Identification
Hugo Masson
Amran Bhuiyan
Le Thanh Nguyen-Meidine
Mehrsan Javan
P. Siva
Ismail Ben Ayed
Eric Granger
67
9
0
04 Jul 2019
A Unified Optimization Approach for CNN Model Inference on Integrated GPUs
Leyuan Wang
Zhi Chen
Yizhi Liu
Yao Wang
Lianmin Zheng
Mu Li
Yida Wang
91
30
0
03 Jul 2019
Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?
Xiaolong Ma
Sheng Lin
Shaokai Ye
Zhezhi He
Linfeng Zhang
...
Deliang Fan
Xuehai Qian
Xinyu Lin
Kaisheng Ma
Yanzhi Wang
MQ
132
93
0
03 Jul 2019
Neuron ranking -- an informed way to condense convolutional neural networks architecture
Kamil Adamczewski
Mijung Park
FAtt
32
2
0
03 Jul 2019
Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization
Dimitrios Stamoulis
Ruizhou Ding
Di Wang
Dimitrios Lymberopoulos
B. Priyantha
Jie Liu
Diana Marculescu
67
34
0
01 Jul 2019
SLSNet: Skin lesion segmentation using a lightweight generative adversarial network
Md. Mostafa Kamal Sarker
Hatem A. Rashwan
Farhan Akram
V. Singh
Syeda Furruka Banu
...
Kabir Ahmed Choudhury
Sylvie Chambon
Petia Radeva
D. Puig
M. Abdel-Nasser
GAN
MedIm
71
28
0
01 Jul 2019
One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers
Matthew Halpern
Behzad Boroujerdian
Todd W. Mummert
Evelyn Duesterwald
Vijay Janapa Reddi
49
25
0
26 Jun 2019
New pointwise convolution in Deep Neural Networks through Extremely Fast and Non Parametric Transforms
Joonhyun Jeong
Sung-Ho Bae
62
1
0
25 Jun 2019
COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning
Wenxiao Wang
Cong Fu
Jishun Guo
Deng Cai
Xiaofei He
VLM
67
72
0
25 Jun 2019
Smaller Text Classifiers with Discriminative Cluster Embeddings
Mingda Chen
Kevin Gimpel
35
6
0
23 Jun 2019
Filter Early, Match Late: Improving Network-Based Visual Place Recognition
Stephen Hausler
A. Jacobson
Michael Milford
54
17
0
21 Jun 2019
Progressive Gradient Pruning for Classification, Detection and Domain Adaptation
Le Thanh Nguyen-Meidine
Eric Granger
M. Kiran
Louis-Antoine Blais-Morin
M. Pedersoli
VLM
40
3
0
20 Jun 2019
GAN-Knowledge Distillation for one-stage Object Detection
Wanwei Wang
Jin ke Yu Fan Zong
ObjD
55
29
0
20 Jun 2019
Back to Simplicity: How to Train Accurate BNNs from Scratch?
Joseph Bethge
Haojin Yang
Marvin Bornstein
Christoph Meinel
AAML
MQ
69
58
0
19 Jun 2019
ADA-Tucker: Compressing Deep Neural Networks via Adaptive Dimension Adjustment Tucker Decomposition
Zhisheng Zhong
Fangyin Wei
Zhouchen Lin
Chao Zhang
62
29
0
18 Jun 2019
A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks
Dong Wang
Lei Zhou
Xiao Bai
Jun Zhou
28
2
0
18 Jun 2019
Structured Pruning of Recurrent Neural Networks through Neuron Selection
Liangjiang Wen
Xuanyang Zhang
Haoli Bai
Zenglin Xu
72
38
0
17 Jun 2019
Equivariant neural networks and equivarification
Erkao Bao
Linqi Song
64
15
0
16 Jun 2019
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias
Stéphane d'Ascoli
Levent Sagun
Joan Bruna
Giulio Biroli
98
37
0
16 Jun 2019
Scalable Model Compression by Entropy Penalized Reparameterization
Deniz Oktay
Johannes Ballé
Saurabh Singh
Abhinav Shrivastava
84
43
0
15 Jun 2019
A Signal Propagation Perspective for Pruning Neural Networks at Initialization
Namhoon Lee
Thalaiyasingam Ajanthan
Stephen Gould
Philip Torr
AAML
82
156
0
14 Jun 2019