Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
The Disruptions of 5G on Data-driven Technologies and Applications
Dumitrel Loghin
Shaofeng Cai
Gang Chen
Tien Tuan Anh Dinh
Feiyi Fan
...
Wei Wang
Xiaokui Xiao
Yang Yang
Meihui Zhang
Zhonghua Zhang
66
66
0
06 Sep 2019
One Size Does Not Fit All: Multi-Scale, Cascaded RNNs for Radar Classification
Dhrubojyoti Roy
S. Srivastava
Aditya Kusupati
Pranshu Jain
Manik Varma
A. Arora
91
12
0
06 Sep 2019
Training Deep Neural Networks Using Posit Number System
Jinming Lu
Siyuan Lu
Zhisheng Wang
Chao Fang
Jun Lin
Zhongfeng Wang
Li Du
MQ
57
14
0
06 Sep 2019
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices
Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
CVBM
109
180
0
06 Sep 2019
Additive function approximation in the brain
K. Harris
91
13
0
05 Sep 2019
A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGA
Mohammad Farhadi
Mehdi Ghasemi
Yezhou Yang
58
27
0
05 Sep 2019
ModiPick: SLA-aware Accuracy Optimization For Mobile Deep Inference
Samuel S. Ogden
Tian Guo
24
3
0
04 Sep 2019
What Happens on the Edge, Stays on the Edge: Toward Compressive Deep Learning
Yongqian Li
Thomas Strohmer
33
2
0
04 Sep 2019
On the Downstream Performance of Compressed Word Embeddings
Avner May
Jian Zhang
Tri Dao
Christopher Ré
80
27
0
03 Sep 2019
PSDNet and DPDNet: Efficient channel expansion, Depthwise-Pointwise-Depthwise Inverted Bottleneck Block
Guoqing Li
Meng Zhang
Qianru Zhang
Ziyang Chen
Wenzhao Liu
Jiaojie Li
Xuzhao Shen
Jian Li
Zhenyu Zhu
Chau Yuen
73
5
0
03 Sep 2019
HarDNet: A Low Memory Traffic Network
P. Chao
Chao-Yang Kao
Yunxing Ruan
Chien-Hsiang Huang
Y. Lin
259
270
0
03 Sep 2019
Touché: Towards Ideal and Efficient Cache Compression By Mitigating Tag Area Overheads
Seokin Hong
B. Abali
A. Buyuktosunoglu
Michael B. Healy
Prashant J. Nair
88
17
0
02 Sep 2019
Sparse Deep Neural Network Graph Challenge
J. Kepner
Simon Alford
V. Gadepally
Michael Jones
Lauren Milechin
Ryan A. Robinett
S. Samsi
GNN
65
49
0
02 Sep 2019
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators
Lukas Cavigelli
Georg Rutishauser
Luca Benini
MQ
67
34
0
30 Aug 2019
Smaller Models, Better Generalization
Mayank Sharma
Suraj Tripathi
Abhimanyu Dubey
Jayadeva Jayadeva
Sai Guruju
Nihal Goalla
38
1
0
29 Aug 2019
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
56
6
0
28 Aug 2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression
Genta Indra Winata
Andrea Madotto
Jamin Shin
Elham J. Barezi
Pascale Fung
64
29
0
27 Aug 2019
DeepHoyer: Learning Sparser Neural Network with Differentiable Scale-Invariant Sparsity Measures
Huanrui Yang
W. Wen
H. Li
96
99
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
223
1,288
0
26 Aug 2019
Differentiable Product Quantization for End-to-End Embedding Compression
Ting Chen
Lala Li
Yizhou Sun
MQ
55
68
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
151
843
0
25 Aug 2019
SeesawFaceNets: sparse and robust face verification model for mobile platform
Jintao Zhang
3DH
CVBM
57
9
0
24 Aug 2019
Well-Read Students Learn Better: On the Importance of Pre-training Compact Models
Iulia Turc
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
84
225
0
23 Aug 2019
Learning Filter Basis for Convolutional Neural Network Compression
Yawei Li
Shuhang Gu
Luc Van Gool
Radu Timofte
SupR
79
99
0
23 Aug 2019
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors
Royson Lee
Stylianos I. Venieris
Łukasz Dudziak
S. Bhattacharya
Nicholas D. Lane
SupR
56
96
0
21 Aug 2019
Restricted Recurrent Neural Networks
Enmao Diao
Jie Ding
Vahid Tarokh
67
21
0
21 Aug 2019
Efficient Deep Neural Networks
Bichen Wu
63
12
0
20 Aug 2019
Implicit Deep Learning
L. Ghaoui
Fangda Gu
Bertrand Travacca
Armin Askari
Alicia Y. Tsai
AI4CE
79
182
0
17 Aug 2019
Improved Techniques for Training Adaptive Deep Networks
Hao Li
Hong Zhang
Xiaojuan Qi
Ruigang Yang
Gao Huang
78
135
0
17 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DV
VLM
AI4TS
100
212
0
16 Aug 2019
Accelerated CNN Training Through Gradient Approximation
Ziheng Wang
Sree Harsha Nelaturu
366
5
0
15 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
128
460
0
14 Aug 2019
Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By Lookup Tables
Hongxin Lin
Zelin Xiao
Yang Tan
Hongyang Chao
Shengyong Ding
3DPC
57
22
0
14 Aug 2019
Neural Plasticity Networks
Yongqian Li
Shihao Ji
30
1
0
13 Aug 2019
Adversarial Neural Pruning with Latent Vulnerability Suppression
Divyam Madaan
Jinwoo Shin
Sung Ju Hwang
AAML
45
3
0
12 Aug 2019
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
Xiaohan Ding
Yuchen Guo
Guiguang Ding
Jiawei Han
98
680
0
11 Aug 2019
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
Bohan Zhuang
Jing Liu
Mingkui Tan
Lingqiao Liu
Ian Reid
Chunhua Shen
MQ
107
46
0
10 Aug 2019
Recent Advances in Deep Learning for Object Detection
Xiongwei Wu
Doyen Sahoo
Guosheng Lin
VLM
ObjD
135
824
0
10 Aug 2019
Group Pruning using a Bounded-Lp norm for Group Gating and Regularization
Chaithanya Kumar Mummadi
Tim Genewein
Dan Zhang
Thomas Brox
Volker Fischer
53
3
0
09 Aug 2019
Efficient Inference of CNNs via Channel Pruning
Boyu Zhang
A. Davoodi
Y. Hu
CVBM
23
6
0
08 Aug 2019
Exploiting Channel Similarity for Accelerating Deep Convolutional Neural Networks
Yunxiang Zhang
Chenglong Zhao
Bingbing Ni
Jian Zhang
Haoran Deng
50
2
0
06 Aug 2019
Full-Stack Filters to Build Minimum Viable CNNs
Kai Han
Yunhe Wang
Yixing Xu
Chunjing Xu
Dacheng Tao
Chang Xu
MQ
39
3
0
06 Aug 2019
Knowledge Consistency between Neural Networks and Beyond
Ruofan Liang
Tianlin Li
Longfei Li
Jingchao Wang
Quanshi Zhang
86
28
0
05 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
65
3
0
05 Aug 2019
Distilling Knowledge From a Deep Pose Regressor Network
Muhamad Risqi U. Saputra
Pedro Porto Buarque de Gusmão
Yasin Almalioglu
Andrew Markham
A. Trigoni
94
103
0
02 Aug 2019
Distributed Deep Convolutional Neural Networks for the Internet-of-Things
Simone Disabato
M. Roveri
Cesare Alippi
64
50
0
02 Aug 2019
Deep Task-Based Quantization
Nir Shlezinger
Yonina C. Eldar
MQ
53
61
0
01 Aug 2019
Accelerating CNN Training by Pruning Activation Gradients
Xucheng Ye
Pengcheng Dai
Junyu Luo
Xin Guo
Weisheng Zhao
Jianlei Yang
Yiran Chen
23
2
0
01 Aug 2019
Machine Learning at the Network Edge: A Survey
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
98
393
0
31 Jul 2019
Tuning Algorithms and Generators for Efficient Edge Inference
R. Naous
Lazar Supic
Yoonhwan Kang
Ranko Seradejovic
Anish Singhani
Vladimir M. Stojanović
23
2
0
31 Jul 2019
Previous
1
2
3
...
49
50
51
...
68
69
70
Next