Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Resource-Efficient Deep Learning: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques
JunKyu Lee
L. Mukhanov
A. S. Molahosseini
U. Minhas
Yang Hua
Jesus Martinez del Rincon
K. Dichev
Cheol-Ho Hong
Hans Vandierendonck
91
31
0
30 Dec 2021
Automatic Mixed-Precision Quantization Search of BERT
Changsheng Zhao
Ting Hua
Yilin Shen
Qian Lou
Hongxia Jin
MQ
58
22
0
30 Dec 2021
EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate
Xiaonan Nie
Xupeng Miao
Shijie Cao
Lingxiao Ma
Qibin Liu
Jilong Xue
Youshan Miao
Yi Liu
Zhi-Xin Yang
Tengjiao Wang
MoMe
MoE
101
24
0
29 Dec 2021
BMPQ: Bit-Gradient Sensitivity Driven Mixed-Precision Quantization of DNNs from Scratch
Souvik Kundu
Shikai Wang
Qirui Sun
Peter A. Beerel
Massoud Pedram
MQ
81
18
0
24 Dec 2021
Training Quantized Deep Neural Networks via Cooperative Coevolution
Fu Peng
Shengcai Liu
Ning Lu
Ke Tang
MQ
80
1
0
23 Dec 2021
Implicit Neural Video Compression
Yunfan Zhang
T. V. Rozendaal
Johann Brehmer
Markus Nagel
Taco S. Cohen
119
58
0
21 Dec 2021
Compact Multi-level Sparse Neural Networks with Input Independent Dynamic Rerouting
Minghai Qin
Tianyun Zhang
Fei Sun
Yen-kuang Chen
M. Fardad
Yanzhi Wang
Yuan Xie
100
0
0
21 Dec 2021
Load-balanced Gather-scatter Patterns for Sparse Deep Neural Networks
Fei Sun
Minghai Qin
Tianyun Zhang
Xiaolong Ma
Haoran Li
Junwen Luo
Zihao Zhao
Yen-kuang Chen
Yuan Xie
61
1
0
20 Dec 2021
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision
Rui Han
Qinglong Zhang
C. Liu
Guoren Wang
Jian Tang
L. Chen
75
45
0
18 Dec 2021
Federated Dynamic Sparse Training: Computing Less, Communicating Less, Yet Learning Better
Sameer Bibikar
H. Vikalo
Zhangyang Wang
Xiaohan Chen
FedML
93
106
0
18 Dec 2021
Automated Deep Learning: Neural Architecture Search Is Not the End
Xuanyi Dong
D. Kedziora
Katarzyna Musial
Bogdan Gabrys
128
27
0
16 Dec 2021
Implementation of a Binary Neural Network on a Passive Array of Magnetic Tunnel Junctions
Jonathan M. Goodwill
N. Prasad
Brian D. Hoskins
M. Daniels
A. Madhavan
...
M. Tran
J. Katine
Patrick M. Braganca
M. D. Stiles
Jabez J. McClelland
83
10
0
16 Dec 2021
Feature Distillation Interaction Weighting Network for Lightweight Image Super-Resolution
Guangwei Gao
Wenjie Li
Juncheng Li
Leilei Gan
Huimin Lu
Yi Yu
101
86
0
16 Dec 2021
Pruning Coherent Integrated Photonic Neural Networks Using the Lottery Ticket Hypothesis
Sanmitra Banerjee
Mahdi Nikdast
S. Pasricha
Krishnendu Chakrabarty
68
10
0
14 Dec 2021
An Interpretive Constrained Linear Model for ResNet and MgNet
Juncai He
Jinchao Xu
Lian Zhang
Jianqing Zhu
88
18
0
14 Dec 2021
SNF: Filter Pruning via Searching the Proper Number of Filters
Pengkun Liu
Yaru Yue
Yanjun Guo
Xingxiang Tao
Xiaoguang Zhou
3DPC
62
0
0
14 Dec 2021
Synapse Compression for Event-Based Convolutional-Neural-Network Accelerators
Lennart Bamberg
Arash Pourtaherian
Luc Waeijen
A. Chahar
Orlando Moreira
92
5
0
13 Dec 2021
DGL-GAN: Discriminator Guided Learning for GAN Compression
Yuesong Tian
Li Shen
Xiang Tian
Dacheng Tao
Zhifeng Li
Wei Liu
Yao-wu Chen
126
0
0
13 Dec 2021
Programming with Neural Surrogates of Programs
Alex Renda
Yi Ding
Michael Carbin
52
4
0
12 Dec 2021
Automated Customization of On-Thing Inference for Quality-of-Experience Enhancement
Yang Bai
Lixing Chen
Shaolei Ren
Jie Xu
72
0
0
11 Dec 2021
Effective dimension of machine learning models
Amira Abbas
David Sutter
Alessio Figalli
Stefan Woerner
135
18
0
09 Dec 2021
Implicit Neural Representations for Image Compression
Yannick Strümpler
Janis Postels
Ren Yang
Luc van Gool
F. Tombari
105
165
0
08 Dec 2021
Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
Bo-Shiuan Chu
Che-Rung Lee
58
11
0
07 Dec 2021
i-SpaSP: Structured Neural Pruning via Sparse Signal Recovery
Cameron R. Wolfe
Anastasios Kyrillidis
66
1
0
07 Dec 2021
Manas: Mining Software Repositories to Assist AutoML
Giang Nguyen
Johir Islam
Rangeet Pan
Hridesh Rajan
67
15
0
06 Dec 2021
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics
Prasen Kumar Sharma
Arun Abraham
V. N. Rajendiran
MQ
114
8
0
06 Dec 2021
Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications
Yongqiang Tian
Wuqi Zhang
Ming Wen
Shing-Chi Cheung
Chengnian Sun
Shiqing Ma
Yu Jiang
92
7
0
06 Dec 2021
Intrinisic Gradient Compression for Federated Learning
Luke Melas-Kyriazi
Franklyn Wang
FedML
25
3
0
05 Dec 2021
Boosting Mobile CNN Inference through Semantic Memory
Yun Li
Chen Zhang
S. Han
Li Zhang
B. Yin
Yunxin Liu
Mengwei Xu
ObjD
84
16
0
05 Dec 2021
Communication and Energy Efficient Slimmable Federated Learning via Superposition Coding and Successive Decoding
Hankyul Baek
Won Joon Yun
Soyi Jung
Jihong Park
Mingyue Ji
Joongheon Kim
M. Bennis
104
1
0
05 Dec 2021
Joint Superposition Coding and Training for Federated Learning over Multi-Width Neural Networks
Hankyul Baek
Won Joon Yun
Yunseok Kwak
Soyi Jung
Mingyue Ji
M. Bennis
Jihong Park
Joongheon Kim
FedML
114
22
0
05 Dec 2021
Orientation Aware Weapons Detection In Visual Data : A Benchmark Dataset
Nazeef Ul Haq
M. Fraz
Tufail Sajjad Shah Hashmi
Muhammad Shahzad
35
10
0
04 Dec 2021
Understanding Performance Problems in Deep Learning Systems
Junming Cao
Bihuan Chen
Chao Sun
Longjie Hu
Shuai Wu
Xin Peng
83
30
0
03 Dec 2021
Challenges and Opportunities in Approximate Bayesian Deep Learning for Intelligent IoT Systems
Meet P. Vadera
Benjamin M. Marlin
UQCV
BDL
56
5
0
03 Dec 2021
Equal Bits: Enforcing Equally Distributed Binary Network Weights
Yun-qiang Li
S. Pintea
Jan van Gemert
MQ
97
15
0
02 Dec 2021
Optimizing for In-memory Deep Learning with Emerging Memory Technology
Zhehui Wang
Yaoyu Zhang
Rick Siow Mong Goh
Wei Zhang
Weng-Fai Wong
52
1
0
01 Dec 2021
Training BatchNorm Only in Neural Architecture Search and Beyond
Yichen Zhu
Jie Du
Yuqin Zhu
Yi Wang
Zhicai Ou
Feifei Feng
Jian Tang
86
1
0
01 Dec 2021
Pixelated Butterfly: Simple and Efficient Sparse training for Neural Network Models
Tri Dao
Beidi Chen
Kaizhao Liang
Jiaming Yang
Zhao Song
Atri Rudra
Christopher Ré
144
79
0
30 Nov 2021
TinyML Platforms Benchmarking
Anas Osman
Usman Abid
Luca Gemma
Matteo Perotto
Davide Brunelli
ELM
66
16
0
30 Nov 2021
A Highly Effective Low-Rank Compression of Deep Neural Networks with Modified Beam-Search and Modified Stable Rank
Moonjung Eo
Suhyun Kang
Wonjong Rhee
71
1
0
30 Nov 2021
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
Zechun Liu
Kwang-Ting Cheng
Dong Huang
Eric P. Xing
Zhiqiang Shen
MQ
91
111
0
29 Nov 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
109
15
0
29 Nov 2021
Low-bit Quantization of Recurrent Neural Network Language Models Using Alternating Direction Methods of Multipliers
Junhao Xu
Xie Chen
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen Meng
MQ
60
9
0
29 Nov 2021
Improved Knowledge Distillation via Adversarial Collaboration
Zhiqiang Liu
Chengkai Huang
Yanxia Liu
63
2
0
29 Nov 2021
FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
Yang Lin
Tianyu Zhang
Peiqin Sun
Zheng Li
Shuchang Zhou
ViT
MQ
129
157
0
27 Nov 2021
Learning from learning machines: a new generation of AI technology to meet the needs of science
L. Pion-Tonachini
K. Bouchard
Héctor García Martín
S. Peisert
W. B. Holtz
...
Rick L. Stevens
Mark Anderson
Ken Kreutz-Delgado
Michael W. Mahoney
James B. Brown
75
8
0
27 Nov 2021
Hidden-Fold Networks: Random Recurrent Residuals Using Sparse Supermasks
Ángel López García-Arias
Masanori Hashimoto
Masato Motomura
Jaehoon Yu
66
5
0
24 Nov 2021
Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression
Yuezhou Sun
Wenlong Zhao
Lijun Zhang
Xiao Liu
Hui Guan
Matei A. Zaharia
89
0
0
19 Nov 2021
DICE: Leveraging Sparsification for Out-of-Distribution Detection
Yiyou Sun
Yixuan Li
OODD
168
163
0
18 Nov 2021
COMET: A Novel Memory-Efficient Deep Learning Training Framework by Using Error-Bounded Lossy Compression
Sian Jin
Chengming Zhang
Xintong Jiang
Yunhe Feng
Hui Guan
Guanpeng Li
Shuaiwen Leon Song
Dingwen Tao
46
25
0
18 Nov 2021
Previous
1
2
3
...
25
26
27
...
68
69
70
Next