1510.00149
Cited By
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
ERNet Family: Hardware-Oriented CNN Models for Computational Imaging Using Block-Based Inference
Chao-Tsung Huang
48
5
0
13 Oct 2019
eCNN: A Block-Based and Highly-Parallel CNN Accelerator for Edge Inference
Chao-Tsung Huang
Yu-Chun Ding
Huan-Ching Wang
Chi-Wen Weng
Kai-Ping Lin
Li-Wei Wang
Li-De Chen
77
44
0
13 Oct 2019
JSDoop and TensorFlow.js: Volunteer Distributed Web Browser-Based Neural Network Training
José Á. Morell
Andrés Camero
Enrique Alba
63
9
0
12 Oct 2019
EDEN: Enabling Energy-Efficient, High-Performance Deep Neural Network Inference Using Approximate DRAM
Skanda Koppula
Lois Orosa
A. G. Yaglikçi
Roknoddin Azizi
Taha Shahroodi
Konstantinos Kanellopoulos
O. Mutlu
82
108
0
12 Oct 2019
SiPPing Neural Networks: Sensitivity-informed Provable Pruning of Neural Networks
Cenk Baykal
Lucas Liebenwein
Igor Gilitschenski
Dan Feldman
Daniela Rus
92
18
0
11 Oct 2019
Noise as a Resource for Learning in Knowledge Distillation
Elahe Arani
F. Sarfraz
Bahram Zonooz
64
6
0
11 Oct 2019
Structured Pruning of Large Language Models
Ziheng Wang
Jeremy Wohlwend
Tao Lei
98
293
0
10 Oct 2019
Knowledge Distillation from Internal Representations
Gustavo Aguilar
Yuan Ling
Yu Zhang
Benjamin Yao
Xing Fan
Edward Guo
110
181
0
08 Oct 2019
Differentiable Sparsification for Deep Neural Networks
Yongjin Lee
87
7
0
08 Oct 2019
Energy-Aware Neural Architecture Optimization with Fast Splitting Steepest Descent
Dilin Wang
Meng Li
Lemeng Wu
Vikas Chandra
Qiang Liu
111
21
0
07 Oct 2019
Deep Neural Network Compression for Image Classification and Object Detection
Georgios Tzelepis
A. Asif
Saimir Baci
Selçuk Çavdar
E. Aksoy
64
13
0
07 Oct 2019
Splitting Steepest Descent for Growing Neural Architectures
Qiang Liu
Lemeng Wu
Dilin Wang
113
63
0
06 Oct 2019
Distilling BERT into Simple Neural Networks with Unlabeled Transfer Data
Subhabrata Mukherjee
Ahmed Hassan Awadallah
93
25
0
04 Oct 2019
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
85
634
0
04 Oct 2019
SAFA: a Semi-Asynchronous Protocol for Fast Federated Learning with Low Overhead
A. Masullo
Ligang He
Toby Perrett
Rui Mao
Carsten Maple
Majid Mirmehdi
117
319
0
03 Oct 2019
On the Efficacy of Knowledge Distillation
Ligang He
Rui Mao
103
622
0
03 Oct 2019
Piracy Resistant Watermarks for Deep Neural Networks
Huiying Li
Emily Willson
Shawn Shan
Bing Ye
Shehroz S. Khan
88
26
0
02 Oct 2019
AntMan: Sparse Low-Rank Compression to Accelerate RNN inference
Samyam Rajbhandari
H. Shrivastava
J. Rho
MQ
57
8
0
02 Oct 2019
Neural networks on microcontrollers: saving memory at inference via operator reordering
Edgar Liberis
Nicholas D. Lane
66
46
0
02 Oct 2019
XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat
Georgios Tzimiropoulos
MQ
92
205
0
30 Sep 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
55
90
0
29 Sep 2019
AdaptivFloat: A Floating-point based Data Type for Resilient Deep Learning Inference
Thierry Tambe
En-Yu Yang
Zishen Wan
Yuntian Deng
Vijay Janapa Reddi
Alexander M. Rush
David Brooks
Gu-Yeon Wei
MQ
69
21
0
29 Sep 2019
Learning Efficient Convolutional Networks through Irregular Convolutional Kernels
Weiyu Guo
Jiabin Ma
Liang Wang
Yongzhen Huang
28
5
0
29 Sep 2019
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks
Yuhang Li
Xin Dong
Wei Wang
MQ
86
260
0
28 Sep 2019
Training convolutional neural networks with cheap convolutions and online distillation
Jiao Xie
Shaohui Lin
Yichen Zhang
Linkai Luo
65
12
0
28 Sep 2019
A Dual Camera System for High Spatiotemporal Resolution Video Acquisition
Ming Cheng
Zhan Ma
M. Salman Asif
Yiling Xu
Haojie Liu
Wenbo Bao
Jun Sun
65
21
0
28 Sep 2019
Training-Free Uncertainty Estimation for Dense Regression: Sensitivity as a Surrogate
Lu Mi
Hao Wang
Yonglong Tian
Hao He
Nir Shavit
UQCV
62
32
0
28 Sep 2019
Robust Membership Encoding: Inference Attacks and Copyright Protection for Deep Learning
Congzheng Song
Reza Shokri
MIACV
33
5
0
27 Sep 2019
MutualNet: Adaptive ConvNet via Mutual Learning from Network Width and Resolution
Taojiannan Yang
Sijie Zhu
Chong Chen
Shen Yan
Mi Zhang
Andrew Willis
OOD
93
75
0
27 Sep 2019
Global Sparse Momentum SGD for Pruning Very Deep Neural Networks
Xiaohan Ding
Guiguang Ding
Xiangxin Zhou
Yuchen Guo
Jungong Han
Ji Liu
131
165
0
27 Sep 2019
Impact of Low-bitwidth Quantization on the Adversarial Robustness for Embedded Neural Networks
Rémi Bernhard
Pierre-Alain Moëllic
J. Dutertre
AAML
MQ
98
18
0
27 Sep 2019
Pruning from Scratch
Yulong Wang
Xiaolu Zhang
Lingxi Xie
Jun Zhou
Hang Su
Bo Zhang
Xiaolin Hu
77
196
0
27 Sep 2019
Balanced Binary Neural Networks with Gated Residual
Mingzhu Shen
Xianglong Liu
Ruihao Gong
Kai Han
MQ
79
36
0
26 Sep 2019
CAT: Compression-Aware Training for bandwidth reduction
Chaim Baskin
Brian Chmiel
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
69
12
0
25 Sep 2019
FALCON: Lightweight and Accurate Convolution
Jun-Gi Jang
Chun Quan
Hyun Dong Lee
U. Kang
13
1
0
25 Sep 2019
Learning to Seek: Autonomous Source Seeking with Deep Reinforcement Learning Onboard a Nano Drone Microcontroller
Bardienus P. Duisterhof
Srivatsan Krishnan
Jonathan J. Cruz
Colby R. Banbury
William Fu
Aleksandra Faust
Guido de Croon
Vijay Janapa Reddi
125
25
0
25 Sep 2019
Forward and Backward Information Retention for Accurate Binary Neural Networks
Haotong Qin
Ruihao Gong
Xianglong Liu
Mingzhu Shen
Ziran Wei
F. Yu
Jingkuan Song
MQ
224
334
0
24 Sep 2019
TinyBERT: Distilling BERT for Natural Language Understanding
Xiaoqi Jiao
Yichun Yin
Lifeng Shang
Xin Jiang
Xiao Chen
Linlin Li
F. Wang
Qun Liu
VLM
148
1,881
0
23 Sep 2019
A generalization of regularized dual averaging and its dynamics
Shih-Kang Chao
Guang Cheng
65
18
0
22 Sep 2019
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
137
19
0
22 Sep 2019
SkyNet: a Hardware-Efficient Method for Object Detection and Tracking on Embedded Systems
Xiaofan Zhang
Haoming Lu
Cong Hao
Jiachen Li
Bowen Cheng
...
Jinjun Xiong
Thomas Huang
Humphrey Shi
Wen-mei W. Hwu
Deming Chen
106
92
0
20 Sep 2019
Gate Decorator: Global Filter Pruning Method for Accelerating Deep Convolutional Neural Networks
Zhonghui You
Kun Yan
Jinmian Ye
Meng Ma
Ping Wang
3DPC
93
252
0
18 Sep 2019
Ensemble Knowledge Distillation for Learning Improved and Efficient Networks
Umar Asif
Jianbin Tang
S. Harrer
FedML
103
76
0
17 Sep 2019
Searching for Accurate Binary Neural Architectures
Mingzhu Shen
Kai Han
Chunjing Xu
Yunhe Wang
MQ
156
64
0
16 Sep 2019
Comparison of UNet, ENet, and BoxENet for Segmentation of Mast Cells in Scans of Histological Slices
A. Karimov
A. Razumov
Ruslana Manbatchurina
Ksenia Simonova
Irina Donets
A. Vlasova
Y. Khramtsova
K. Ushenin
SSeg
15
10
0
15 Sep 2019
Neural Machine Translation with 4-Bit Precision and Beyond
Alham Fikri Aji
Kenneth Heafield
MQ
31
7
0
13 Sep 2019
DASNet: Dynamic Activation Sparsity for Neural Network Efficiency Improvement
Qing Yang
Jiachen Mao
Zuoguan Wang
H. Li
65
15
0
13 Sep 2019
Characterizing the Deep Neural Networks Inference Performance of Mobile Applications
Samuel S. Ogden
Tian Guo
49
17
0
10 Sep 2019
VACL: Variance-Aware Cross-Layer Regularization for Pruning Deep Residual Networks
Shuang Gao
Xin Liu
Lung-Sheng Chien
William Zhang
J. Álvarez
VLM
3DPC
63
15
0
10 Sep 2019
DeepObfuscator: Obfuscating Intermediate Representations with Privacy-Preserving Adversarial Learning on Smartphones
Ang Li
Jiayi Guo
Huanrui Yang
Flora D. Salim
Yiran Chen
AAML
55
37
0
09 Sep 2019