Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
QUENN: QUantization Engine for low-power Neural Networks
Miguel de Prado
Maurizio Denna
Luca Benini
Nuria Pazos
MQ
84
14
0
14 Nov 2018
YOLO-LITE: A Real-Time Object Detection Algorithm Optimized for Non-GPU Computers
J. Pedoeem
Rachel Huang
ObjD
72
507
0
14 Nov 2018
Iteratively Training Look-Up Tables for Network Quantization
Fabien Cardinaux
Stefan Uhlich
K. Yoshiyama
J. A. García
Stephen Tiedemann
Thomas Kemp
Akira Nakamura
MQ
56
1
0
13 Nov 2018
Intelligent Drone Swarm for Search and Rescue Operations at Sea
Vincenzo Lomonaco
A. Trotta
M. Ziosi
Juan de Dios Yáñez Ávila
Natalia Díaz Rodríguez
38
25
0
13 Nov 2018
Private Model Compression via Knowledge Distillation
Ji Wang
Weidong Bao
Lichao Sun
Xiaomin Zhu
Bokai Cao
Philip S. Yu
FedML
83
120
0
13 Nov 2018
Generalized Ternary Connect: End-to-End Learning and Compression of Multiplication-Free Deep Neural Networks
Samyak Parajuli
Aswin Raghavan
S. Chai
52
7
0
12 Nov 2018
Sequence-Level Knowledge Distillation for Model Compression of Attention-based Sequence-to-Sequence Speech Recognition
Raden Muáz Muním
Nakamasa Inoue
Koichi Shinoda
64
26
0
12 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
104
12
0
10 Nov 2018
A First Look at Deep Learning Apps on Smartphones
Mengwei Xu
Jiawei Liu
Yuanqiang Liu
F. Lin
Yunxin Liu
Xuanzhe Liu
HAI
91
183
0
08 Nov 2018
Packing Sparse Convolutional Neural Networks for Efficient Systolic Array Implementations: Column Combining Under Joint Optimization
H. T. Kung
Bradley McDanel
Shanghang Zhang
98
135
0
07 Nov 2018
Training Domain Specific Models for Energy-Efficient Object Detection
Kentaro Yoshioka
Edward Lee
Roummel F. Marcia
ObjD
10
0
0
06 Nov 2018
Revealing Fine Structures of the Retinal Receptive Field by Deep Learning Networks
Hui Li
Yajing Zheng
Shanshan Jia
Yichen Zhang
Zhaofei Yu
Feng Chen
Yonghong Tian
Tiejun Huang
Jian K. Liu
98
23
0
06 Nov 2018
A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM
David Cortes
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Jiaming Xie
Yun Liang
Sijia Liu
Xinyu Lin
Yanzhi Wang
MQ
58
45
0
05 Nov 2018
ReLeQ: A Reinforcement Learning Approach for Deep Quantization of Neural Networks
Ahmed T. Elthakeb
Prannoy Pilligundla
Fatemehsadat Mireshghallah
Amir Yazdanbakhsh
H. Esmaeilzadeh
MQ
146
68
0
05 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
18
4
0
05 Nov 2018
Bi-Real Net: Binarizing Deep Network Towards Real-Network Performance
Zechun Liu
Wenhan Luo
Baoyuan Wu
Xin Yang
Wen Liu
K. Cheng
MQ
80
96
0
04 Nov 2018
A Batched Scalable Multi-Objective Bayesian Optimization Algorithm
Xi Lin
Hui-Ling Zhen
Zhenhua Li
Qingfu Zhang
Sam Kwong
51
11
0
04 Nov 2018
ReXCam: Resource-Efficient, Cross-Camera Video Analytics at Scale
Samvit Jain
Xun Zhang
Yuhao Zhou
Ganesh Ananthanarayanan
Junchen Jiang
Yuanchao Shu
Joseph E. Gonzalez
HAI
119
9
0
03 Nov 2018
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization
Anish Acharya
Rahul Goel
A. Metallinou
Inderjit Dhillon
102
62
0
01 Nov 2018
Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration
Yang He
Ping Liu
Ziwei Wang
Zhilan Hu
Yi Yang
AAML
3DPC
122
1,052
0
01 Nov 2018
Balanced Sparsity for Efficient DNN Inference on GPU
Zhuliang Yao
Shijie Cao
Wencong Xiao
Chen Zhang
Lanshun Nie
85
93
0
01 Nov 2018
Low-Precision Random Fourier Features for Memory-Constrained Kernel Approximation
Jian Zhang
Avner May
Tri Dao
Christopher Ré
80
29
0
31 Oct 2018
Convolutional Neural Network Quantization using Generalized Gamma Distribution
Doyun Kim
H. Yim
Sanghyuck Ha
Changgwun Lee
Inyup Kang
MQ
40
4
0
31 Oct 2018
SplineNets: Continuous Neural Decision Graphs
Cem Keskin
Shahram Izadi
51
11
0
31 Oct 2018
Low-Rank Embedding of Kernels in Convolutional Neural Networks under Random Shuffling
Chong Li
Zhun Sun
Jinshi Yu
Ming Hou
Qibin Zhao
32
5
0
31 Oct 2018
JavaScript Convolutional Neural Networks for Keyword Spotting in the Browser: An Experimental Analysis
Jaejun Lee
Raphael Tang
Jimmy J. Lin
21
2
0
30 Oct 2018
DeepTwist: Learning Model Compression via Occasional Weight Distortion
Dongsoo Lee
Parichay Kapoor
Byeongwook Kim
67
19
0
30 Oct 2018
Demystifying Neural Network Filter Pruning
Zhuwei Qin
Fuxun Yu
Adam Lesnikowski
Xiang Chen
67
5
0
29 Oct 2018
Low-complexity Recurrent Neural Network-based Polar Decoder with Weight Quantization Mechanism
Chieh-Fang Teng
Chengyang Wu
A. K. Ho
A. Wu
55
58
0
29 Oct 2018
Discrimination-aware Channel Pruning for Deep Neural Networks
Zhuangwei Zhuang
Mingkui Tan
Bohan Zhuang
Jing Liu
Yong Guo
Qingyao Wu
Junzhou Huang
Jin-Hui Zhu
166
601
0
28 Oct 2018
Learning Sparse Neural Networks via Sensitivity-Driven Regularization
Enzo Tartaglione
S. Lepsøy
Attilio Fiandrotti
Gianluca Francini
64
71
0
28 Oct 2018
A Miniaturized Semantic Segmentation Method for Remote Sensing Image
Shou-Yu Chen
Guang-Sheng Chen
Wei-Peng Jing
29
1
0
27 Oct 2018
Distilling with Performance Enhanced Students
Jack Turner
Elliot J. Crowley
Valentin Radu
José Cano
Amos Storkey
Michael F. P. O'Boyle
38
3
0
24 Oct 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
96
270
0
23 Oct 2018
Deep Neural Network inference with reduced word length
Lukas Mauch
Binh Yang
MQ
26
0
0
23 Oct 2018
Convolutional Neural Network Pruning to Accelerate Membrane Segmentation in Electron Microscopy
J. Roels
Jonas De Vylder
J. Aelterman
Yvan Saeys
Wilfried Philips
18
6
0
23 Oct 2018
Learning sparse transformations through backpropagation
Peter Bloem
43
0
0
22 Oct 2018
To Compress, or Not to Compress: Characterizing Deep Learning Model Compression for Embedded Inference
Qing Qin
Jie Ren
Jia-Le Yu
Ling Gao
Hai Wang
Jie Zheng
Yansong Feng
Jianbin Fang
Zheng Wang
47
24
0
21 Oct 2018
CNN inference acceleration using dictionary of centroids
D.Babin
I.Mazurenko
D.Parkhomenko
A.Voloshko
MQ
24
0
0
19 Oct 2018
Real-time Neural-based Input Method
Jiali Yao
Raphael Shu
Xinjian Li
K. Ohtsuki
Hideki Nakayama
30
4
0
19 Oct 2018
KTAN: Knowledge Transfer Adversarial Network
Peiye Liu
Wu Liu
Huadong Ma
Tao Mei
Mingoo Seok
GAN
91
28
0
18 Oct 2018
Progressive Weight Pruning of Deep Neural Networks using ADMM
Shaokai Ye
Tianyun Zhang
Kaiqi Zhang
Jiayu Li
Kaidi Xu
...
M. Fardad
Sijia Liu
Xiang Chen
Xinyu Lin
Yanzhi Wang
AI4CE
120
38
0
17 Oct 2018
Quantization for Rapid Deployment of Deep Neural Networks
J. Lee
Sangwon Ha
Saerom Choi
Won-Jo Lee
Seungwon Lee
MQ
80
49
0
12 Oct 2018
Rethinking the Value of Network Pruning
Zhuang Liu
Mingjie Sun
Tinghui Zhou
Gao Huang
Trevor Darrell
100
1,480
0
11 Oct 2018
A Closer Look at Structured Pruning for Neural Network Compression
Elliot J. Crowley
Jack Turner
Amos Storkey
Michael F. P. O'Boyle
3DPC
80
31
0
10 Oct 2018
Extreme Classification in Log Memory
Qixuan Huang
Yiqiu Wang
Tharun Medini
Anshumali Shrivastava
VLM
70
3
0
09 Oct 2018
Deep Neural Network Compression for Aircraft Collision Avoidance Systems
Kyle D. Julian
Mykel J. Kochenderfer
Michael P. Owen
63
173
0
09 Oct 2018
Rate Distortion For Model Compression: From Theory To Practice
Weihao Gao
Yu-Han Liu
Chong-Jun Wang
Sewoong Oh
100
31
0
09 Oct 2018
Light-Weight RefineNet for Real-Time Semantic Segmentation
Vladimir Nekrasov
Chunhua Shen
Ian Reid
SSeg
VLM
93
148
0
08 Oct 2018
Sparse Winograd Convolutional neural networks on small-scale systolic arrays
Feng Shi
Haochen Li
Yuhe Gao
Benjamin Kuschner
Song-Chun Zhu
55
15
0
03 Oct 2018
Previous
1
2
3
...
57
58
59
...
68
69
70
Next