ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
The Disruptions of 5G on Data-driven Technologies and Applications
The Disruptions of 5G on Data-driven Technologies and Applications
Dumitrel Loghin
Shaofeng Cai
Gang Chen
Tien Tuan Anh Dinh
Feiyi Fan
...
Wei Wang
Xiaokui Xiao
Yang Yang
Meihui Zhang
Zhonghua Zhang
66
66
0
06 Sep 2019
One Size Does Not Fit All: Multi-Scale, Cascaded RNNs for Radar
  Classification
One Size Does Not Fit All: Multi-Scale, Cascaded RNNs for Radar Classification
Dhrubojyoti Roy
S. Srivastava
Aditya Kusupati
Pranshu Jain
Manik Varma
A. Arora
91
12
0
06 Sep 2019
Training Deep Neural Networks Using Posit Number System
Training Deep Neural Networks Using Posit Number System
Jinming Lu
Siyuan Lu
Zhisheng Wang
Chao Fang
Jun Lin
Zhongfeng Wang
Li Du
MQ
57
14
0
06 Sep 2019
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for
  Real-time Execution on Mobile Devices
PCONV: The Missing but Desirable Sparsity in DNN Weight Pruning for Real-time Execution on Mobile Devices
Xiaolong Ma
Fu-Ming Guo
Wei Niu
Xue Lin
Jian Tang
Kaisheng Ma
Bin Ren
Yanzhi Wang
CVBM
109
180
0
06 Sep 2019
Additive function approximation in the brain
Additive function approximation in the brain
K. Harris
91
13
0
05 Sep 2019
A Novel Design of Adaptive and Hierarchical Convolutional Neural
  Networks using Partial Reconfiguration on FPGA
A Novel Design of Adaptive and Hierarchical Convolutional Neural Networks using Partial Reconfiguration on FPGA
Mohammad Farhadi
Mehdi Ghasemi
Yezhou Yang
58
27
0
05 Sep 2019
ModiPick: SLA-aware Accuracy Optimization For Mobile Deep Inference
ModiPick: SLA-aware Accuracy Optimization For Mobile Deep Inference
Samuel S. Ogden
Tian Guo
24
3
0
04 Sep 2019
What Happens on the Edge, Stays on the Edge: Toward Compressive Deep
  Learning
What Happens on the Edge, Stays on the Edge: Toward Compressive Deep Learning
Yongqian Li
Thomas Strohmer
33
2
0
04 Sep 2019
On the Downstream Performance of Compressed Word Embeddings
On the Downstream Performance of Compressed Word Embeddings
Avner May
Jian Zhang
Tri Dao
Christopher Ré
80
27
0
03 Sep 2019
PSDNet and DPDNet: Efficient channel expansion,
  Depthwise-Pointwise-Depthwise Inverted Bottleneck Block
PSDNet and DPDNet: Efficient channel expansion, Depthwise-Pointwise-Depthwise Inverted Bottleneck Block
Guoqing Li
Meng Zhang
Qianru Zhang
Ziyang Chen
Wenzhao Liu
Jiaojie Li
Xuzhao Shen
Jian Li
Zhenyu Zhu
Chau Yuen
73
5
0
03 Sep 2019
HarDNet: A Low Memory Traffic Network
HarDNet: A Low Memory Traffic Network
P. Chao
Chao-Yang Kao
Yunxing Ruan
Chien-Hsiang Huang
Y. Lin
259
270
0
03 Sep 2019
Touché: Towards Ideal and Efficient Cache Compression By Mitigating
  Tag Area Overheads
Touché: Towards Ideal and Efficient Cache Compression By Mitigating Tag Area Overheads
Seokin Hong
B. Abali
A. Buyuktosunoglu
Michael B. Healy
Prashant J. Nair
88
17
0
02 Sep 2019
Sparse Deep Neural Network Graph Challenge
Sparse Deep Neural Network Graph Challenge
J. Kepner
Simon Alford
V. Gadepally
Michael Jones
Lauren Milechin
Ryan A. Robinett
S. Samsi
GNN
65
49
0
02 Sep 2019
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference
  and Training Accelerators
EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerators
Lukas Cavigelli
Georg Rutishauser
Luca Benini
MQ
67
34
0
30 Aug 2019
Smaller Models, Better Generalization
Smaller Models, Better Generalization
Mayank Sharma
Suraj Tripathi
Abhimanyu Dubey
Jayadeva Jayadeva
Sai Guruju
Nihal Goalla
38
1
0
29 Aug 2019
Image Captioning with Sparse Recurrent Neural Network
Image Captioning with Sparse Recurrent Neural Network
J. Tan
Chee Seng Chan
Joon Huang Chuah
VLM
56
6
0
28 Aug 2019
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model
  Compression
On the Effectiveness of Low-Rank Matrix Factorization for LSTM Model Compression
Genta Indra Winata
Andrea Madotto
Jamin Shin
Elham J. Barezi
Pascale Fung
64
29
0
27 Aug 2019
DeepHoyer: Learning Sparser Neural Network with Differentiable
  Scale-Invariant Sparsity Measures
DeepHoyer: Learning Sparser Neural Network with Differentiable Scale-Invariant Sparsity Measures
Huanrui Yang
W. Wen
H. Li
96
99
0
27 Aug 2019
Once-for-All: Train One Network and Specialize it for Efficient
  Deployment
Once-for-All: Train One Network and Specialize it for Efficient Deployment
Han Cai
Chuang Gan
Tianzhe Wang
Zhekai Zhang
Song Han
OOD
223
1,288
0
26 Aug 2019
Differentiable Product Quantization for End-to-End Embedding Compression
Differentiable Product Quantization for End-to-End Embedding Compression
Ting Chen
Lala Li
Yizhou Sun
MQ
55
68
0
26 Aug 2019
Patient Knowledge Distillation for BERT Model Compression
Patient Knowledge Distillation for BERT Model Compression
S. Sun
Yu Cheng
Zhe Gan
Jingjing Liu
151
843
0
25 Aug 2019
SeesawFaceNets: sparse and robust face verification model for mobile
  platform
SeesawFaceNets: sparse and robust face verification model for mobile platform
Jintao Zhang
3DHCVBM
57
9
0
24 Aug 2019
Well-Read Students Learn Better: On the Importance of Pre-training
  Compact Models
Well-Read Students Learn Better: On the Importance of Pre-training Compact Models
Iulia Turc
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
84
225
0
23 Aug 2019
Learning Filter Basis for Convolutional Neural Network Compression
Learning Filter Basis for Convolutional Neural Network Compression
Yawei Li
Shuhang Gu
Luc Van Gool
Radu Timofte
SupR
79
99
0
23 Aug 2019
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous
  Mobile Processors
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors
Royson Lee
Stylianos I. Venieris
Łukasz Dudziak
S. Bhattacharya
Nicholas D. Lane
SupR
56
96
0
21 Aug 2019
Restricted Recurrent Neural Networks
Restricted Recurrent Neural Networks
Enmao Diao
Jie Ding
Vahid Tarokh
67
21
0
21 Aug 2019
Efficient Deep Neural Networks
Efficient Deep Neural Networks
Bichen Wu
63
12
0
20 Aug 2019
Implicit Deep Learning
Implicit Deep Learning
L. Ghaoui
Fangda Gu
Bertrand Travacca
Armin Askari
Alicia Y. Tsai
AI4CE
79
182
0
17 Aug 2019
Improved Techniques for Training Adaptive Deep Networks
Improved Techniques for Training Adaptive Deep Networks
Hao Li
Hong Zhang
Xiaojuan Qi
Ruigang Yang
Gao Huang
78
135
0
17 Aug 2019
Survey on Deep Neural Networks in Speech and Vision Systems
Survey on Deep Neural Networks in Speech and Vision Systems
M. Alam
Manar D. Samad
Lasitha Vidyaratne
Alexander M. Glandon
Khan M. Iftekharuddin
3DVVLMAI4TS
100
212
0
16 Aug 2019
Accelerated CNN Training Through Gradient Approximation
Accelerated CNN Training Through Gradient Approximation
Ziheng Wang
Sree Harsha Nelaturu
366
5
0
15 Aug 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit
  Neural Networks
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
128
460
0
14 Aug 2019
Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By
  Lookup Tables
Justlookup: One Millisecond Deep Feature Extraction for Point Clouds By Lookup Tables
Hongxin Lin
Zelin Xiao
Yang Tan
Hongyang Chao
Shengyong Ding
3DPC
57
22
0
14 Aug 2019
Neural Plasticity Networks
Neural Plasticity Networks
Yongqian Li
Shihao Ji
30
1
0
13 Aug 2019
Adversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability Suppression
Divyam Madaan
Jinwoo Shin
Sung Ju Hwang
AAML
45
3
0
12 Aug 2019
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via
  Asymmetric Convolution Blocks
ACNet: Strengthening the Kernel Skeletons for Powerful CNN via Asymmetric Convolution Blocks
Xiaohan Ding
Yuchen Guo
Guiguang Ding
Jiawei Han
98
680
0
11 Aug 2019
Effective Training of Convolutional Neural Networks with Low-bitwidth
  Weights and Activations
Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations
Bohan Zhuang
Jing Liu
Mingkui Tan
Lingqiao Liu
Ian Reid
Chunhua Shen
MQ
107
46
0
10 Aug 2019
Recent Advances in Deep Learning for Object Detection
Recent Advances in Deep Learning for Object Detection
Xiongwei Wu
Doyen Sahoo
Guosheng Lin
VLMObjD
135
824
0
10 Aug 2019
Group Pruning using a Bounded-Lp norm for Group Gating and
  Regularization
Group Pruning using a Bounded-Lp norm for Group Gating and Regularization
Chaithanya Kumar Mummadi
Tim Genewein
Dan Zhang
Thomas Brox
Volker Fischer
53
3
0
09 Aug 2019
Efficient Inference of CNNs via Channel Pruning
Efficient Inference of CNNs via Channel Pruning
Boyu Zhang
A. Davoodi
Y. Hu
CVBM
23
6
0
08 Aug 2019
Exploiting Channel Similarity for Accelerating Deep Convolutional Neural
  Networks
Exploiting Channel Similarity for Accelerating Deep Convolutional Neural Networks
Yunxiang Zhang
Chenglong Zhao
Bingbing Ni
Jian Zhang
Haoran Deng
50
2
0
06 Aug 2019
Full-Stack Filters to Build Minimum Viable CNNs
Full-Stack Filters to Build Minimum Viable CNNs
Kai Han
Yunhe Wang
Yixing Xu
Chunjing Xu
Dacheng Tao
Chang Xu
MQ
39
3
0
06 Aug 2019
Knowledge Consistency between Neural Networks and Beyond
Knowledge Consistency between Neural Networks and Beyond
Ruofan Liang
Tianlin Li
Longfei Li
Jingchao Wang
Quanshi Zhang
86
28
0
05 Aug 2019
GDRQ: Group-based Distribution Reshaping for Quantization
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
65
3
0
05 Aug 2019
Distilling Knowledge From a Deep Pose Regressor Network
Distilling Knowledge From a Deep Pose Regressor Network
Muhamad Risqi U. Saputra
Pedro Porto Buarque de Gusmão
Yasin Almalioglu
Andrew Markham
A. Trigoni
94
103
0
02 Aug 2019
Distributed Deep Convolutional Neural Networks for the
  Internet-of-Things
Distributed Deep Convolutional Neural Networks for the Internet-of-Things
Simone Disabato
M. Roveri
Cesare Alippi
64
50
0
02 Aug 2019
Deep Task-Based Quantization
Deep Task-Based Quantization
Nir Shlezinger
Yonina C. Eldar
MQ
53
61
0
01 Aug 2019
Accelerating CNN Training by Pruning Activation Gradients
Accelerating CNN Training by Pruning Activation Gradients
Xucheng Ye
Pengcheng Dai
Junyu Luo
Xin Guo
Weisheng Zhao
Jianlei Yang
Yiran Chen
23
2
0
01 Aug 2019
Machine Learning at the Network Edge: A Survey
Machine Learning at the Network Edge: A Survey
M. G. Sarwar Murshed
Chris Murphy
Daqing Hou
Nazar Khan
Ganesh Ananthanarayanan
Faraz Hussain
98
393
0
31 Jul 2019
Tuning Algorithms and Generators for Efficient Edge Inference
Tuning Algorithms and Generators for Efficient Edge Inference
R. Naous
Lazar Supic
Yoonhwan Kang
Ranko Seradejovic
Anish Singhani
Vladimir M. Stojanović
23
2
0
31 Jul 2019
Previous
123...495051...686970
Next