ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
An Overview of Datatype Quantization Techniques for Convolutional Neural
  Networks
An Overview of Datatype Quantization Techniques for Convolutional Neural Networks
A. Athar
BDLMQ
20
0
0
22 Aug 2018
Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
Yang He
Xuanyi Dong
Guoliang Kang
Yanwei Fu
C. Yan
Yi Yang
129
135
0
22 Aug 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAMLVLM
132
969
0
21 Aug 2018
Constrained-size Tensorflow Models for YouTube-8M Video Understanding
  Challenge
Constrained-size Tensorflow Models for YouTube-8M Video Understanding Challenge
Tianqi Liu
Bo Liu
ALMMQ
68
5
0
21 Aug 2018
Learning to Quantize Deep Networks by Optimizing Quantization Intervals
  with Task Loss
Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss
S. Jung
Changyong Son
Seohyung Lee
JinWoo Son
Youngjun Kwak
Jae-Joon Han
Sung Ju Hwang
Changkyu Choi
MQ
103
376
0
17 Aug 2018
Network Decoupling: From Regular to Depthwise Separable Convolutions
Network Decoupling: From Regular to Depthwise Separable Convolutions
Jianbo Guo
Yuxi Li
Weiyao Lin
Yurong Chen
Jianguo Li
3DVOOD
65
86
0
16 Aug 2018
Fast and Accurate, Convolutional Neural Network Based Approach for
  Object Detection from UAV
Fast and Accurate, Convolutional Neural Network Based Approach for Object Detection from UAV
Xiaoliang Wang
Peng Cheng
Xinchuan Liu
Benedict Uzochukwu
58
49
0
16 Aug 2018
Rank-1 Convolutional Neural Network
Rank-1 Convolutional Neural Network
Hyein Kim
Jungho Yoon
Byeongseon Jeong
Sukho Lee
3DPC3DV
13
2
0
13 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
125
236
0
13 Aug 2018
VerIDeep: Verifying Integrity of Deep Neural Networks through
  Sensitive-Sample Fingerprinting
VerIDeep: Verifying Integrity of Deep Neural Networks through Sensitive-Sample Fingerprinting
Zecheng He
Tianwei Zhang
R. Lee
FedMLAAMLMLAU
62
19
0
09 Aug 2018
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Soojeong Kim
Gyeong-In Yu
Hojin Park
Sungwoo Cho
Eunji Jeong
Hyeonmin Ha
Sanha Lee
Joo Seong Jeong
Byung-Gon Chun
67
75
0
08 Aug 2018
Efficient Fusion of Sparse and Complementary Convolutions
Efficient Fusion of Sparse and Complementary Convolutions
Chun-Fu Chen
Quanfu Fan
Marco Pistoia
G. Lee
36
0
0
07 Aug 2018
Deep Generative Modeling for Scene Synthesis via Hybrid Representations
Deep Generative Modeling for Scene Synthesis via Hybrid Representations
Zaiwei Zhang
Zhenpei Yang
Chongyang Ma
Linjie Luo
Alexander G. Huth
E. Vouga
Qi-Xing Huang
GAN3DPC
74
130
0
06 Aug 2018
Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved
  Representational Capability and Advanced Training Algorithm
Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm
Zechun Liu
Baoyuan Wu
Wenhan Luo
Xin Yang
Wen Liu
K. Cheng
MQ
148
559
0
01 Aug 2018
Universal Approximation with Quadratic Deep Networks
Universal Approximation with Quadratic Deep Networks
Fenglei Fan
Jinjun Xiong
Ge Wang
PINN
131
83
0
31 Jul 2018
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural
  Network in Embedded FPGA
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
Junsong Wang
Qiuwen Lou
Xiaofan Zhang
Chao Zhu
Yonghua Lin
Deming Chen
MQ
94
93
0
31 Jul 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human
  Activity Recognition
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition
Zhan Yang
Osolo Ian Raymond
Chengyuan Zhang
Ying Wan
J. Long
CVBM
74
36
0
31 Jul 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
154
3,023
0
31 Jul 2018
Extreme Network Compression via Filter Group Approximation
Extreme Network Compression via Filter Group Approximation
Bo Peng
Wenming Tan
Zheyang Li
Shun Zhang
Di Xie
Shiliang Pu
90
63
0
30 Jul 2018
Robust Student Network Learning
Robust Student Network Learning
Tianyu Guo
Chang Xu
Shiyi He
Boxin Shi
Chao Xu
Dacheng Tao
OOD
104
30
0
30 Jul 2018
StructADMM: A Systematic, High-Efficiency Framework of Structured Weight
  Pruning for DNNs
StructADMM: A Systematic, High-Efficiency Framework of Structured Weight Pruning for DNNs
Tianyun Zhang
Shaokai Ye
Kaiqi Zhang
Xiaolong Ma
Ning Liu
...
Jian Tang
Kaisheng Ma
Xue Lin
M. Fardad
Yanzhi Wang
98
51
0
29 Jul 2018
MaskConnect: Connectivity Learning by Gradient Descent
MaskConnect: Connectivity Learning by Gradient Descent
Karim Ahmed
Lorenzo Torresani
97
49
0
28 Jul 2018
FPGA-Based CNN Inference Accelerator Synthesized from Multi-Threaded C
  Software
FPGA-Based CNN Inference Accelerator Synthesized from Multi-Threaded C Software
Jin Hee Kim
Brett Grady
Ruolong Lian
J. Brothers
J. Anderson
39
41
0
27 Jul 2018
A Unified Approximation Framework for Compressing and Accelerating Deep
  Neural Networks
A Unified Approximation Framework for Compressing and Accelerating Deep Neural Networks
Yuzhe Ma
Ran Chen
Wei Li
Fanhua Shang
Wenjian Yu
Minsik Cho
Bei Yu
27
3
0
26 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep
  Neural Networks
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
120
705
0
26 Jul 2018
Crossbar-aware neural network pruning
Crossbar-aware neural network pruning
Ling Liang
Lei Deng
Y. Zeng
Xing Hu
Yu Ji
Xin Ma
Guoqi Li
Yuan Xie
92
41
0
25 Jul 2018
Coreset-Based Neural Network Compression
Coreset-Based Neural Network Compression
Abhimanyu Dubey
Moitreya Chatterjee
Narendra Ahuja
66
81
0
25 Jul 2018
CReaM: Condensed Real-time Models for Depth Prediction using
  Convolutional Neural Networks
CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks
Andrew Spek
Thanuja Dharmasiri
Tom Drummond
70
18
0
24 Jul 2018
Supporting Very Large Models using Automatic Dataflow Graph Partitioning
Supporting Very Large Models using Automatic Dataflow Graph Partitioning
Minjie Wang
Chien-chin Huang
Jinyang Li
127
155
0
24 Jul 2018
Recent Advances in Convolutional Neural Network Acceleration
Recent Advances in Convolutional Neural Network Acceleration
Qianru Zhang
Meng Zhang
Tinghuan Chen
Zhifei Sun
Yuzhe Ma
Bei Yu
84
352
0
23 Jul 2018
Spatial Correlation and Value Prediction in Convolutional Neural
  Networks
Spatial Correlation and Value Prediction in Convolutional Neural Networks
Gil Shomron
U. Weiser
63
43
0
21 Jul 2018
Filter Distillation for Network Compression
Filter Distillation for Network Compression
Xavier Suau
Luca Zappella
N. Apostoloff
46
38
0
20 Jul 2018
Optimize Deep Convolutional Neural Network with Ternarized Weights and
  High Accuracy
Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy
Zhezhi He
Boqing Gong
Deliang Fan
49
22
0
20 Jul 2018
Statistical Model Compression for Small-Footprint Natural Language
  Understanding
Statistical Model Compression for Small-Footprint Natural Language Understanding
Grant P. Strimel
Kanthashree Mysore Sathyendra
Stanislav Peshterliev
49
9
0
19 Jul 2018
Defend Deep Neural Networks Against Adversarial Examples via Fixed and
  Dynamic Quantized Activation Functions
Defend Deep Neural Networks Against Adversarial Examples via Fixed and Dynamic Quantized Activation Functions
Adnan Siraj Rakin
Jinfeng Yi
Boqing Gong
Deliang Fan
AAMLMQ
87
50
0
18 Jul 2018
BRIEF: Backward Reduction of CNNs with Information Flow Analysis
BRIEF: Backward Reduction of CNNs with Information Flow Analysis
Yu-Hsun Lin
Chun-Nan Chou
Edward Y. Chang
36
0
0
16 Jul 2018
Morse Code Datasets for Machine Learning
Morse Code Datasets for Machine Learning
Sourya Dey
K. Chugg
Peter A. Beerel
28
10
0
11 Jul 2018
Make $\ell_1$ Regularization Effective in Training Sparse CNN
Make ℓ1\ell_1ℓ1​ Regularization Effective in Training Sparse CNN
Juncai He
Xiaodong Jia
Jinchao Xu
Lian Zhang
Liang Zhao
62
5
0
11 Jul 2018
FINN-L: Library Extensions and Design Trade-off Analysis for Variable
  Precision LSTM Networks on FPGAs
FINN-L: Library Extensions and Design Trade-off Analysis for Variable Precision LSTM Networks on FPGAs
Vladimir Rybalkin
Alessandro Pappalardo
M. M. Ghaffar
Giulio Gambardella
Norbert Wehn
Michaela Blott
126
72
0
11 Jul 2018
Big-Little Net: An Efficient Multi-Scale Feature Representation for
  Visual and Speech Recognition
Big-Little Net: An Efficient Multi-Scale Feature Representation for Visual and Speech Recognition
Chun-Fu Chen
Quanfu Fan
Neil Rohit Mallinar
Tom Sercu
Rogerio Feris
78
98
0
10 Jul 2018
Auto Deep Compression by Reinforcement Learning Based Actor-Critic
  Structure
Auto Deep Compression by Reinforcement Learning Based Actor-Critic Structure
Hamed Hakkak
OffRLAI4CE
98
1
0
08 Jul 2018
Anytime Neural Prediction via Slicing Networks Vertically
Anytime Neural Prediction via Slicing Networks Vertically
Hankook Lee
Jinwoo Shin
AI4CE
68
16
0
07 Jul 2018
Sparse Deep Neural Network Exact Solutions
Sparse Deep Neural Network Exact Solutions
J. Kepner
V. Gadepally
Hayden Jananthan
Lauren Milechin
S. Samsi
113
14
0
06 Jul 2018
SGAD: Soft-Guided Adaptively-Dropped Neural Network
SGAD: Soft-Guided Adaptively-Dropped Neural Network
Zhisheng Wang
Fangxuan Sun
Jun Lin
Zhongfeng Wang
Bo Yuan
44
7
0
04 Jul 2018
Restructuring Batch Normalization to Accelerate CNN Training
Restructuring Batch Normalization to Accelerate CNN Training
Wonkyung Jung
Daejin Jung
and Byeongho Kim
Sunjung Lee
Wonjong Rhee
Jung Ho Ahn
57
64
0
04 Jul 2018
Confidential Inference via Ternary Model Partitioning
Confidential Inference via Ternary Model Partitioning
Zhongshu Gu
Heqing Huang
Jialong Zhang
D. Su
Hani Jamjoom
Ankita Lamba
Dimitrios E. Pendarakis
Ian Molloy
103
53
0
03 Jul 2018
Stochastic Layer-Wise Precision in Deep Neural Networks
Stochastic Layer-Wise Precision in Deep Neural Networks
Griffin Lacey
Graham W. Taylor
S. Areibi
91
18
0
03 Jul 2018
Weight-importance sparse training in keyword spotting
Weight-importance sparse training in keyword spotting
Sihao Xue
Zhenyi Ying
Fan Mo
Min Wang
Jue Sun
24
0
0
02 Jul 2018
Evenly Cascaded Convolutional Networks
Evenly Cascaded Convolutional Networks
Chengxi Ye
Chinmaya Devaraj
Michael Maynord
Cornelia Fermuller
Yiannis Aloimonos
35
7
0
02 Jul 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
114
133
0
01 Jul 2018
Previous
123...596061...686970
Next