ResearchTrend.AI
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,450 papers shown
Deep Asymmetric Networks with a Set of Node-wise Variant Activation Functions
Jinhyeok Jang
Hyunjoong Cho
Jaehong Kim
Jaeyeon Lee
Seungjoon Yang
18
2
0
11 Sep 2018
Deep Learning Towards Mobile Applications
Ji Wang
Bokai Cao
Philip S. Yu
Lichao Sun
Weidong Bao
Xiaomin Zhu
HAI
32
98
0
10 Sep 2018
Not Just Privacy: Improving Performance of Private Deep Learning in Mobile Cloud
Ji Wang
Jianguo Zhang
Weidong Bao
Xiaomin Zhu
Bokai Cao
Philip S. Yu
29
194
0
10 Sep 2018
Probabilistic Binary Neural Networks
Jorn W. T. Peters
Max Welling
BDL
UQCV
MQ
30
52
0
10 Sep 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
Shivang Agarwal
Jean Ogier du Terrail
F. Jurie
ObjD
34
123
0
10 Sep 2018
Fast and Efficient Information Transmission with Burst Spikes in Deep Spiking Neural Networks
Seongsik Park
Seijoon Kim
Hyeokjun Choe
Sungroh Yoon
28
94
0
10 Sep 2018
Training for Faster Adversarial Robustness Verification via Inducing ReLU Stability
Kai Y. Xiao
Vincent Tjeng
Nur Muhammad (Mahi) Shafiullah
Aleksander Madry
AAML
OOD
12
200
0
09 Sep 2018
2PFPCE: Two-Phase Filter Pruning Based on Conditional Entropy
Chuhan Min
Aosen Wang
Yiran Chen
Wenyao Xu
Xin Chen
30
41
0
06 Sep 2018
Deep Learning for Generic Object Detection: A Survey
Li Liu
Wanli Ouyang
Xiaogang Wang
Paul Fieguth
Jie Chen
Xinwang Liu
M. Pietikäinen
ObjD
VLM
OOD
93
2,435
0
06 Sep 2018
Pack and Detect: Fast Object Detection in Videos Using Region-of-Interest Packing
Athindran Ramesh Kumar
Balaraman Ravindran
A. Raghunathan
ObjD
29
13
0
05 Sep 2018
ChannelNets: Compact and Efficient Convolutional Neural Networks via Channel-Wise Convolutions
Hongyang Gao
Zhengyang Wang
Shuiwang Ji
11
70
0
05 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
Lei Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Yue Liu
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
47
40
0
04 Sep 2018
Learning Sparse Low-Precision Neural Networks With Learnable Regularization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
32
31
0
01 Sep 2018
An Adaptive Locally Connected Neuron Model: Focusing Neuron
F. Boray Tek
27
5
0
31 Aug 2018
Fixed-Point Convolutional Neural Network for Real-Time Video Processing in FPGA
R. Solovyev
A. Kustov
D. Telpukhov
V. S. Ruhlov
Alexandr A Kalinin
MQ
29
41
0
29 Aug 2018
Sparsity in Deep Neural Networks - An Empirical Investigation with TensorQuant
D. Loroch
Franz-Josef Pfreundt
Norbert Wehn
J. Keuper
31
5
0
27 Aug 2018
Predefined Sparseness in Recurrent Sequence Models
T. Demeester
Johannes Deleu
Fréderic Godin
Chris Develder
21
3
0
27 Aug 2018
Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error
Taiji Suzuki
Hiroshi Abe
Tomoya Murata
Shingo Horiuchi
Kotaro Ito
Tokuma Wachi
So Hirai
Masatoshi Yukishima
Tomoaki Nishimura
MLT
27
10
0
26 Aug 2018
DeepTracker: Visualizing the Training Process of Convolutional Neural Networks
Dongyu Liu
Weiwei Cui
Kai Jin
Yuxiao Guo
Huamin Qu
HAI
11
34
0
26 Aug 2018
An Overview of Datatype Quantization Techniques for Convolutional Neural Networks
A. Athar
BDL
MQ
8
0
0
22 Aug 2018
Asymptotic Soft Filter Pruning for Deep Convolutional Neural Networks
Yang He
Xuanyi Dong
Guoliang Kang
Yanwei Fu
C. Yan
Yi Yang
54
134
0
22 Aug 2018
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
Yang He
Guoliang Kang
Xuanyi Dong
Yanwei Fu
Yi Yang
AAML
VLM
36
958
0
21 Aug 2018
Constrained-size Tensorflow Models for YouTube-8M Video Understanding Challenge
Tianqi Liu
Bo Liu
ALM
MQ
36
5
0
21 Aug 2018
Learning to Quantize Deep Networks by Optimizing Quantization Intervals with Task Loss
S. Jung
Changyong Son
Seohyung Lee
JinWoo Son
Youngjun Kwak
Jae-Joon Han
Sung Ju Hwang
Changkyu Choi
MQ
25
373
0
17 Aug 2018
Network Decoupling: From Regular to Depthwise Separable Convolutions
Jianbo Guo
Yuxi Li
Weiyao Lin
Yurong Chen
Jianguo Li
3DV
OOD
19
84
0
16 Aug 2018
Fast and Accurate, Convolutional Neural Network Based Approach for Object Detection from UAV
Xiaoliang Wang
Peng Cheng
Xinchuan Liu
Benedict Uzochukwu
19
48
0
16 Aug 2018
Rank-1 Convolutional Neural Network
Hyein Kim
Jungho Yoon
Byeongseon Jeong
Sukho Lee
3DPC
3DV
6
2
0
13 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
42
232
0
13 Aug 2018
VerIDeep: Verifying Integrity of Deep Neural Networks through Sensitive-Sample Fingerprinting
Zecheng He
Tianwei Zhang
R. Lee
FedML
AAML
MLAU
36
18
0
09 Aug 2018
Parallax: Sparsity-aware Data Parallel Training of Deep Neural Networks
Soojeong Kim
Gyeong-In Yu
Hojin Park
Sungwoo Cho
Eunji Jeong
Hyeonmin Ha
Sanha Lee
Joo Seong Jeong
Byung-Gon Chun
28
73
0
08 Aug 2018
Efficient Fusion of Sparse and Complementary Convolutions
Chun-Fu Chen
Quanfu Fan
Marco Pistoia
G. Lee
24
0
0
07 Aug 2018
Deep Generative Modeling for Scene Synthesis via Hybrid Representations
Zaiwei Zhang
Zhenpei Yang
Chongyang Ma
Linjie Luo
Alexander G. Huth
E. Vouga
Qi-Xing Huang
GAN
3DPC
23
126
0
06 Aug 2018
Bi-Real Net: Enhancing the Performance of 1-bit CNNs With Improved Representational Capability and Advanced Training Algorithm
Zechun Liu
Baoyuan Wu
Wenhan Luo
Xin Yang
Wen Liu
K. Cheng
MQ
53
551
0
01 Aug 2018
Universal Approximation with Quadratic Deep Networks
Fenglei Fan
Jinjun Xiong
Ge Wang
PINN
41
79
0
31 Jul 2018
Design Flow of Accelerating Hybrid Extremely Low Bit-width Neural Network in Embedded FPGA
Junsong Wang
Qiuwen Lou
Xiaofan Zhang
Chao Zhu
Yonghua Lin
Deming Chen
MQ
36
93
0
31 Jul 2018
DFTerNet: Towards 2-bit Dynamic Fusion Networks for Accurate Human Activity Recognition
Zhan Yang
Osolo Ian Raymond
Chengyuan Zhang
Ying Wan
J. Long
CVBM
47
36
0
31 Jul 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
53
2,991
0
31 Jul 2018
Extreme Network Compression via Filter Group Approximation
Bo Peng
Wenming Tan
Zheyang Li
Shun Zhang
Di Xie
Shiliang Pu
29
63
0
30 Jul 2018
Robust Student Network Learning
Tianyu Guo
Chang Xu
Shiyi He
Boxin Shi
Chao Xu
Dacheng Tao
OOD
42
30
0
30 Jul 2018
StructADMM: A Systematic, High-Efficiency Framework of Structured Weight Pruning for DNNs
Tianyun Zhang
Shaokai Ye
Kaiqi Zhang
Xiaolong Ma
Ning Liu
...
Jian Tang
Kaisheng Ma
Xue Lin
M. Fardad
Yanzhi Wang
31
50
0
29 Jul 2018
MaskConnect: Connectivity Learning by Gradient Descent
Karim Ahmed
Lorenzo Torresani
30
49
0
28 Jul 2018
FPGA-Based CNN Inference Accelerator Synthesized from Multi-Threaded C Software
Jin Hee Kim
Brett Grady
Ruolong Lian
J. Brothers
J. Anderson
14
41
0
27 Jul 2018
A Unified Approximation Framework for Compressing and Accelerating Deep Neural Networks
Yuzhe Ma
Ran Chen
Wei Li
Fanhua Shang
Wenjian Yu
Minsik Cho
Bei Yu
25
3
0
26 Jul 2018
LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks
Dongqing Zhang
Jiaolong Yang
Dongqiangzi Ye
G. Hua
MQ
23
699
0
26 Jul 2018
Crossbar-aware neural network pruning
Ling Liang
Lei Deng
Y. Zeng
Xing Hu
Yu Ji
Xin Ma
Guoqi Li
Yuan Xie
36
41
0
25 Jul 2018
Coreset-Based Neural Network Compression
Abhimanyu Dubey
Moitreya Chatterjee
Narendra Ahuja
32
79
0
25 Jul 2018
CReaM: Condensed Real-time Models for Depth Prediction using Convolutional Neural Networks
Andrew Spek
Thanuja Dharmasiri
Tom Drummond
31
18
0
24 Jul 2018
Supporting Very Large Models using Automatic Dataflow Graph Partitioning
Minjie Wang
Chien-chin Huang
Jinyang Li
54
154
0
24 Jul 2018
Recent Advances in Convolutional Neural Network Acceleration
Qianru Zhang
Meng Zhang
Tinghuan Chen
Zhifei Sun
Yuzhe Ma
Bei Yu
36
348
0
23 Jul 2018
Spatial Correlation and Value Prediction in Convolutional Neural Networks
Gil Shomron
U. Weiser
11
43
0
21 Jul 2018