ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXivPDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
Title
Learning SMaLL Predictors
Learning SMaLL Predictors
Vikas K. Garg
O. Dekel
Lin Xiao
8
3
0
06 Mar 2018
Personalized Exposure Control Using Adaptive Metering and Reinforcement
  Learning
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning
Huan Yang
Baoyuan Wang
Noranart Vesdapunt
Minyi Guo
S. B. Kang
32
22
0
06 Mar 2018
Deep Neural Network Compression with Single and Multiple Level
  Quantization
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
20
114
0
06 Mar 2018
Stochastic Activation Pruning for Robust Adversarial Defense
Stochastic Activation Pruning for Robust Adversarial Defense
Guneet Singh Dhillon
Kamyar Azizzadenesheli
Zachary Chase Lipton
Jeremy Bernstein
Jean Kossaifi
Aran Khanna
Anima Anandkumar
AAML
33
545
0
05 Mar 2018
An Optimal Control Approach to Deep Learning and Applications to
  Discrete-Weight Neural Networks
An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks
Qianxiao Li
Shuji Hao
32
75
0
04 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning
  Approaches
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
34
875
0
03 Mar 2018
Scalar Quantization as Sparse Least Square Optimization
Scalar Quantization as Sparse Least Square Optimization
Chen Wang
Xiaomei Yang
Shaomin Fei
Kai Zhou
Xiaofeng Gong
Miao Du
Ruisen Luo
MQ
20
3
0
01 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Yichi Zhang
Zhijian Ou
27
0
0
01 Mar 2018
Compressing Neural Networks using the Variational Information Bottleneck
Compressing Neural Networks using the Variational Information Bottleneck
Bin Dai
Chen Zhu
David Wipf
MLT
28
179
0
28 Feb 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
21
25
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse
  Coding
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
24
12
0
28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
28
33
0
27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth
  Concurrency Analysis
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
33
704
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge
  Intelligence
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
41
4
0
26 Feb 2018
Wide Compression: Tensor Ring Nets
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
13
167
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
35
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for
  each weight
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
41
71
0
23 Feb 2018
Approximation Algorithms for Cascading Prediction Models
Approximation Algorithms for Cascading Prediction Models
Matthew J. Streeter
TPM
13
19
0
21 Feb 2018
Building Efficient ConvNets using Redundant Feature Pruning
Building Efficient ConvNets using Redundant Feature Pruning
B. Ayinde
J. Zurada
VLM
3DPC
29
47
0
21 Feb 2018
3LC: Lightweight and Effective Traffic Compression for Distributed
  Machine Learning
3LC: Lightweight and Effective Traffic Compression for Distributed Machine Learning
Hyeontaek Lim
D. Andersen
M. Kaminsky
21
70
0
21 Feb 2018
The Description Length of Deep Learning Models
The Description Length of Deep Learning Models
Léonard Blier
Yann Ollivier
32
97
0
20 Feb 2018
DeepThin: A Self-Compressing Library for Deep Neural Networks
DeepThin: A Self-Compressing Library for Deep Neural Networks
Matthew Sotoudeh
Sara S. Baghsorkhi
18
4
0
20 Feb 2018
Layer-wise synapse optimization for implementing neural networks on
  general neuromorphic architectures
Layer-wise synapse optimization for implementing neural networks on general neuromorphic architectures
John Mern
Jayesh K. Gupta
Mykel Kochenderfer
40
1
0
20 Feb 2018
A Scalable Near-Memory Architecture for Training Deep Neural Networks on
  Large In-Memory Datasets
A Scalable Near-Memory Architecture for Training Deep Neural Networks on Large In-Memory Datasets
Fabian Schuiki
Michael Schaffner
Frank K. Gürkaynak
Luca Benini
31
70
0
19 Feb 2018
Towards Ultra-High Performance and Energy Efficiency of Deep Learning
  Systems: An Algorithm-Hardware Co-Optimization Framework
Towards Ultra-High Performance and Energy Efficiency of Deep Learning Systems: An Algorithm-Hardware Co-Optimization Framework
Yanzhi Wang
Caiwen Ding
Zhe Li
Geng Yuan
Siyu Liao
...
Bo Yuan
Xuehai Qian
Jian Tang
Qinru Qiu
Xinyu Lin
31
33
0
18 Feb 2018
Efficient Sparse-Winograd Convolutional Neural Networks
Efficient Sparse-Winograd Convolutional Neural Networks
Xingyu Liu
Jeff Pool
Song Han
W. Dally
13
122
0
18 Feb 2018
Towards Principled Design of Deep Convolutional Networks: Introducing
  SimpNet
Towards Principled Design of Deep Convolutional Networks: Introducing SimpNet
S. H. HasanPour
Mohammad Rouhani
Mohsen Fayyaz
Mohammad Sabokrou
Ehsan Adeli
52
45
0
17 Feb 2018
Systematic Weight Pruning of DNNs using Alternating Direction Method of
  Multipliers
Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers
Tianyun Zhang
Shaokai Ye
Yipeng Zhang
Yanzhi Wang
M. Fardad
22
21
0
15 Feb 2018
Model compression via distillation and quantization
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
48
718
0
15 Feb 2018
Security Analysis and Enhancement of Model Compressed Deep Learning
  Systems under Adversarial Attacks
Security Analysis and Enhancement of Model Compressed Deep Learning Systems under Adversarial Attacks
Qi Liu
Tao Liu
Zihao Liu
Yanzhi Wang
Yier Jin
Wujie Wen
AAML
35
48
0
14 Feb 2018
Paraphrasing Complex Network: Network Compression via Factor Transfer
Paraphrasing Complex Network: Network Compression via Factor Transfer
Jangho Kim
Seonguk Park
Nojun Kwak
32
543
0
14 Feb 2018
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
SLAQ: Quality-Driven Scheduling for Distributed Machine Learning
Haoyu Zhang
Logan Stafman
Andrew Or
M. Freedman
38
140
0
13 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
41
389
0
13 Feb 2018
Attention-Based Guided Structured Sparsity of Deep Neural Networks
Attention-Based Guided Structured Sparsity of Deep Neural Networks
A. Torfi
Rouzbeh A. Shirvani
Sobhan Soleymani
Nasser M. Nasrabadi
29
23
0
13 Feb 2018
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
DCFNet: Deep Neural Network with Decomposed Convolutional Filters
Qiang Qiu
Xiuyuan Cheng
Robert Calderbank
Guillermo Sapiro
41
69
0
12 Feb 2018
ClosNets: a Priori Sparse Topologies for Faster DNN Training
ClosNets: a Priori Sparse Topologies for Faster DNN Training
Mihailo Isakov
Michel A. Kinsy
CVBM
36
0
0
12 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space
  Encoding for Resource-Constrained Internet-of-Things Platforms
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
24
148
0
11 Feb 2018
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error
  Resilience for Energy Efficient Deep Neural Network Accelerators
ThUnderVolt: Enabling Aggressive Voltage Underscaling and Timing Error Resilience for Energy Efficient Deep Neural Network Accelerators
Jeff Zhang
Kartheek Rangineni
Zahra Ghodsi
S. Garg
36
118
0
11 Feb 2018
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic
  Array Based Neural Network Accelerator
Analyzing and Mitigating the Impact of Permanent Faults on a Systolic Array Based Neural Network Accelerator
Jeff Zhang
Tianyu Gu
K. Basu
S. Garg
14
134
0
11 Feb 2018
The Need for Speed of AI Applications: Performance Comparison of Native
  vs. Browser-based Algorithm Implementations
The Need for Speed of AI Applications: Performance Comparison of Native vs. Browser-based Algorithm Implementations
Bernd Malle
Nicola Giuliani
Peter Kieseberg
Andreas Holzinger
13
8
0
11 Feb 2018
On the Universal Approximability and Complexity Bounds of Quantized ReLU
  Neural Networks
On the Universal Approximability and Complexity Bounds of Quantized ReLU Neural Networks
Yukun Ding
Jinglan Liu
Jinjun Xiong
Yiyu Shi
MQ
37
21
0
10 Feb 2018
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
AMC: AutoML for Model Compression and Acceleration on Mobile Devices
Yihui He
Ji Lin
Zhijian Liu
Hanrui Wang
Li Li
Song Han
35
1,343
0
10 Feb 2018
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary
  Deep Intelligence
Nature vs. Nurture: The Role of Environmental Resources in Evolutionary Deep Intelligence
A. Chung
Paul Fieguth
A. Wong
24
1
0
09 Feb 2018
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Going Deeper in Spiking Neural Networks: VGG and Residual Architectures
Abhronil Sengupta
Yuting Ye
Robert Y. Wang
Chiao Liu
Kaushik Roy
38
989
0
07 Feb 2018
Effective Quantization Approaches for Recurrent Neural Networks
Effective Quantization Approaches for Recurrent Neural Networks
Md. Zahangir Alom
A. Moody
N. Maruyama
B. Van Essen
T. Taha
MQ
8
33
0
07 Feb 2018
CryptoRec: Privacy-preserving Recommendation as a Service
CryptoRec: Privacy-preserving Recommendation as a Service
Jun Wang
Afonso Arriaga
Qiang Tang
Peter Y. A. Ryan
13
3
0
07 Feb 2018
Universal Deep Neural Network Compression
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
86
86
0
07 Feb 2018
Digital Watermarking for Deep Neural Networks
Digital Watermarking for Deep Neural Networks
Yuki Nagai
Yusuke Uchida
S. Sakazawa
Shiníchi Satoh
WIGM
31
144
0
06 Feb 2018
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT
  Devices
Musical Chair: Efficient Real-Time Recognition Using Collaborative IoT Devices
Ramyad Hadidi
Jiashen Cao
M. Woodward
Michael S. Ryoo
Hyesoon Kim
22
34
0
05 Feb 2018
Learning Compact Neural Networks with Regularization
Learning Compact Neural Networks with Regularization
Samet Oymak
MLT
46
39
0
05 Feb 2018
Previous
123...626364...676869
Next