ResearchTrend.AI
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
SCNN: An Accelerator for Compressed-sparse Convolutional Neural Networks
A. Parashar
Minsoo Rhu
Anurag Mukkara
A. Puglielli
Rangharajan Venkatesan
Brucek Khailany
J. Emer
S. Keckler
W. Dally
35
1,115
0
23 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
52
984
0
22 May 2017
Structural Compression of Convolutional Neural Networks
R. Abbasi-Asl
Bin-Xia Yu
33
16
0
20 May 2017
Structured Bayesian Pruning via Log-Normal Multiplicative Noise
Kirill Neklyudov
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
22
188
0
20 May 2017
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
35
75
0
19 May 2017
Espresso: Efficient Forward Propagation for BCNNs
Fabrizio Pedersoli
George Tzanetakis
Andrea Tagliasacchi
MQ
21
13
0
19 May 2017
Building effective deep neural network architectures one feature at a time
Martin Mundt
Tobias Weis
K. Konda
Visvanathan Ramesh
27
1
0
18 May 2017
Design of a Very Compact CNN Classifier for Online Handwritten Chinese Character Recognition Using DropWeight and Global Pooling
Xuefeng Xiao
Yafeng Yang
Tasweer Ahmad
Lianwen Jin
Tianhai Chang
34
21
0
15 May 2017
Incremental Learning Through Deep Adaptation
Amir Rosenfeld
John K. Tsotsos
CLL
19
276
0
11 May 2017
Sharp Models on Dull Hardware: Fast and Accurate Neural Machine Translation Decoding on the CPU
Jacob Devlin
29
36
0
04 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
33
176
0
03 May 2017
Image reconstruction by domain transform manifold learning
Bo Zhu
Jeremiah Zhe Liu
Bruce Rosen
Matthew S. Rosen
34
1,518
0
28 Apr 2017
ICNet for Real-Time Semantic Segmentation on High-Resolution Images
Hengshuang Zhao
Xiaojuan Qi
Xiaoyong Shen
Jianping Shi
Jiaya Jia
SSeg
34
1,403
0
27 Apr 2017
Speeding up Convolutional Neural Networks By Exploiting the Sparsity of Rectifier Units
Shaoshuai Shi
Xuming Hu
29
43
0
25 Apr 2017
Accurate Optical Flow via Direct Cost Volume Processing
Jia Xu
René Ranftl
V. Koltun
21
238
0
24 Apr 2017
A Review on Deep Learning Techniques Applied to Semantic Segmentation
Alberto Garcia-Garcia
Sergio Orts
Sergiu Oprea
Victor Villena-Martinez
Jose Garcia-Rodriguez
3DV
SSeg
34
1,270
0
22 Apr 2017
Exploring Sparsity in Recurrent Neural Networks
Sharan Narang
Erich Elsen
G. Diamos
Shubho Sengupta
21
308
0
17 Apr 2017
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
950
20,613
0
17 Apr 2017
Enabling Embedded Inference Engine with ARM Compute Library: A Case Study
Dawei Sun
Shaoshan Liu
J. Gaudiot
8
13
0
12 Apr 2017
DyVEDeep: Dynamic Variable Effort Deep Neural Networks
Sanjay Ganapathy
Swagath Venkataramani
Balaraman Ravindran
A. Raghunathan
27
8
0
04 Apr 2017
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
E. Agustsson
Fabian Mentzer
Michael Tschannen
Lukas Cavigelli
Radu Timofte
Luca Benini
Luc Van Gool
MQ
24
480
0
03 Apr 2017
Multi-Scale Dense Networks for Resource Efficient Image Classification
Gao Huang
Danlu Chen
Tianhong Li
Felix Wu
Laurens van der Maaten
Kilian Q. Weinberger
VLM
24
137
0
29 Mar 2017
Coordinating Filters for Faster Deep Neural Networks
W. Wen
Cong Xu
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
16
138
0
28 Mar 2017
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML
3DV
59
2,996
0
27 Mar 2017
More is Less: A More Complicated Network with Less Inference Complexity
Xuanyi Dong
Junshi Huang
Yi Yang
Shuicheng Yan
26
288
0
25 Mar 2017
Quality Resilient Deep Neural Networks
Samuel F. Dodge
Lina Karam
OOD
13
46
0
23 Mar 2017
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
Chunpeng Wu
W. Wen
Tariq Afzal
Yongmei Zhang
Yiran Chen
Hai Helen Li
32
46
0
12 Mar 2017
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations
Liangzhen Lai
Naveen Suda
Vikas Chandra
MQ
33
85
0
08 Mar 2017
NoScope: Optimizing Neural Network Queries over Video at Scale
Daniel Kang
John Emmons
Firas Abuzaid
Peter Bailis
Matei A. Zaharia
29
205
0
07 Mar 2017
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
Liang Zhao
Siyu Liao
Yanzhi Wang
Zhe Li
Jian Tang
Victor Pan
Bo Yuan
33
61
0
01 Mar 2017
ShaResNet: reducing residual network parameter number by sharing weights
Alexandre Boulch
32
26
0
28 Feb 2017
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
93
3,650
0
28 Feb 2017
Memory-Efficient Global Refinement of Decision-Tree Ensembles and its Application to Face Alignment
Nenad Markuš
Ivan Gogić
Igor S. Pandzic
Jörgen Ahlberg
CVBM
26
1
0
27 Feb 2017
Adaptive Ensemble Prediction for Deep Neural Networks based on Confidence Level
H. Inoue
UQCV
FedML
16
1
0
27 Feb 2017
Low-Precision Batch-Normalized Activations
Benjamin Graham
MQ
27
9
0
27 Feb 2017
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
32
34
0
27 Feb 2017
Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition
Xuefeng Xiao
Lianwen Jin
Yafeng Yang
Weixin Yang
Jun Sun
Tianhai Chang
21
153
0
26 Feb 2017
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent
Fengan Li
Lingjiao Chen
Yijing Zeng
Arun Kumar
Jeffrey F. Naughton
J. Patel
Xi Wu
26
19
0
22 Feb 2017
The Power of Sparsity in Convolutional Neural Networks
Soravit Changpinyo
Mark Sandler
A. Zhmoginov
22
132
0
21 Feb 2017
Soft Weight-Sharing for Neural Network Compression
Karen Ullrich
Edward Meeds
Max Welling
37
412
0
13 Feb 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
337
1,049
0
10 Feb 2017
Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
Ye Zhang
Matthew Lease
Byron C. Wallace
21
15
0
08 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
50
503
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL
VLM
106
1,505
0
25 Jan 2017
Variational Dropout Sparsifies Deep Neural Networks
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
17
820
0
19 Jan 2017
Compression of Deep Neural Networks for Image Instance Retrieval
V. Chandrasekhar
Jie Lin
Q. Liao
Olivier Morère
D. Shapiro
Lingyu Duan
Tomaso Poggio
33
25
0
18 Jan 2017
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning
Aditya Sharma
Nikolas Wolfe
Bhiksha Raj
18
18
0
16 Jan 2017
Embedding Watermarks into Deep Neural Networks
Yusuke Uchida
Yuki Nagai
S. Sakazawa
Shin'ichi Satoh
62
598
0
15 Jan 2017
Scaling Binarized Neural Networks on Reconfigurable Logic
Nicholas J. Fraser
Yaman Umuroglu
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
20
57
0
12 Jan 2017
QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures
Tapabrata Ghosh
19
6
0
09 Jan 2017