ResearchTrend.AI

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications
Andrew G. Howard
Menglong Zhu
Bo Chen
Dmitry Kalenichenko
Weijun Wang
Tobias Weyand
M. Andreetto
Hartwig Adam
3DH
1.2K
20,998
0
17 Apr 2017
Enabling Embedded Inference Engine with ARM Compute Library: A Case Study
Dawei Sun
Shaoshan Liu
J. Gaudiot
37
13
0
12 Apr 2017
DyVEDeep: Dynamic Variable Effort Deep Neural Networks
Sanjay Ganapathy
Swagath Venkataramani
Balaraman Ravindran
A. Raghunathan
51
8
0
04 Apr 2017
Soft-to-Hard Vector Quantization for End-to-End Learning Compressible Representations
E. Agustsson
Fabian Mentzer
Michael Tschannen
Lukas Cavigelli
Radu Timofte
Luca Benini
Luc Van Gool
MQ
113
484
0
03 Apr 2017
Multi-Scale Dense Networks for Resource Efficient Image Classification
Gao Huang
Danlu Chen
Tianhong Li
Felix Wu
Laurens van der Maaten
Kilian Q. Weinberger
VLM
73
140
0
29 Mar 2017
Coordinating Filters for Faster Deep Neural Networks
W. Wen
Cong Xu
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
66
138
0
28 Mar 2017
Efficient Processing of Deep Neural Networks: A Tutorial and Survey
Vivienne Sze
Yu-hsin Chen
Tien-Ju Yang
J. Emer
AAML, 3DV
130
3,039
0
27 Mar 2017
More is Less: A More Complicated Network with Less Inference Complexity
Xuanyi Dong
Junshi Huang
Yi Yang
Shuicheng Yan
88
288
0
25 Mar 2017
Quality Resilient Deep Neural Networks
Samuel F. Dodge
Lina Karam
OOD
70
46
0
23 Mar 2017
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation
Chunpeng Wu
W. Wen
Tariq Afzal
Yongmei Zhang
Yiran Chen
Hai Helen Li
89
46
0
12 Mar 2017
Deep Convolutional Neural Network Inference with Floating-point Weights and Fixed-point Activations
Liangzhen Lai
Naveen Suda
Vikas Chandra
MQ
76
85
0
08 Mar 2017
NoScope: Optimizing Neural Network Queries over Video at Scale
Daniel Kang
John Emmons
Firas Abuzaid
Peter Bailis
Matei A. Zaharia
135
211
0
07 Mar 2017
Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
Liang Zhao
Siyu Liao
Yanzhi Wang
Zhe Li
Jian Tang
Victor Pan
Bo Yuan
130
61
0
01 Mar 2017
ShaResNet: reducing residual network parameter number by sharing weights
Alexandre Boulch
107
26
0
28 Feb 2017
Billion-scale similarity search with GPUs
Jeff Johnson
Matthijs Douze
Hervé Jégou
469
3,759
0
28 Feb 2017
Memory-Efficient Global Refinement of Decision-Tree Ensembles and its Application to Face Alignment
Nenad Markuš
Ivan Gogić
Igor S. Pandzic
Jörgen Ahlberg
CVBM
74
1
0
27 Feb 2017
Adaptive Ensemble Prediction for Deep Neural Networks based on Confidence Level
H. Inoue
UQCV, FedML
24
1
0
27 Feb 2017
Low-Precision Batch-Normalized Activations
Benjamin Graham
MQ
79
9
0
27 Feb 2017
Fixed-point optimization of deep neural networks with adaptive step size retraining
Sungho Shin
Yoonho Boo
Wonyong Sung
MQ
111
34
0
27 Feb 2017
Building Fast and Compact Convolutional Neural Networks for Offline Handwritten Chinese Character Recognition
Xuefeng Xiao
Lianwen Jin
Yafeng Yang
Weixin Yang
Jun Sun
Tianhai Chang
69
155
0
26 Feb 2017
Tuple-oriented Compression for Large-scale Mini-batch Stochastic Gradient Descent
Fengan Li
Lingjiao Chen
Yijing Zeng
Arun Kumar
Jeffrey F. Naughton
J. Patel
Xi Wu
44
19
0
22 Feb 2017
The Power of Sparsity in Convolutional Neural Networks
Soravit Changpinyo
Mark Sandler
A. Zhmoginov
89
133
0
21 Feb 2017
Soft Weight-Sharing for Neural Network Compression
Karen Ullrich
Edward Meeds
Max Welling
178
421
0
13 Feb 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
415
1,057
0
10 Feb 2017
Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization
Ye Zhang
Matthew Lease
Byron C. Wallace
88
15
0
08 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
149
507
0
03 Feb 2017
Deep Reinforcement Learning: An Overview
Yuxi Li
OffRL, VLM
352
1,551
0
25 Jan 2017
Variational Dropout Sparsifies Deep Neural Networks
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
201
831
0
19 Jan 2017
Compression of Deep Neural Networks for Image Instance Retrieval
V. Chandrasekhar
Jie Lin
Q. Liao
Olivier Morère
D. Shapiro
Lingyu Duan
Tomaso Poggio
60
25
0
18 Jan 2017
The Incredible Shrinking Neural Network: New Perspectives on Learning Representations Through The Lens of Pruning
Aditya Sharma
Nikolas Wolfe
Bhiksha Raj
59
18
0
16 Jan 2017
Embedding Watermarks into Deep Neural Networks
Yusuke Uchida
Yuki Nagai
S. Sakazawa
Shin'ichi Satoh
140
616
0
15 Jan 2017
Scaling Binarized Neural Networks on Reconfigurable Logic
Nicholas J. Fraser
Yaman Umuroglu
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
92
57
0
12 Jan 2017
QuickNet: Maximizing Efficiency and Efficacy in Deep Architectures
Tapabrata Ghosh
44
6
0
09 Jan 2017
Hardware for Machine Learning: Challenges and Opportunities
Vivienne Sze
Yu-hsin Chen
Joel S. Emer
Amr Suleiman
Zhengdong Zhang
128
78
0
22 Dec 2016
Wide-Slice Residual Networks for Food Recognition
N. Martinel
G. Foresti
C. Micheloni
128
202
0
20 Dec 2016
Exploring the Design Space of Deep Convolutional Neural Networks at Large Scale
F. Iandola
3DV
48
19
0
20 Dec 2016
Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection
Penghang Yin
Shuai Zhang
Y. Qi
Jack Xin
MQ
174
42
0
19 Dec 2016
Delta Networks for Optimized Recurrent Network Computation
Daniel Neil
Junhaeng Lee
T. Delbruck
Shih-Chii Liu
111
66
0
16 Dec 2016
FastText.zip: Compressing text classification models
Armand Joulin
Edouard Grave
Piotr Bojanowski
Matthijs Douze
Hervé Jégou
Tomas Mikolov
MQ
143
1,216
0
12 Dec 2016
Learning in the Machine: Random Backpropagation and the Deep Learning Channel
Pierre Baldi
Peter Sadowski
Zhiqin Lu
AAML
82
16
0
08 Dec 2016
Filter sharing: Efficient learning of parameters for volumetric convolutions
Rahul Venkataramani
S. Thiruvenkadam
Prasad Sudhakar
Hariharan Ravishankar
V. Vaidya
3DPC, MedIm
36
0
0
08 Dec 2016
Spatially Adaptive Computation Time for Residual Networks
Michael Figurnov
Maxwell D. Collins
Yukun Zhu
Li Zhang
Jonathan Huang
Dmitry Vetrov
Ruslan Salakhutdinov
77
351
0
07 Dec 2016
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
98
195
0
05 Dec 2016
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
203
1,035
0
04 Dec 2016
FINN: A Framework for Fast, Scalable Binarized Neural Network Inference
Yaman Umuroglu
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
Magnus Jahre
K. Vissers
MQ
115
1,005
0
01 Dec 2016
ESE: Efficient Speech Recognition Engine with Sparse LSTM on FPGA
Song Han
Junlong Kang
Huizi Mao
Yiming Hu
Xin Li
...
Hong Luo
Song Yao
Yu Wang
Huazhong Yang
W. Dally
82
630
0
01 Dec 2016
Effective Quantization Methods for Recurrent Neural Networks
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
MQ
88
76
0
30 Nov 2016
Deep Cuboid Detection: Beyond 2D Bounding Boxes
Debidatta Dwibedi
Tomasz Malisiewicz
Vijay Badrinarayanan
Andrew Rabinovich
57
18
0
30 Nov 2016
Capacity and Trainability in Recurrent Neural Networks
Jasmine Collins
Jascha Narain Sohl-Dickstein
David Sussillo
123
205
0
29 Nov 2016
LCNN: Lookup-based Convolutional Neural Network
Hessam Bagherinezhad
Mohammad Rastegari
Ali Farhadi
77
90
0
20 Nov 2016