ResearchTrend.AI
  • Papers
  • Communities
  • Organizations
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
The Loss Surface of XOR Artificial Neural Networks
The Loss Surface of XOR Artificial Neural Networks
D. Mehta
Xiaojun Zhao
Edgar A. Bernal
D. Wales
167
19
0
06 Apr 2018
Structured Evolution with Compact Architectures for Scalable Policy
  Optimization
Structured Evolution with Compact Architectures for Scalable Policy Optimization
K. Choromanski
Mark Rowland
Vikas Sindhwani
Richard Turner
Adrian Weller
118
149
0
06 Apr 2018
Building Efficient CNN Architecture for Offline Handwritten Chinese
  Character Recognition
Building Efficient CNN Architecture for Offline Handwritten Chinese Character Recognition
Zhiyuan Li
Nanjun Teng
Min Jin
Huaxiang Lu
39
52
0
04 Apr 2018
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep
  Learning Models
DeepSigns: A Generic Watermarking Framework for IP Protection of Deep Learning Models
B. Rouhani
Huili Chen
F. Koushanfar
130
48
0
02 Apr 2018
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural
  Networks: FPGAs and ASICs
Structured Weight Matrices-Based Hardware Accelerators in Deep Neural Networks: FPGAs and ASICs
Caiwen Ding
Ao Ren
Geng Yuan
Xiaolong Ma
Jiayu Li
Ning Liu
Bo Yuan
Yanzhi Wang
77
23
0
28 Mar 2018
Adversarial Network Compression
Adversarial Network Compression
Vasileios Belagiannis
Azade Farshad
Fabio Galasso
GANAAML
69
58
0
28 Mar 2018
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural
  Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and
  Stacked Filters Stationary Flow
FPGA Implementations of 3D-SIMD Processor Architecture for Deep Neural Networks Using Relative Indexed Compressed Sparse Filter Encoding Format and Stacked Filters Stationary Flow
Yuechao Gao
Nianhong Liu
Shenmin Zhang
98
1
0
28 Mar 2018
Incremental Training of Deep Convolutional Neural Networks
Incremental Training of Deep Convolutional Neural Networks
R. Istrate
A. Malossi
C. Bekas
Dimitrios S. Nikolopoulos
CLL
68
21
0
27 Mar 2018
Diagonalwise Refactorization: An Efficient Training Method for Depthwise
  Convolutions
Diagonalwise Refactorization: An Efficient Training Method for Depthwise Convolutions
Zheng Qin
Zhaoning Zhang
Dongsheng Li
Yiming Zhang
Yuxing Peng
62
28
0
27 Mar 2018
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
Face Recognition with Hybrid Efficient Convolution Algorithms on FPGAs
Chuanhao Zhuge
Xinheng Liu
Xiaofan Zhang
S. Gummadi
Jinjun Xiong
Deming Chen
CVBM
64
36
0
23 Mar 2018
Iterative Low-Rank Approximation for CNN Compression
Iterative Low-Rank Approximation for CNN Compression
Maksym Kholiavchenko
36
9
0
23 Mar 2018
SqueezeNext: Hardware-Aware Neural Network Design
SqueezeNext: Hardware-Aware Neural Network Design
A. Gholami
K. Kwon
Bichen Wu
Zizheng Tai
Xiangyu Yue
Peter H. Jin
Sicheng Zhao
Kurt Keutzer
69
300
0
23 Mar 2018
Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual
  Network
Fast, Accurate, and Lightweight Super-Resolution with Cascading Residual Network
Namhyuk Ahn
Byungkon Kang
Kyung-ah Sohn
SupR
123
1,131
0
23 Mar 2018
Design Principles for Sparse Matrix Multiplication on the GPU
Design Principles for Sparse Matrix Multiplication on the GPU
Carl Yang
A. Buluç
John Douglas Owens
61
109
0
22 Mar 2018
Task dependent Deep LDA pruning of neural networks
Task dependent Deep LDA pruning of neural networks
Qing Tian
Tal Arbel
James J. Clark
31
0
0
21 Mar 2018
Efficient Recurrent Neural Networks using Structured Matrices in FPGAs
Efficient Recurrent Neural Networks using Structured Matrices in FPGAs
Zhe Li
Shuo Wang
Caiwen Ding
Qinru Qiu
Yanzhi Wang
Yun Liang
GNN
41
21
0
20 Mar 2018
Local Binary Pattern Networks
Local Binary Pattern Networks
Jeng-Hau Lin
Yunfan Yang
Rajesh K. Gupta
Zhuowen Tu
MQ
50
13
0
19 Mar 2018
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic
  Segmentation
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
Sachin Mehta
Mohammad Rastegari
A. Caspi
Linda G. Shapiro
Hannaneh Hajishirzi
SSeg
156
784
0
19 Mar 2018
Constrained Deep Learning using Conditional Gradient and Applications in
  Computer Vision
Constrained Deep Learning using Conditional Gradient and Applications in Computer Vision
Sathya Ravi
Tuan Dinh
Vishnu Suresh Lokhande
Vikas Singh
AI4CE
71
22
0
17 Mar 2018
EVA$^2$: Exploiting Temporal Redundancy in Live Computer Vision
EVA2^22: Exploiting Temporal Redundancy in Live Computer Vision
Mark Buckler
Philip Bedoukian
Suren Jayasuriya
Adrian Sampson
126
79
0
16 Mar 2018
TBD: Benchmarking and Analyzing Deep Neural Network Training
TBD: Benchmarking and Analyzing Deep Neural Network Training
Hongyu Zhu
Mohamed Akrout
Bojian Zheng
Andrew Pelegris
Amar Phanishayee
Bianca Schroeder
Gennady Pekhimenko
103
81
0
16 Mar 2018
Efficient Hardware Realization of Convolutional Neural Networks using
  Intra-Kernel Regular Pruning
Efficient Hardware Realization of Convolutional Neural Networks using Intra-Kernel Regular Pruning
Maurice Yang
Mahmoud Faraj
Assem Hussein
V. Gaudet
CVBM
69
12
0
15 Mar 2018
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey
  and Future Directions
Toolflows for Mapping Convolutional Neural Networks on FPGAs: A Survey and Future Directions
Stylianos I. Venieris
Alexandros Kouris
C. Bouganis
76
185
0
15 Mar 2018
Exploring Linear Relationship in Feature Map Subspace for ConvNets
  Compression
Exploring Linear Relationship in Feature Map Subspace for ConvNets Compression
Dong Wang
Lei Zhou
Xueni Zhang
Xiao Bai
Jun Zhou
80
47
0
15 Mar 2018
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques
  on FPGAs
C-LSTM: Enabling Efficient LSTM using Structured Compression Techniques on FPGAs
Shuo Wang
Zhe Li
Caiwen Ding
Bo Yuan
Yanzhi Wang
Qinru Qiu
Yun Liang
61
197
0
14 Mar 2018
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
LCANet: End-to-End Lipreading with Cascaded Attention-CTC
Kai Xu
Dawei Li
N. Cassimatis
Xiaolong Wang
86
97
0
13 Mar 2018
Quantization of Fully Convolutional Networks for Accurate Biomedical
  Image Segmentation
Quantization of Fully Convolutional Networks for Accurate Biomedical Image Segmentation
Xiaowei Xu
Q. Lu
Yu Hu
Lin Yang
X. S. Hu
Benlin Liu
Yiyu Shi
MedIm
84
85
0
13 Mar 2018
FeTa: A DCA Pruning Algorithm with Generalization Error Guarantees
FeTa: A DCA Pruning Algorithm with Generalization Error Guarantees
Konstantinos Pitas
Mike Davies
P. Vandergheynst
27
2
0
12 Mar 2018
ShuffleSeg: Real-time Semantic Segmentation Network
ShuffleSeg: Real-time Semantic Segmentation Network
M. Gamal
Mennatullah Siam
Moemen Abdel-Razek
SSeg
68
60
0
10 Mar 2018
Bit-Tactical: Exploiting Ineffectual Computations in Convolutional
  Neural Networks: Which, Why, and How
Bit-Tactical: Exploiting Ineffectual Computations in Convolutional Neural Networks: Which, Why, and How
A. Delmas
Patrick Judd
Dylan Malone Stuart
Zissis Poulos
Mostafa Mahmoud
Sayeh Sharify
Milos Nikolic
Andreas Moshovos
59
24
0
09 Mar 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle
Michael Carbin
445
3,497
0
09 Mar 2018
High-Accuracy Low-Precision Training
High-Accuracy Low-Precision Training
Christopher De Sa
Megan Leszczynski
Jian Zhang
Alana Marzoev
Christopher R. Aberger
K. Olukotun
Christopher Ré
94
109
0
09 Mar 2018
Exponential Discriminative Metric Embedding in Deep Learning
Exponential Discriminative Metric Embedding in Deep Learning
Bowen Wu
Zhangling Chen
Jun Wang
Hua-Ming Wu
73
11
0
07 Mar 2018
Learning SMaLL Predictors
Learning SMaLL Predictors
Vikas Garg
O. Dekel
Lin Xiao
76
3
0
06 Mar 2018
Personalized Exposure Control Using Adaptive Metering and Reinforcement
  Learning
Personalized Exposure Control Using Adaptive Metering and Reinforcement Learning
Huan Yang
Baoyuan Wang
Noranart Vesdapunt
Minyi Guo
S. B. Kang
62
22
0
06 Mar 2018
Deep Neural Network Compression with Single and Multiple Level
  Quantization
Deep Neural Network Compression with Single and Multiple Level Quantization
Yuhui Xu
Yongzhuang Wang
Aojun Zhou
Weiyao Lin
H. Xiong
MQ
70
115
0
06 Mar 2018
Stochastic Activation Pruning for Robust Adversarial Defense
Stochastic Activation Pruning for Robust Adversarial Defense
Guneet Singh Dhillon
Kamyar Azizzadenesheli
Zachary Chase Lipton
Jeremy Bernstein
Jean Kossaifi
Aran Khanna
Anima Anandkumar
AAML
107
548
0
05 Mar 2018
An Optimal Control Approach to Deep Learning and Applications to
  Discrete-Weight Neural Networks
An Optimal Control Approach to Deep Learning and Applications to Discrete-Weight Neural Networks
Qianxiao Li
Shuji Hao
103
76
0
04 Mar 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning
  Approaches
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
133
883
0
03 Mar 2018
Scalar Quantization as Sparse Least Square Optimization
Scalar Quantization as Sparse Least Square Optimization
Chen Wang
Xiaomei Yang
Shaomin Fei
Kai Zhou
Xiaofeng Gong
Miao Du
Ruisen Luo
MQ
40
3
0
01 Mar 2018
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Learning Sparse Structured Ensembles with SG-MCMC and Network Pruning
Yichi Zhang
Zhijian Ou
64
0
0
01 Mar 2018
Compressing Neural Networks using the Variational Information Bottleneck
Compressing Neural Networks using the Variational Information Bottleneck
Bin Dai
Chen Zhu
David Wipf
MLT
72
182
0
28 Feb 2018
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs
Xuhao Chen
109
25
0
28 Feb 2018
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse
  Coding
Frank-Wolfe Network: An Interpretable Deep Structure for Non-Sparse Coding
Dong Liu
Ke Sun
Zhangyang Wang
Runsheng Liu
Zhengjun Zha
107
12
0
28 Feb 2018
Recurrent Residual Module for Fast Inference in Videos
Recurrent Residual Module for Fast Inference in Videos
Bowen Pan
Wuwei Lin
Xiaolin Fang
Chaoqin Huang
Bolei Zhou
Cewu Lu
ObjD
94
34
0
27 Feb 2018
Demystifying Parallel and Distributed Deep Learning: An In-Depth
  Concurrency Analysis
Demystifying Parallel and Distributed Deep Learning: An In-Depth Concurrency Analysis
Tal Ben-Nun
Torsten Hoefler
GNN
91
713
0
26 Feb 2018
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge
  Intelligence
PBGen: Partial Binarization of Deconvolution-Based Generators for Edge Intelligence
Jinglan Liu
Jiaxin Zhang
Yukun Ding
Xiaowei Xu
Meng Jiang
Yiyu Shi
103
4
0
26 Feb 2018
Wide Compression: Tensor Ring Nets
Wide Compression: Tensor Ring Nets
Wenqi Wang
Yifan Sun
Brian Eriksson
Wenlin Wang
Vaneet Aggarwal
69
171
0
25 Feb 2018
Loss-aware Weight Quantization of Deep Networks
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
111
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for
  each weight
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
96
71
0
23 Feb 2018
Previous
123...626364...686970
Next