Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
arXiv:1510.00149

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,448 papers shown
Deep Rewiring: Training very sparse deep networks
G. Bellec
David Kappel
Wolfgang Maass
Robert Legenstein
BDL
29
275
0
14 Nov 2017
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
42
9
0
13 Nov 2017
Weightless: Lossy Weight Encoding For Deep Neural Network Compression
Brandon Reagen
Udit Gupta
Bob Adolf
Michael Mitzenmacher
Alexander M. Rush
Gu-Yeon Wei
David Brooks
27
38
0
13 Nov 2017
CT-SRCNN: Cascade Trained and Trimmed Deep Convolutional Neural Networks for Image Super Resolution
Haoyu Ren
Mostafa El-Khamy
Jungwon Lee
SupR
33
27
0
11 Nov 2017
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations
Ting Chen
Martin Renqiang Min
Yizhou Sun
19
10
0
08 Nov 2017
Revealing structure components of the retina by deep learning networks
Qianyu Yan
Zhaofei Yu
Feng Chen
Jian K. Liu
FAtt
16
7
0
08 Nov 2017
Block-Sparse Recurrent Neural Networks
Sharan Narang
Eric Undersander
G. Diamos
19
136
0
08 Nov 2017
Compression-aware Training of Deep Networks
J. Álvarez
Mathieu Salzmann
21
172
0
07 Nov 2017
Moonshine: Distilling with Cheap Convolutions
Elliot J. Crowley
Gavia Gray
Amos Storkey
33
120
0
07 Nov 2017
Interpreting Convolutional Neural Networks Through Compression
R. Abbasi-Asl
Bin-Xia Yu
FAtt
19
21
0
07 Nov 2017
SparCE: Sparsity aware General Purpose Core Extensions to Accelerate Deep Neural Networks
Sanchari Sen
Shubham Jain
Swagath Venkataramani
A. Raghunathan
24
30
0
07 Nov 2017
Characterizing Sparse Connectivity Patterns in Neural Networks
Sourya Dey
Kuan-Wen Huang
Peter A. Beerel
K. Chugg
24
11
0
06 Nov 2017
Neural Speed Reading via Skim-RNN
Minjoon Seo
Sewon Min
Ali Farhadi
Hannaneh Hajishirzi
42
79
0
06 Nov 2017
Accelerating Training of Deep Neural Networks via Sparse Edge Processing
Sourya Dey
Yinan Shao
K. Chugg
Peter A. Beerel
38
16
0
03 Nov 2017
ReBNet: Residual Binarized Neural Network
M. Ghasemzadeh
Mohammad Samragh
F. Koushanfar
MQ
30
4
0
03 Nov 2017
SparseNN: An Energy-Efficient Neural Network Accelerator Exploiting Input and Output Sparsity
Jingyang Zhu
Jingbo Jiang
Xizi Chen
Chi-Ying Tsui
23
36
0
03 Nov 2017
Compressing Word Embeddings via Deep Compositional Code Learning
Raphael Shu
Hideki Nakayama
40
129
0
03 Nov 2017
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
24
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
39
232
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
25
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
38
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
Mohammad Shoeybi
40
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
40
1,087
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
21
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
25
270
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
33
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
27
112
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
22
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
25
208
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
31
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
60
1,253
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
Moncef Gabbouj
18
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
24
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
36
505
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
31
41
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
35
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQ
UQCV
29
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris Kitani
26
170
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
24
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
25
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
35
140
0
15 Sep 2017
Supervising Unsupervised Learning
Vikas K. Garg
Adam Kalai
SSL
FedML
26
29
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
17
4
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
36
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
17
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
31
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
32
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
33
1,118
0
06 Sep 2017
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
44
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
17
322
0
02 Sep 2017