ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Efficient Inferencing of Compressed Deep Neural Networks
Efficient Inferencing of Compressed Deep Neural Networks
Dharma Teja Vooturi
Saurabh Goyal
Anamitra R. Choudhury
Yogish Sabharwal
Ashish Verma
42
6
0
01 Nov 2017
Towards Effective Low-bitwidth Convolutional Neural Networks
Towards Effective Low-bitwidth Convolutional Neural Networks
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
110
234
0
01 Nov 2017
Tensorizing Generative Adversarial Nets
Tensorizing Generative Adversarial Nets
Xingwei Cao
Xuyang Zhao
Qibin Zhao
GAN
56
9
0
30 Oct 2017
Knowledge Projection for Deep Neural Networks
Knowledge Projection for Deep Neural Networks
Zhi Zhang
G. Ning
Zhihai He
62
15
0
26 Oct 2017
Trace norm regularization and faster inference for embedded speech
  recognition RNNs
Trace norm regularization and faster inference for embedded speech recognition RNNs
Markus Kliegl
Siddharth Goyal
Kexin Zhao
Kavya Srinet
Mohammad Shoeybi
67
8
0
25 Oct 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
179
1,101
0
23 Oct 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
90
88
0
21 Oct 2017
Data-Free Knowledge Distillation for Deep Neural Networks
Data-Free Knowledge Distillation for Deep Neural Networks
Raphael Gontijo-Lopes
Stefano Fenu
Thad Starner
100
273
0
19 Oct 2017
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
TensorQuant - A Simulation Toolbox for Deep Neural Network Quantization
D. Loroch
Norbert Wehn
Franz-Josef Pfreundt
J. Keuper
MQ
57
23
0
13 Oct 2017
STDP Based Pruning of Connections and Weight Quantization in Spiking
  Neural Networks for Energy Efficient Recognition
STDP Based Pruning of Connections and Weight Quantization in Spiking Neural Networks for Energy Efficient Recognition
Nitin Rathi
Priyadarshini Panda
Kaushik Roy
83
114
0
12 Oct 2017
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Energy-efficient Amortized Inference with Cascaded Deep Classifiers
Jiaqi Guan
Yang Liu
Qiang Liu
Jian-wei Peng
68
33
0
10 Oct 2017
Artificial Neural Networks-Based Machine Learning for Wireless Networks:
  A Tutorial
Artificial Neural Networks-Based Machine Learning for Wireless Networks: A Tutorial
Mingzhe Chen
Ursula Challita
Walid Saad
Changchuan Yin
Mérouane Debbah
110
209
0
09 Oct 2017
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with
  Small Deep-Neural-Network Architectures
Keynote: Small Neural Nets Are Beautiful: Enabling Embedded Systems with Small Deep-Neural-Network Architectures
F. Iandola
Kurt Keutzer
77
37
0
07 Oct 2017
To prune, or not to prune: exploring the efficacy of pruning for model
  compression
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
213
1,289
0
05 Oct 2017
Improving Efficiency in Convolutional Neural Network with Multilinear
  Filters
Improving Efficiency in Convolutional Neural Network with Multilinear Filters
D. Tran
Alexandros Iosifidis
Moncef Gabbouj
60
40
0
28 Sep 2017
Connectivity Learning in Multi-Branch Networks
Connectivity Learning in Multi-Branch Networks
Karim Ahmed
Lorenzo Torresani
73
26
0
27 Sep 2017
Machine Learning Models that Remember Too Much
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
77
522
0
22 Sep 2017
Computation Error Analysis of Block Floating Point Arithmetic Oriented
  Convolution Neural Network Accelerator Design
Computation Error Analysis of Block Floating Point Arithmetic Oriented Convolution Neural Network Accelerator Design
Zhourui Song
Zhenyu Liu
Dongsheng Wang
57
42
0
22 Sep 2017
Structured Probabilistic Pruning for Convolutional Neural Network
  Acceleration
Structured Probabilistic Pruning for Convolutional Neural Network Acceleration
Huan Wang
Qiming Zhang
Yuehai Wang
Roland Hu
117
11
0
20 Sep 2017
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced
  Regularization in Ternary Networks
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks
Julian Faraone
Nicholas J. Fraser
Giulio Gambardella
Michaela Blott
Philip H. W. Leong
MQUQCV
62
12
0
19 Sep 2017
N2N Learning: Network to Network Compression via Policy Gradient
  Reinforcement Learning
N2N Learning: Network to Network Compression via Policy Gradient Reinforcement Learning
A. Ashok
Nicholas Rhinehart
Fares N. Beainy
Kris Kitani
98
171
0
18 Sep 2017
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage
  Requirement
Recursive Binary Neural Network Learning Model with 2.28b/Weight Storage Requirement
Tianchan Guan
Xiaoyang Zeng
Mingoo Seok
MQ
31
6
0
15 Sep 2017
A Streaming Accelerator for Deep Convolutional Neural Networks with
  Image and Feature Decomposition for Resource-limited System Applications
A Streaming Accelerator for Deep Convolutional Neural Networks with Image and Feature Decomposition for Resource-limited System Applications
Yuan Du
Li Du
Yilei Li
Junjie Su
Mau-Chung Frank Chang
36
6
0
15 Sep 2017
Learning Intrinsic Sparse Structures within Long Short-Term Memory
Learning Intrinsic Sparse Structures within Long Short-Term Memory
W. Wen
Yuxiong He
Samyam Rajbhandari
Minjia Zhang
Wenhan Wang
Fang Liu
Bin Hu
Yiran Chen
H. Li
MQ
141
142
0
15 Sep 2017
Supervising Unsupervised Learning
Supervising Unsupervised Learning
Vikas Garg
Adam Kalai
SSLFedML
51
30
0
14 Sep 2017
Binary-decomposed DCNN for accelerating computation and compressing
  model without retraining
Binary-decomposed DCNN for accelerating computation and compressing model without retraining
Ryuji Kamiya
Takayoshi Yamashita
Mitsuru Ambai
Ikuro Sato
Yuji Yamauchi
H. Fujiyoshi
MQ
26
5
0
14 Sep 2017
Flexible Network Binarization with Layer-wise Priority
He Wang
Yi Tian Xu
Bingbing Ni
Hongteng Xu
MQ
72
10
0
13 Sep 2017
Model Distillation with Knowledge Transfer from Face Classification to
  Alignment and Verification
Model Distillation with Knowledge Transfer from Face Classification to Alignment and Verification
Chong-Jun Wang
Xipeng Lan
Yang Zhang
CVBM
87
26
0
09 Sep 2017
Real-time convolutional networks for sonar image classification in
  low-power embedded systems
Real-time convolutional networks for sonar image classification in low-power embedded systems
Matias Valdenegro-Toro
42
10
0
07 Sep 2017
The Mating Rituals of Deep Neural Networks: Learning Compact Feature
  Representations through Sexual Evolutionary Synthesis
The Mating Rituals of Deep Neural Networks: Learning Compact Feature Representations through Sexual Evolutionary Synthesis
A. Chung
M. Shafiee
Paul Fieguth
A. Wong
34
4
0
07 Sep 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
121
1,153
0
06 Sep 2017
Domain-adaptive deep network compression
Domain-adaptive deep network compression
Marc Masana
Joost van de Weijer
Luis Herranz
Andrew D. Bagdanov
J. Álvarez
82
62
0
04 Sep 2017
Fast Image Processing with Fully-Convolutional Networks
Fast Image Processing with Fully-Convolutional Networks
Qifeng Chen
Jia Xu
V. Koltun
83
323
0
02 Sep 2017
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network
  Simulation Expansion and STDP Convergence Predictions
Continual One-Shot Learning of Hidden Spike-Patterns with Neural Network Simulation Expansion and STDP Convergence Predictions
Toby Lightheart
S. Grainger
Tien-Fu Lu
21
0
0
30 Aug 2017
Performance Guaranteed Network Acceleration via High-Order Residual
  Quantization
Performance Guaranteed Network Acceleration via High-Order Residual Quantization
Zefan Li
Bingbing Ni
Wenjun Zhang
Xiaokang Yang
Wen Gao
MQ
91
107
0
29 Aug 2017
CirCNN: Accelerating and Compressing Deep Neural Networks Using
  Block-CirculantWeight Matrices
CirCNN: Accelerating and Compressing Deep Neural Networks Using Block-CirculantWeight Matrices
Caiwen Ding
Siyu Liao
Yanzhi Wang
Zhe Li
Ning Liu
...
Yipeng Zhang
Jian Tang
Qinru Qiu
Xinyu Lin
Bo Yuan
GNN
73
260
0
29 Aug 2017
Deep Learning Sparse Ternary Projections for Compressed Sensing of
  Images
Deep Learning Sparse Ternary Projections for Compressed Sensing of Images
Duc Minh Nguyen
Evaggelia Tsiligianni
Nikos Deligiannis
37
27
0
28 Aug 2017
The Convergence of Machine Learning and Communications
The Convergence of Machine Learning and Communications
Wojciech Samek
S. Stańczak
Thomas Wiegand
AI4CE
46
29
0
28 Aug 2017
Learning Efficient Convolutional Networks through Network Slimming
Learning Efficient Convolutional Networks through Network Slimming
Zhuang Liu
Jianguo Li
Zhiqiang Shen
Gao Huang
Shoumeng Yan
Changshui Zhang
229
2,431
0
22 Aug 2017
Neural Networks Compression for Language Modeling
Neural Networks Compression for Language Modeling
Artem M. Grachev
D. Ignatov
Andrey V. Savchenko
72
30
0
20 Aug 2017
Deep Neural Network Capacity
Aosen Wang
Huan Zhou
Wenyao Xu
Xin Chen
20
4
0
16 Aug 2017
BitNet: Bit-Regularized Deep Neural Networks
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
77
10
0
16 Aug 2017
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile
  Devices
DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices
Dawei Li
Xiaolong Wang
Deguang Kong
81
99
0
16 Aug 2017
Enabling Massive Deep Neural Networks with the GraphBLAS
Enabling Massive Deep Neural Networks with the GraphBLAS
J. Kepner
Manoj Kumar
José Moreira
P. Pattnaik
M. Serrano
H. Tufo
GNN
98
33
0
09 Aug 2017
Prune the Convolutional Neural Networks with Sparse Shrink
Prune the Convolutional Neural Networks with Sparse Shrink
Xuzhao Li
Changsong Liu
CVBM
20
4
0
08 Aug 2017
Natural Language Processing with Small Feed-Forward Networks
Natural Language Processing with Small Feed-Forward Networks
Jan A. Botha
Emily Pitler
Ji Ma
A. Bakalov
Alexandru Salcianu
David J. Weiss
Ryan T. McDonald
Slav Petrov
HAI
75
38
0
01 Aug 2017
Streaming Architecture for Large-Scale Quantized Neural Networks on an
  FPGA-Based Dataflow Platform
Streaming Architecture for Large-Scale Quantized Neural Networks on an FPGA-Based Dataflow Platform
Chaim Baskin
Natan Liss
Evgenii Zheltonozhskii
A. Bronstein
A. Mendelson
GNNMQ
139
35
0
31 Jul 2017
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional
  Network with Bayesian Optimization
Fine-Pruning: Joint Fine-Tuning and Compression of a Convolutional Network with Bayesian Optimization
Frederick Tung
S. Muralidharan
Greg Mori
72
36
0
28 Jul 2017
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Learning Bag-of-Features Pooling for Deep Convolutional Neural Networks
Nikolaos Passalis
Anastasios Tefas
86
70
0
25 Jul 2017
Towards Evolutional Compression
Towards Evolutional Compression
Yunhe Wang
Chang Xu
Jiayan Qiu
Chao Xu
Dacheng Tao
63
14
0
25 Jul 2017
Previous
123...656667686970
Next