Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
A GAN-based Tunable Image Compression System
Lirong Wu
Kejie Huang
Haibin Shen
GAN
83
43
0
18 Jan 2020
Harmonic Convolutional Networks based on Discrete Cosine Transform
Matej Ulicny
V. Krylov
Rozenn Dahyot
65
36
0
18 Jan 2020
Driver Drowsiness Detection Model Using Convolutional Neural Networks Techniques for Android Application
Rateb Jabbar
Mohammed Shinoy
Mohamed Kharbeche
K. Al-Khalifa
M. Krichen
Kamel Barkaoui
37
121
0
17 Jan 2020
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
Joseph Bethge
Christian Bartz
Haojin Yang
Ying-Cong Chen
Christoph Meinel
MQ
89
91
0
16 Jan 2020
A "Network Pruning Network" Approach to Deep Model Compression
Vinay Kumar Verma
Pravendra Singh
Vinay P. Namboodiri
Piyush Rai
3DPC VLM
58
8
0
15 Jan 2020
Understanding Generalization in Deep Learning via Tensor Methods
Jingling Li
Yanchao Sun
Jiahao Su
Taiji Suzuki
Furong Huang
131
28
0
14 Jan 2020
On Iterative Neural Network Pruning, Reinitialization, and the Similarity of Masks
Michela Paganini
Jessica Zosa Forde
86
19
0
14 Jan 2020
PoPS: Policy Pruning and Shrinking for Deep Reinforcement Learning
Dor Livne
Kobi Cohen
64
52
0
14 Jan 2020
Reliable and Energy Efficient MLC STT-RAM Buffer for CNN Accelerators
Masoomeh Jasemi
S. Hessabi
N. Bagherzadeh
27
9
0
14 Jan 2020
Quantisation and Pruning for Neural Network Compression and Regularisation
Kimessha Paupamah
Steven D. James
Richard Klein
38
23
0
14 Jan 2020
Block-wise Dynamic Sparseness
Amir Hadifar
Johannes Deleu
Chris Develder
Thomas Demeester
34
2
0
14 Jan 2020
AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search
Daoyuan Chen
Yaliang Li
Minghui Qiu
Zhen Wang
Bofang Li
Bolin Ding
Hongbo Deng
Jun Huang
Wei Lin
Jingren Zhou
MQ
108
104
0
13 Jan 2020
Modeling of Pruning Techniques for Deep Neural Networks Simplification
Morteza Mousa Pasandi
M. Hajabdollahi
N. Karimi
S. Samavi
3DPC
80
20
0
13 Jan 2020
Functional Error Correction for Robust Neural Networks
Kunping Huang
P. Siegel
Anxiao Jiang
31
26
0
12 Jan 2020
Embedding Compression with Isotropic Iterative Quantization
Siyu Liao
Jie Chen
Yanzhi Wang
Qinru Qiu
Bo Yuan
MQ
64
13
0
11 Jan 2020
Intelligence, physics and information -- the tradeoff between accuracy and simplicity in machine learning
Tailin Wu
139
1
0
11 Jan 2020
ReluDiff: Differential Verification of Deep Neural Networks
Brandon Paulsen
Jingbo Wang
Chao Wang
169
54
0
10 Jan 2020
Adaptive Anomaly Detection for IoT Data in Hierarchical Edge Computing
Mao V. Ngo
H. Chaouchi
Tie-Mei Luo
Tony Q.S. Quek
60
17
0
10 Jan 2020
Backdoor Attacks against Transfer Learning with Pre-trained Deep Learning Models
Shuo Wang
Surya Nepal
Carsten Rudolph
M. Grobler
Shangyu Chen
Tianle Chen
AAML
67
105
0
10 Jan 2020
Campfire: Compressible, Regularization-Free, Structured Sparse Training for Hardware Accelerators
Noah Gamboa
Kais Kudrolli
Anand Dhoot
A. Pedram
33
10
0
09 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
82
32
0
09 Jan 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
92
51
0
07 Jan 2020
Sparse Weight Activation Training
Md Aamir Raihan
Tor M. Aamodt
155
73
0
07 Jan 2020
RPR: Random Partition Relaxation for Training; Binary and Ternary Weight Neural Networks
Lukas Cavigelli
Luca Benini
MQ
70
9
0
04 Jan 2020
Discrimination-aware Network Pruning for Deep Model Compression
Jing Liu
Bohan Zhuang
Zhuangwei Zhuang
Yong Guo
Junzhou Huang
Jin-Hui Zhu
Mingkui Tan
CVBM
95
121
0
04 Jan 2020
Fractional Skipping: Towards Finer-Grained Dynamic CNN Inference
Jianghao Shen
Y. Fu
Yue Wang
Pengfei Xu
Zhangyang Wang
Yingyan Lin
MQ
60
45
0
03 Jan 2020
ZeroQ: A Novel Zero Shot Quantization Framework
Yaohui Cai
Z. Yao
Zhen Dong
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
132
401
0
01 Jan 2020
PatDNN: Achieving Real-Time DNN Execution on Mobile Devices with Pattern-based Weight Pruning
Wei Niu
Xiaolong Ma
Sheng Lin
Shihao Wang
Xuehai Qian
Xinyu Lin
Yanzhi Wang
Bin Ren
MQ
128
229
0
01 Jan 2020
AdderNet: Do We Really Need Multiplications in Deep Learning?
Hanting Chen
Yunhe Wang
Chunjing Xu
Boxin Shi
Chao Xu
Qi Tian
Chang Xu
166
203
0
31 Dec 2019
RC-DARTS: Resource Constrained Differentiable Architecture Search
Xiaojie Jin
Jiang Wang
Joshua Slocum
Ming-Hsuan Yang
Shengyang Dai
Shuicheng Yan
Jiashi Feng
73
31
0
30 Dec 2019
Opportunities and Challenges of Deep Learning Methods for Electrocardiogram Data: A Systematic Review
Linda Qiao
Yuxi Zhou
Junyuan Shang
Cao Xiao
Jimeng Sun
100
128
0
28 Dec 2019
Pruning Deep Convolutional Neural Networks Architectures with Evolution Strategy
Francisco Erivaldo Fernandes Junior
Gary G. Yen
54
1
0
24 Dec 2019
Towards Efficient Training for Neural Network Quantization
Qing Jin
Linjie Yang
Zhenyu A. Liao
MQ
117
42
0
21 Dec 2019
DBP: Discrimination Based Block-Level Pruning for Deep Model Acceleration
Wenxiao Wang
Shuai Zhao
Minghao Chen
Jinming Hu
Deng Cai
Haifeng Liu
87
37
0
21 Dec 2019
EAST: Encoding-Aware Sparse Training for Deep Memory Compression of ConvNets
Matteo Grimaldi
Valentino Peluso
A. Calimera
MQ
34
2
0
20 Dec 2019
HiLLoC: Lossless Image Compression with Hierarchical Latent Variable Models
James Townsend
Thomas Bird
Julius Kunze
David Barber
BDL VLM
153
56
0
20 Dec 2019
Taxonomy and Evaluation of Structured Compression of Convolutional Neural Networks
Andrey Kuzmin
Markus Nagel
Saurabh Pitre
Sandeep Pendyam
Tijmen Blankevoort
Max Welling
83
27
0
20 Dec 2019
AtomNAS: Fine-Grained End-to-End Neural Architecture Search
Jieru Mei
Yingwei Li
Xiaochen Lian
Xiaojie Jin
Linjie Yang
Alan Yuille
Jianchao Yang
69
107
0
20 Dec 2019
FQ-Conv: Fully Quantized Convolution for Efficient and Accurate Inference
Bram-Ernst Verhoef
Nathan Laubeuf
S. Cosemans
P. Debacker
Ioannis A. Papistas
A. Mallik
D. Verkest
MQ
65
16
0
19 Dec 2019
Optimization for deep learning: theory and algorithms
Ruoyu Sun
ODL
137
169
0
19 Dec 2019
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
165
56
0
18 Dec 2019
Dreaming to Distill: Data-free Knowledge Transfer via DeepInversion
Hongxu Yin
Pavlo Molchanov
Zhizhong Li
J. Álvarez
Arun Mallya
Derek Hoiem
N. Jha
Jan Kautz
185
570
0
18 Dec 2019
TOCO: A Framework for Compressing Neural Network Models Based on Tolerance Analysis
Soroosh Khoram
J. Li
27
1
0
18 Dec 2019
Learning to Prevent Leakage: Privacy-Preserving Inference in the Mobile Cloud
Shuang Zhang
Liyao Xiang
Congcong Li
Yixuan Wang
Quanshi Zhang
Zeyu Liu
Yue Liu
FedML
61
1
0
18 Dec 2019
$\ell_0$ Regularized Structured Sparsity Convolutional Neural Networks
Kevin Bui
Fredrick Park
Shuai Zhang
Y. Qi
Jack Xin
40
0
0
17 Dec 2019
Mitigate Parasitic Resistance in Resistive Crossbar-based Convolutional Neural Networks
Fan Zhang
Miao Hu
38
20
0
17 Dec 2019
Joint Architecture and Knowledge Distillation in CNN for Chinese Text Recognition
Zirui Wang
Jun Du
45
0
0
17 Dec 2019
A flexible FPGA accelerator for convolutional neural networks
Kingshuk Majumder
Uday Bondhugula
57
5
0
16 Dec 2019
Towards Building a Real Time Mobile Device Bird Counting System Through Synthetic Data Training and Model Compression
Runde Yang
16
0
0
15 Dec 2019
WaLDORf: Wasteless Language-model Distillation On Reading-comprehension
J. Tian
A. Kreuzer
Pai-Hung Chen
Hans-Martin Will
VLM
62
3
0
13 Dec 2019