ResearchTrend.AI
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
arXiv:1510.00149

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
MoBiNet: A Mobile Binary Network for Image Classification
Hai T. Phan
Dang T. Huynh
Yihui He
Marios Savvides
Zhiqiang Shen
MQ
85
49
0
29 Jul 2019
Energy-Efficient Processing and Robust Wireless Cooperative Transmission for Edge Inference
Kai Yang
Yuanming Shi
Wei Yu
Z. Ding
46
42
0
29 Jul 2019
DeepCABAC: A Universal Compression Algorithm for Deep Neural Networks
Simon Wiedemann
H. Kirchhoffer
Stefan Matlage
Paul Haase
Arturo Marbán
...
Ahmed Osman
D. Marpe
H. Schwarz
Thomas Wiegand
Wojciech Samek
111
98
0
27 Jul 2019
Learning Instance-wise Sparsity for Accelerating Deep Models
Chuanjian Liu
Yunhe Wang
Kai Han
Chunjing Xu
Chang Xu
77
27
0
27 Jul 2019
Memory- and Communication-Aware Model Compression for Distributed Deep Learning Inference on IoT
Kartikeya Bhardwaj
Chingyi Lin
A. L. Sartor
R. Marculescu
GNN
66
54
0
26 Jul 2019
Exploiting the Redundancy in Convolutional Filters for Parameter Reduction
Kumara Kahatapitiya
Ranga Rodrigo
25
0
0
26 Jul 2019
Co-Evolutionary Compression for Unpaired Image Translation
Han Shu
Yunhe Wang
Xu Jia
Kai Han
Hanting Chen
Chunjing Xu
Qi Tian
Chang Xu
52
87
0
25 Jul 2019
Benchmarking TPU, GPU, and CPU Platforms for Deep Learning
Y. Wang
Gu-Yeon Wei
David Brooks
ELM VLM
104
278
0
24 Jul 2019
Distilled Siamese Networks for Visual Tracking
Jianbing Shen
Yuanpei Liu
Xingping Dong
Xiankai Lu
Fahad Shahbaz Khan
Guosheng Lin
110
104
0
24 Jul 2019
Recurrent Neural Networks: An Embedded Computing Perspective
Nesma M. Rezk
M. Purnaprajna
Tomas Nordstrom
Z. Ul-Abdin
129
82
0
23 Jul 2019
RRNet: Repetition-Reduction Network for Energy Efficient Decoder of Depth Estimation
Sangyun Oh
Hye-Jin S. Kim
Jongeun Lee
Junmo Kim
3DV OffRL
84
1
0
23 Jul 2019
Similarity-Preserving Knowledge Distillation
Frederick Tung
Greg Mori
156
987
0
23 Jul 2019
Highlight Every Step: Knowledge Distillation via Collaborative Teaching
Haoran Zhao
Changrui Chen
Junyu Dong
Xin Sun
Zihe Dong
82
59
0
23 Jul 2019
MemNet: Memory-Efficiency Guided Neural Architecture Search with Augment-Trim learning
Peiye Liu
Bo Wu
Huadong Ma
Mingoo Seok
76
6
0
22 Jul 2019
EnSyth: A Pruning Approach to Synthesis of Deep Learning Ensembles
Besher Alhalabi
M. Gaber
S. Basurra
26
3
0
22 Jul 2019
Open DNN Box by Power Side-Channel Attack
Yun Xiang
Zhuangzhi Chen
Zuohui Chen
Zebin Fang
Haiyang Hao
Jinyin Chen
Yi Liu
Zhefu Wu
Qi Xuan
Xiaoniu Yang
AAML
72
90
0
21 Jul 2019
Convergence of Edge Computing and Deep Learning: A Comprehensive Survey
Xiaofei Wang
Yiwen Han
Victor C. M. Leung
Dusit Niyato
Xueqiang Yan
Xu Chen
113
1,006
0
19 Jul 2019
Light Multi-segment Activation for Model Compression
Zhenhui Xu
Guolin Ke
Jia Zhang
Jiang Bian
Tie-Yan Liu
28
2
0
16 Jul 2019
An Inter-Layer Weight Prediction and Quantization for Deep Neural Networks based on a Smoothly Varying Weight Hypothesis
Kang-Ho Lee
Joonhyun Jeong
Sung-Ho Bae
72
4
0
16 Jul 2019
What does it mean to understand a neural network?
Timothy Lillicrap
Konrad Paul Kording
60
43
0
15 Jul 2019
Bringing Giant Neural Networks Down to Earth with Unlabeled Data
Yehui Tang
Shan You
Chang Xu
Boxin Shi
Chao Xu
85
11
0
13 Jul 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
137
149
0
12 Jul 2019
Neural Epitome Search for Architecture-Agnostic Network Compression
Daquan Zhou
Xiaojie Jin
Qibin Hou
Kaixin Wang
Jianchao Yang
Jiashi Feng
109
13
0
12 Jul 2019
Reinforcement Learning with Chromatic Networks for Compact Architecture Search
Xingyou Song
K. Choromanski
Jack Parker-Holder
Yunhao Tang
Wenbo Gao
Aldo Pacchiano
Tamás Sarlós
Deepali Jain
Yuxiang Yang
50
1
0
10 Jul 2019
Dual Dynamic Inference: Enabling More Efficient, Adaptive and Controllable Deep Inference
Yue Wang
Jianghao Shen
Ting-Kuei Hu
Pengfei Xu
T. Nguyen
Richard Baraniuk
Zhangyang Wang
Yingyan Lin
91
77
0
10 Jul 2019
Preferences Prediction using a Gallery of Mobile Device based on Scene Recognition and Object Detection
Andrey V. Savchenko
K. Demochkin
I. Grechikhin
54
23
0
10 Jul 2019
Data-Independent Neural Pruning via Coresets
Ben Mussay
Margarita Osadchy
Vladimir Braverman
Samson Zhou
Dan Feldman
104
60
0
09 Jul 2019
A Targeted Acceleration and Compression Framework for Low bit Neural Networks
Biao Qian
Yang Wang
MQ
68
0
0
09 Jul 2019
Point-Voxel CNN for Efficient 3D Deep Learning
Zhijian Liu
Haotian Tang
Chengyue Wu
Song Han
3DPC
201
678
0
08 Jul 2019
Exploiting Prunability for Person Re-Identification
Hugo Masson
Amran Bhuiyan
Le Thanh Nguyen-Meidine
Mehrsan Javan
P. Siva
Ismail Ben Ayed
Eric Granger
67
9
0
04 Jul 2019
A Unified Optimization Approach for CNN Model Inference on Integrated GPUs
Leyuan Wang
Zhi Chen
Yizhi Liu
Yao Wang
Lianmin Zheng
Mu Li
Yida Wang
91
30
0
03 Jul 2019
Non-Structured DNN Weight Pruning -- Is It Beneficial in Any Platform?
Xiaolong Ma
Sheng Lin
Shaokai Ye
Zhezhi He
Linfeng Zhang
...
Deliang Fan
Xuehai Qian
Xinyu Lin
Kaisheng Ma
Yanzhi Wang
MQ
132
93
0
03 Jul 2019
Neuron ranking -- an informed way to condense convolutional neural networks architecture
Kamil Adamczewski
Mijung Park
FAtt
32
2
0
03 Jul 2019
Single-Path Mobile AutoML: Efficient ConvNet Design and NAS Hyperparameter Optimization
Dimitrios Stamoulis
Ruizhou Ding
Di Wang
Dimitrios Lymberopoulos
B. Priyantha
Jie Liu
Diana Marculescu
67
34
0
01 Jul 2019
SLSNet: Skin lesion segmentation using a lightweight generative adversarial network
Md. Mostafa Kamal Sarker
Hatem A. Rashwan
Farhan Akram
V. Singh
Syeda Furruka Banu
...
Kabir Ahmed Choudhury
Sylvie Chambon
Petia Radeva
D. Puig
M. Abdel-Nasser
GAN MedIm
71
28
0
01 Jul 2019
One Size Does Not Fit All: Quantifying and Exposing the Accuracy-Latency Trade-off in Machine Learning Cloud Service APIs via Tolerance Tiers
Matthew Halpern
Behzad Boroujerdian
Todd W. Mummert
Evelyn Duesterwald
Vijay Janapa Reddi
49
25
0
26 Jun 2019
New pointwise convolution in Deep Neural Networks through Extremely Fast and Non Parametric Transforms
Joonhyun Jeong
Sung-Ho Bae
62
1
0
25 Jun 2019
COP: Customized Deep Model Compression via Regularized Correlation-Based Filter-Level Pruning
Wenxiao Wang
Cong Fu
Jishun Guo
Deng Cai
Xiaofei He
VLM
67
72
0
25 Jun 2019
Smaller Text Classifiers with Discriminative Cluster Embeddings
Mingda Chen
Kevin Gimpel
35
6
0
23 Jun 2019
Filter Early, Match Late: Improving Network-Based Visual Place Recognition
Stephen Hausler
A. Jacobson
Michael Milford
54
17
0
21 Jun 2019
Progressive Gradient Pruning for Classification, Detection and Domain Adaptation
Le Thanh Nguyen-Meidine
Eric Granger
M. Kiran
Louis-Antoine Blais-Morin
M. Pedersoli
VLM
40
3
0
20 Jun 2019
GAN-Knowledge Distillation for one-stage Object Detection
Wanwei Wang
Jin ke Yu Fan Zong
ObjD
55
29
0
20 Jun 2019
Back to Simplicity: How to Train Accurate BNNs from Scratch?
Joseph Bethge
Haojin Yang
Marvin Bornstein
Christoph Meinel
AAML MQ
69
58
0
19 Jun 2019
ADA-Tucker: Compressing Deep Neural Networks via Adaptive Dimension Adjustment Tucker Decomposition
Zhisheng Zhong
Fangyin Wei
Zhouchen Lin
Chao Zhang
62
29
0
18 Jun 2019
A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks
Dong Wang
Lei Zhou
Xiao Bai
Jun Zhou
28
2
0
18 Jun 2019
Structured Pruning of Recurrent Neural Networks through Neuron Selection
Liangjiang Wen
Xuanyang Zhang
Haoli Bai
Zenglin Xu
72
38
0
17 Jun 2019
Equivariant neural networks and equivarification
Erkao Bao
Linqi Song
64
15
0
16 Jun 2019
Finding the Needle in the Haystack with Convolutions: on the benefits of architectural bias
Stéphane d'Ascoli
Levent Sagun
Joan Bruna
Giulio Biroli
98
37
0
16 Jun 2019
Scalable Model Compression by Entropy Penalized Reparameterization
Deniz Oktay
Johannes Ballé
Saurabh Singh
Abhinav Shrivastava
84
43
0
15 Jun 2019
A Signal Propagation Perspective for Pruning Neural Networks at Initialization
Namhoon Lee
Thalaiyasingam Ajanthan
Stephen Gould
Philip Torr
AAML
82
156
0
14 Jun 2019