ResearchTrend.AI

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Distilled Neural Networks for Efficient Learning to Rank
F. M. Nardini
Cosimo Rulli
Salvatore Trani
Rossano Venturini
FedML
58
16
0
22 Feb 2022
Enabling On-Device Smartphone GPU based Training: Lessons Learned
Anish Das
Young D. Kwon
Jagmohan Chauhan
Cecilia Mascolo
3DH
84
11
0
21 Feb 2022
AI/ML Algorithms and Applications in VLSI Design and Technology
Deepthi Amuru
Harsha V. Vudumula
Pavan K. Cherupally
Sushanth R. Gurram
Amir Ahmad
Andleeb Zahra
Zia Abbas
141
44
0
21 Feb 2022
Sparsity Winning Twice: Better Robust Generalization from More Efficient Training
Tianlong Chen
Zhenyu Zhang
Pengju Wang
Santosh Balachandra
Haoyu Ma
Zehao Wang
Zhangyang Wang
OOD, AAML
158
50
0
20 Feb 2022
Amenable Sparse Network Investigator
S. Damadi
Erfan Nouri
Hamed Pirsiavash
53
4
0
18 Feb 2022
LG-LSQ: Learned Gradient Linear Symmetric Quantization
Shih-Ting Lin
Zhaofang Li
Yu-Hsiang Cheng
Hao-Wen Kuo
Chih-Cheng Lu
K. Tang
MQ
78
2
0
18 Feb 2022
Where Is My Training Bottleneck? Hidden Trade-Offs in Deep Learning Preprocessing Pipelines
Alexander Isenko
R. Mayer
Jeffrey Jedele
Hans-Arno Jacobsen
104
26
0
17 Feb 2022
Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations
Zirui Peng
Shaofeng Li
Guoxing Chen
Cheng Zhang
Haojin Zhu
Minhui Xue
AAML, FedML
122
69
0
17 Feb 2022
Heuristic Adaptability to Input Dynamics for SpMM on GPUs
Guohao Dai
Guyue Huang
Shang Yang
Zhongming Yu
Hengrui Zhang
Yufei Ding
Yuan Xie
Huazhong Yang
Yu Wang
41
20
0
17 Feb 2022
Practical Network Acceleration with Tiny Sets
G. Wang
Jianxin Wu
104
11
0
16 Feb 2022
DualConv: Dual Convolutional Kernels for Lightweight Deep Neural Networks
Jiachen Zhong
Junying Chen
Ajmal Mian
46
63
0
15 Feb 2022
A Survey on Model Compression and Acceleration for Pretrained Language Models
Canwen Xu
Julian McAuley
110
61
0
15 Feb 2022
BED: A Real-Time Object Detection System for Edge Devices
Guanchu Wang
Zaid Pervaiz Bhat
Zhimeng Jiang
Yi-Wei Chen
Daochen Zha
...
A. Niktash
Mehmet Gorkem Ulkar
O. E. Okman
Xuanting Cai
Helen Zhou
52
11
0
14 Feb 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
126
75
0
14 Feb 2022
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
Qing Jin
Jian Ren
Richard Zhuang
Sumant Hanumante
Zhengang Li
Zhiyu Chen
Yanzhi Wang
Kai-Min Yang
Sergey Tulyakov
MQ
97
50
0
10 Feb 2022
Deadwooding: Robust Global Pruning for Deep Neural Networks
Sawinder Kaur
Ferdinando Fioretto
Asif Salekin
82
4
0
10 Feb 2022
Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment
Jemin Lee
Misun Yu
Yongin Kwon
Teaho Kim
MQ
75
17
0
10 Feb 2022
Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets
Tianlong Chen
Xuxi Chen
Xiaolong Ma
Yanzhi Wang
Zhangyang Wang
87
34
0
09 Feb 2022
EvoPruneDeepTL: An Evolutionary Pruning Model for Transfer Learning based Deep Neural Networks
Javier Poyatos
Daniel Molina
Aritz D. Martinez
Javier Del Ser
Francisco Herrera
87
36
0
08 Feb 2022
Membership Inference Attacks and Defenses in Neural Network Pruning
Xiaoyong Yuan
Lan Zhang
AAML
112
45
0
07 Feb 2022
The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training
Shiwei Liu
Tianlong Chen
Xiaohan Chen
Li Shen
Decebal Constantin Mocanu
Zhangyang Wang
Mykola Pechenizkiy
107
113
0
05 Feb 2022
EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators
Lois Orosa
Skanda Koppula
Yaman Umuroglu
Konstantinos Kanellopoulos
Juan Gómez Luna
Michaela Blott
K. Vissers
O. Mutlu
82
4
0
04 Feb 2022
Comparative assessment of federated and centralized machine learning
Ibrahim Abdul Majeed
Sagar Kaushik
Aniruddha Bardhan
Venkata Siva Kumar Tadi
Hwang-Ki Min
K. Kumaraguru
Rajasekhara Reddy Duvvuru Muni
FedML
50
7
0
03 Feb 2022
Robust Binary Models by Pruning Randomly-initialized Networks
Chen Liu
Ziqi Zhao
Sabine Süsstrunk
Mathieu Salzmann
TPM, AAML, MQ
89
4
0
03 Feb 2022
Nonlinear Initialization Methods for Low-Rank Neural Networks
Kiran Vodrahalli
Rakesh Shivanna
M. Sathiamoorthy
Sagar Jain
Ed H. Chi
94
4
0
02 Feb 2022
Accelerating DNN Training with Structured Data Gradient Pruning
Bradley McDanel
Helia Dinh
J. Magallanes
79
8
0
01 Feb 2022
Win the Lottery Ticket via Fourier Analysis: Frequencies Guided Network Pruning
Yuzhang Shang
Bin Duan
Ziliang Zong
Liqiang Nie
Yan Yan
24
1
0
30 Jan 2022
Electra: Conditional Generative Model based Predicate-Aware Query Approximation
Nikhil Sheoran
Subrata Mitra
Vibhor Porwal
Siddharth Ghetia
Jatin Varshney
Tung Mai
Anup B. Rao
Vikas Maddukuri
94
13
0
28 Jan 2022
On the Mitigation of Read Disturbances in Neuromorphic Inference Hardware
A. Paul
Shihao Song
Twisha Titirsha
Anup Das
56
5
0
27 Jan 2022
Post-training Quantization for Neural Networks with Provable Guarantees
Jinjie Zhang
Yixuan Zhou
Rayan Saab
MQ
89
34
0
26 Jan 2022
LiteHAR: Lightweight Human Activity Recognition from WiFi Signals with Random Convolution Kernels
Hojjat Salehinejad
S. Valaee
56
32
0
23 Jan 2022
Iterative Activation-based Structured Pruning
Kaiqi Zhao
Animesh Jain
Ming Zhao
77
0
0
22 Jan 2022
Enabling Deep Learning on Edge Devices through Filter Pruning and Knowledge Transfer
Kaiqi Zhao
Yitao Chen
Ming Zhao
44
3
0
22 Jan 2022
Adaptive Activation-based Structured Pruning
Kaiqi Zhao
Animesh Jain
Ming Zhao
80
1
0
21 Jan 2022
APack: Off-Chip, Lossless Data Compression for Efficient Deep Learning Inference
Alberto Delmas Lascorz
Mostafa Mahmoud
Andreas Moshovos
MQ
42
1
0
21 Jan 2022
Hardware-Efficient Deconvolution-Based GAN for Edge Computing
A. Alhussain
Mingjie Lin
64
5
0
18 Jan 2022
Egeria: Efficient DNN Training with Knowledge-Guided Layer Freezing
Yiding Wang
D. Sun
Kai Chen
Fan Lai
Mosharaf Chowdhury
115
47
0
17 Jan 2022
UDC: Unified DNAS for Compressible TinyML Models
Igor Fedorov
Ramon Matas
Hokchhay Tann
Chu Zhou
Matthew Mattina
P. Whatmough
AI4CE
87
14
0
15 Jan 2022
GhostNets on Heterogeneous Devices via Cheap Operations
Kai Han
Yunhe Wang
Chang Xu
Jianyuan Guo
Chunjing Xu
Enhua Wu
Qi Tian
78
108
0
10 Jan 2022
ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections
Ruikang Ju
Ting-Yu Lin
Jia-Hao Jian
Jen-Shiun Chiang
Weida Yang
60
9
0
09 Jan 2022
An Adaptive Device-Edge Co-Inference Framework Based on Soft Actor-Critic
Tao Niu
Yinglei Teng
Zhu Han
Panpan Zou
12
10
0
09 Jan 2022
Block Walsh-Hadamard Transform Based Binary Layers in Deep Neural Networks
Hongyi Pan
Diaa Badawi
Ahmet Enis Cetin
77
19
0
07 Jan 2022
The Effect of Model Compression on Fairness in Facial Expression Recognition
Samuil Stoychev
Hatice Gunes
CVBM
136
19
0
05 Jan 2022
Extending the limit of molecular dynamics with ab initio accuracy to 10 billion atoms
Zhuoqiang Guo
Denghui Lu
Yujin Yan
Siyu Hu
Rongrong Liu
...
Yixiao Chen
Linfeng Zhang
Mohan Chen
Han Wang
Weile Jia
AI4CE
50
41
0
05 Jan 2022
Problem-dependent attention and effort in neural networks with applications to image resolution and model selection
Chris Rohlfs
105
4
0
05 Jan 2022
Role of Data Augmentation Strategies in Knowledge Distillation for Wearable Sensor Data
Eunyeong Jeon
Anirudh Som
Ankita Shukla
Kristina Hasanaj
M. Buman
Pavan Turaga
64
13
0
01 Jan 2022
Croesus: Multi-Stage Processing and Transactions for Video-Analytics in Edge-Cloud Systems
Samaa Gazzaz
Vishal Chakraborty
Faisal Nawab
48
10
0
31 Dec 2021
Single-Shot Pruning for Offline Reinforcement Learning
Samin Yeasar Arnob
Riyasat Ohib
Sergey Plis
Doina Precup
OffRL
69
25
0
31 Dec 2021
Conditional Generative Data-free Knowledge Distillation
Xinyi Yu
Ling Yan
Yang Yang
Libo Zhou
Linlin Ou
84
8
0
31 Dec 2021
Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks
Runpei Dong
Zhanhong Tan
Mengdi Wu
Linfeng Zhang
Kaisheng Ma
MQ
97
12
0
30 Dec 2021