ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Why is the State of Neural Network Pruning so Confusing? On the
  Fairness, Comparison Setup, and Trainability in Network Pruning
Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Pruning
Huan Wang
Can Qin
Yue Bai
Yun Fu
105
21
0
12 Jan 2023
AnycostFL: Efficient On-Demand Federated Learning over Heterogeneous
  Edge Devices
AnycostFL: Efficient On-Demand Federated Learning over Heterogeneous Edge Devices
Peichun Li
Guoliang Cheng
Xumin Huang
Jiawen Kang
Rong Yu
Yuan Wu
Miao Pan
FedML
113
22
0
08 Jan 2023
Does compressing activations help model parallel training?
Does compressing activations help model parallel training?
S. Bian
Dacheng Li
Hongyi Wang
Eric P. Xing
Shivaram Venkataraman
81
9
0
06 Jan 2023
A Theory of I/O-Efficient Sparse Neural Network Inference
A Theory of I/O-Efficient Sparse Neural Network Inference
Niels Gleinig
Tal Ben-Nun
Torsten Hoefler
64
0
0
03 Jan 2023
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and
  Semantics
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics
Yahao Ding
Zhaohui Yang
Quoc-Viet Pham
Zhaoyang Zhang
M. Shikh-Bahaei
86
38
0
03 Jan 2023
SAFEMYRIDES: Application of Decentralized Control Edge-Computing to
  Ridesharing Monitoring Services
SAFEMYRIDES: Application of Decentralized Control Edge-Computing to Ridesharing Monitoring Services
S. Elnagar
Manoj A. Thomas
Kweku-Muata A. Osei-Bryson
68
0
0
02 Jan 2023
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot
Elias Frantar
Dan Alistarh
VLM
165
739
0
02 Jan 2023
Holistic Network Virtualization and Pervasive Network Intelligence for
  6G
Holistic Network Virtualization and Pervasive Network Intelligence for 6G
Xuemin Shen
Shen
Jie Gao
Wen Wu
Mushu Li
Conghao Zhou
W. Zhuang
111
239
0
02 Jan 2023
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via
  Deep Reinforcement Learning
Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning
Wen Wu
Peng Yang
Weiting Zhang
Conghao Zhou
Xuemin
X. Shen
141
108
0
31 Dec 2022
FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep
  Neural Networks
FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep Neural Networks
Akul Malhotra
S. Gupta
43
0
0
29 Dec 2022
QuickNets: Saving Training and Preventing Overconfidence in Early-Exit
  Neural Architectures
QuickNets: Saving Training and Preventing Overconfidence in Early-Exit Neural Architectures
Devdhar Patel
H. Siegelmann
OnRL
85
1
0
25 Dec 2022
Hyperspherical Quantization: Toward Smaller and More Accurate Models
Hyperspherical Quantization: Toward Smaller and More Accurate Models
Dan Liu
X. Chen
Chen Ma
Xue Liu
MQ
73
3
0
24 Dec 2022
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Danyang Liu
Xue Liu
72
0
0
24 Dec 2022
Hyperspherical Loss-Aware Ternary Quantization
Hyperspherical Loss-Aware Ternary Quantization
Dan Liu
Xue Liu
MQ
65
0
0
24 Dec 2022
Exploring Content Relationships for Distilling Efficient GANs
Exploring Content Relationships for Distilling Efficient GANs
Lizhou You
Mingbao Lin
Tie Hu
Yong Li
Rongrong Ji
75
4
0
21 Dec 2022
Redistribution of Weights and Activations for AdderNet Quantization
Redistribution of Weights and Activations for AdderNet Quantization
Ying Nie
Kai Han
Haikang Diao
Chuanjian Liu
Enhua Wu
Yunhe Wang
MQ
96
6
0
20 Dec 2022
The case for 4-bit precision: k-bit Inference Scaling Laws
The case for 4-bit precision: k-bit Inference Scaling Laws
Tim Dettmers
Luke Zettlemoyer
MQ
114
234
0
19 Dec 2022
Training Lightweight Graph Convolutional Networks with Phase-field
  Models
Training Lightweight Graph Convolutional Networks with Phase-field Models
H. Sahbi
77
0
0
19 Dec 2022
FSCNN: A Fast Sparse Convolution Neural Network Inference System
FSCNN: A Fast Sparse Convolution Neural Network Inference System
Bo Ji
Tianyi Chen
62
3
0
17 Dec 2022
Atrous Space Bender U-Net (ASBU-Net/LogiNet)
Atrous Space Bender U-Net (ASBU-Net/LogiNet)
Anurag Bansal
O. Ostap
Miguel Maestre Trueba
Kristopher Perry
SSeg
95
1
0
16 Dec 2022
Can We Find Strong Lottery Tickets in Generative Models?
Can We Find Strong Lottery Tickets in Generative Models?
Sangyeop Yeo
Yoojin Jang
Jy-yong Sohn
Dongyoon Han
Jaejun Yoo
52
7
0
16 Dec 2022
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners
Zitian Chen
Songlin Yang
Mingyu Ding
Zhenfang Chen
Hengshuang Zhao
E. Learned-Miller
Chuang Gan
MoE
50
15
0
15 Dec 2022
Towards Hardware-Specific Automatic Compression of Neural Networks
Towards Hardware-Specific Automatic Compression of Neural Networks
Torben Krieger
Bernhard Klein
Holger Fröning
MQ
71
2
0
15 Dec 2022
Quant 4.0: Engineering Quantitative Investment with Automated,
  Explainable and Knowledge-driven Artificial Intelligence
Quant 4.0: Engineering Quantitative Investment with Automated, Explainable and Knowledge-driven Artificial Intelligence
Jian Guo
Saizhuo Wang
L. Ni
H. Shum
AIFin
104
8
0
13 Dec 2022
ResFed: Communication Efficient Federated Learning by Transmitting Deep
  Compressed Residuals
ResFed: Communication Efficient Federated Learning by Transmitting Deep Compressed Residuals
Rui Song
Liguo Zhou
Lingjuan Lyu
Andreas Festag
Alois Knoll
FedML
83
5
0
11 Dec 2022
Statistical guarantees for sparse deep learning
Statistical guarantees for sparse deep learning
Johannes Lederer
45
11
0
11 Dec 2022
Vertical Layering of Quantized Neural Networks for Heterogeneous
  Inference
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
89
2
0
10 Dec 2022
QVIP: An ILP-based Formal Verification Approach for Quantized Neural
  Networks
QVIP: An ILP-based Formal Verification Approach for Quantized Neural Networks
Yedi Zhang
Zhe Zhao
Fu Song
Hao Fei
Tao Chen
Jun Sun
69
18
0
10 Dec 2022
Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural
  Networks
Optimizing Learning Rate Schedules for Iterative Pruning of Deep Neural Networks
Shiyu Liu
Rohan Ghosh
John Tan Chong Min
Mehul Motani
83
0
0
09 Dec 2022
Analysis of Deep Learning Architectures and Efficacy of Detecting Forest
  Fires
Analysis of Deep Learning Architectures and Efficacy of Detecting Forest Fires
Ryan Marinelli
84
0
0
08 Dec 2022
Efficient Stein Variational Inference for Reliable Distribution-lossless
  Network Pruning
Efficient Stein Variational Inference for Reliable Distribution-lossless Network Pruning
Yingchun Wang
Song Guo
Jingcai Guo
Weizhan Zhang
Yi Tian Xu
Jiewei Zhang
Yi Liu
87
17
0
07 Dec 2022
Slimmable Pruned Neural Networks
Slimmable Pruned Neural Networks
Hideaki Kuratsu
Atsuyoshi Nakamura
109
2
0
07 Dec 2022
Label-free Knowledge Distillation with Contrastive Loss for Light-weight
  Speaker Recognition
Label-free Knowledge Distillation with Contrastive Loss for Light-weight Speaker Recognition
Zhiyuan Peng
Xuanji He
Ke Ding
Tan Lee
Guanglu Wan
62
6
0
06 Dec 2022
QEBVerif: Quantization Error Bound Verification of Neural Networks
QEBVerif: Quantization Error Bound Verification of Neural Networks
Yedi Zhang
Fu Song
Jun Sun
MQ
99
12
0
06 Dec 2022
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
MobileTL: On-device Transfer Learning with Inverted Residual Blocks
HungYueh Chiang
N. Frumkin
Feng Liang
Diana Marculescu
MQ
77
12
0
05 Dec 2022
Distributed Pruning Towards Tiny Neural Networks in Federated Learning
Distributed Pruning Towards Tiny Neural Networks in Federated Learning
Hong Huang
Lan Zhang
Chaoyue Sun
R. Fang
Xiaoyong Yuan
Dapeng Wu
FedML
71
18
0
05 Dec 2022
Exploiting Kernel Compression on BNNs
Exploiting Kernel Compression on BNNs
Franyell Silfa
J. Arnau
Antonio González
MQ
55
0
0
01 Dec 2022
Boosted Dynamic Neural Networks
Boosted Dynamic Neural Networks
Haichao Yu
Haoxiang Li
G. Hua
Gao Huang
Humphrey Shi
103
9
0
30 Nov 2022
Compressing Volumetric Radiance Fields to 1 MB
Compressing Volumetric Radiance Fields to 1 MB
Lingzhi Li
Zhen Shen
Zhongshu Wang
Li Shen
Liefeng Bo
79
67
0
29 Nov 2022
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization
  for Vision Transformers
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
Yijiang Liu
Huanrui Yang
Zhen Dong
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
108
53
0
29 Nov 2022
Feature-domain Adaptive Contrastive Distillation for Efficient Single
  Image Super-Resolution
Feature-domain Adaptive Contrastive Distillation for Efficient Single Image Super-Resolution
Hye-Min Moon
Jinwoo Jeong
Sungjei Kim
97
2
0
29 Nov 2022
On the Effectiveness of Parameter-Efficient Fine-Tuning
On the Effectiveness of Parameter-Efficient Fine-Tuning
Z. Fu
Haoran Yang
Anthony Man-Cho So
Wai Lam
Lidong Bing
Nigel Collier
82
162
0
28 Nov 2022
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Hongjie Zhang
OffRL
44
0
0
28 Nov 2022
Class-based Quantization for Neural Networks
Class-based Quantization for Neural Networks
Wenhao Sun
Grace Li Zhang
Huaxi Gu
Bing Li
Ulf Schlichtmann
MQ
73
7
0
27 Nov 2022
SteppingNet: A Stepping Neural Network with Incremental Accuracy
  Enhancement
SteppingNet: A Stepping Neural Network with Incremental Accuracy Enhancement
Wenhao Sun
Grace Li Zhang
Xunzhao Yin
Cheng Zhuo
Huaxi Gu
Bing Li
Ulf Schlichtmann
52
2
0
27 Nov 2022
Diffusion Probabilistic Model Made Slim
Diffusion Probabilistic Model Made Slim
Xingyi Yang
Daquan Zhou
Jiashi Feng
Xinchao Wang
DiffM
115
111
0
27 Nov 2022
Medical Image Segmentation Review: The success of U-Net
Medical Image Segmentation Review: The success of U-Net
Reza Azad
Ehsan Khodapanah Aghdam
Amelie Rauland
Yiwei Jia
Atlas Haddadi Avval
Afshin Bozorgpour
Sanaz Karimijafarbigloo
Joseph Paul Cohen
Ehsan Adeli
Dorit Merhof
SSeg
131
326
0
27 Nov 2022
Fast and Efficient Malware Detection with Joint Static and Dynamic
  Features Through Transfer Learning
Fast and Efficient Malware Detection with Joint Static and Dynamic Features Through Transfer Learning
Mao V. Ngo
Tram Truong-Huu
Dima Rabadi
Jia Yi Loo
Sin Gee Teo
26
7
0
25 Nov 2022
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
91
1
0
25 Nov 2022
PAC-Bayes Compression Bounds So Tight That They Can Explain
  Generalization
PAC-Bayes Compression Bounds So Tight That They Can Explain Generalization
Sanae Lotfi
Marc Finzi
Sanyam Kapoor
Andres Potapczynski
Micah Goldblum
A. Wilson
BDLMLTAI4CE
92
62
0
24 Nov 2022
Previous
123...171819...686970
Next