ResearchTrend.AI

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
arXiv:1510.00149

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Customized Watermarking for Deep Neural Networks via Label Distribution Perturbation
Tzu-Yun Chien
Chih-Ya Shen
AAML
35
1
0
10 Aug 2022
Fast Heterogeneous Federated Learning with Hybrid Client Selection
Guangyuan Shen
D. Gao
Duanxiao Song
Libin Yang
Xukai Zhou
Shirui Pan
W. Lou
Fang Zhou
FedML
121
13
0
10 Aug 2022
SBPF: Sensitiveness Based Pruning Framework For Convolutional Neural Network On Image Classification
Yihe Lu
Maoguo Gong
Wei Zhao
Kaiyuan Feng
Hao Li
VLM
73
0
0
09 Aug 2022
Controlled Sparsity via Constrained Optimization or: How I Learned to Stop Tuning Penalties and Love Constraints
Jose Gallego-Posada
Juan Ramirez
Akram Erraqabi
Yoshua Bengio
Simon Lacoste-Julien
160
22
0
08 Aug 2022
N2NSkip: Learning Highly Sparse Networks using Neuron-to-Neuron Skip Connections
Arvind Subramaniam
Avinash Sharma
50
17
0
07 Aug 2022
FBI: Fingerprinting models with Benign Inputs
Thibault Maho
Teddy Furon
Erwan Le Merrer
AAML
76
4
0
05 Aug 2022
Model Blending for Text Classification
Ramit Pahwa
42
0
0
05 Aug 2022
Distributional Correlation-Aware Knowledge Distillation for Stock Trading Volume Prediction
Lei Li
Zhiyuan Zhang
Ruihan Bao
Keiko Harimoto
Xu Sun
79
3
0
04 Aug 2022
ZeroFL: Efficient On-Device Training for Federated Learning with Local Sparsity
Xinchi Qiu
Javier Fernandez-Marques
Pedro Gusmão
Yan Gao
Titouan Parcollet
Nicholas D. Lane
FedML
85
71
0
04 Aug 2022
DeFL: Decentralized Weight Aggregation for Cross-silo Federated Learning
Jialiang Han
Yudong Han
Gang Huang
Yun Ma
FedML
69
4
0
01 Aug 2022
Adaptive Edge Offloading for Image Classification Under Rate Limit
Jiaming Qiu
Ruiqi Wang
Ayan Chakrabarti
Roch Guérin
Chenyang Lu
OffRL
63
14
0
31 Jul 2022
Eco2AI: carbon emissions tracking of machine learning models as the first step towards sustainable AI
S. Budennyy
V. Lazarev
N. Zakharenko
A. Korovin
Olga Plosskaya
...
Ivan Oseledets
I. Barsola
Ilya M. Egorov
A. Kosterina
L. Zhukov
112
107
0
31 Jul 2022
CoNLoCNN: Exploiting Correlation and Non-Uniform Quantization for Energy-Efficient Low-precision Deep Convolutional Neural Networks
Muhammad Abdullah Hanif
G. M. Sarda
Alberto Marchisio
Guido Masera
Maurizio Martina
Mohamed Bennai
MQ
64
4
0
31 Jul 2022
Distilled Low Rank Neural Radiance Field with Quantization for Light Field Compression
Jinglei Shi
C. Guillemot
MQ
65
7
0
30 Jul 2022
Lightweight and Progressively-Scalable Networks for Semantic Segmentation
Yiheng Zhang
Ting Yao
Zhaofan Qiu
Tao Mei
SSeg
115
24
0
27 Jul 2022
A Proper Orthogonal Decomposition approach for parameters reduction of Single Shot Detector networks
L. Meneghetti
N. Demo
G. Rozza
74
1
0
27 Jul 2022
Quiver neural networks
I. Ganev
Robin Walters
70
4
0
26 Jul 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-time Super-Resolution
Yushu Wu
Yifan Gong
Pu Zhao
Yanyu Li
Zheng Zhan
Wei Niu
Hao Tang
Minghai Qin
Bin Ren
Yanzhi Wang
SupR, MQ
88
23
0
25 Jul 2022
Trainability Preserving Neural Pruning
Huan Wang
Yun Fu
AAML
70
17
0
25 Jul 2022
FairGRAPE: Fairness-aware GRAdient Pruning mEthod for Face Attribute Classification
Xiao-Ze Lin
Seungbae Kim
Jungseock Joo
CVBM
94
42
0
22 Jul 2022
Efficient model compression with Random Operation Access Specific Tile (ROAST) hashing
Aditya Desai
K. Zhou
Anshumali Shrivastava
67
1
0
21 Jul 2022
TinyViT: Fast Pretraining Distillation for Small Vision Transformers
Kan Wu
Jinnian Zhang
Houwen Peng
Mengchen Liu
Bin Xiao
Jianlong Fu
Lu Yuan
ViT
87
267
0
21 Jul 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
70
19
0
20 Jul 2022
Beyond Transmitting Bits: Context, Semantics, and Task-Oriented Communications
Deniz Gunduz
Zhijin Qin
Iñaki Estella Aguerri
Harpreet S. Dhillon
Zhaohui Yang
Aylin Yener
Kai‐Kit Wong
C. Chae
111
461
0
19 Jul 2022
The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks
Jacob Renn
Ian Sotnek
Benjamin Harvey
B. Caffo
OOD, CLL
40
0
0
18 Jul 2022
S4: a High-sparsity, High-performance AI Accelerator
Ian En-Hsu Yen
Zhibin Xiao
Dongkuan Xu
58
3
0
16 Jul 2022
Towards Lightweight Super-Resolution with Dual Regression Learning
Yong Guo
Mingkui Tan
Zeshuai Deng
Jingdong Wang
Qi Chen
Jingyun Liang
Yanwu Xu
Jian Chen
SupR
129
11
0
16 Jul 2022
Approximation Capabilities of Neural Networks using Morphological Perceptrons and Generalizations
William Chang
Hassan Hamad
K. Chugg
35
2
0
16 Jul 2022
Lightweight Vision Transformer with Cross Feature Attention
Youpeng Zhao
Huadong Tang
Yingying Jiang
A. Yong
Qiang Wu
ViT
70
10
0
15 Jul 2022
Lipschitz Continuity Retained Binary Neural Network
Yuzhang Shang
Dan Xu
Bin Duan
Ziliang Zong
Liqiang Nie
Yan Yan
84
19
0
13 Jul 2022
Look-ups are not (yet) all you need for deep learning inference
Calvin McCarter
Nicholas Dronen
59
5
0
12 Jul 2022
Synergistic Self-supervised and Quantization Learning
Yunhao Cao
Peiqin Sun
Yechang Huang
Jianxin Wu
Shuchang Zhou
MQ
51
13
0
12 Jul 2022
Normalized Feature Distillation for Semantic Segmentation
Tao Liu
Xi Yang
Chenshu Chen
57
5
0
12 Jul 2022
STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
Liwei Guo
Wonkyo Choe
F. Lin
72
15
0
11 Jul 2022
Sparsifying Binary Networks
Riccardo Schiavone
Maria A. Zuluaga
MQ
52
0
0
11 Jul 2022
SparseTIR: Composable Abstractions for Sparse Compilation in Deep Learning
Zihao Ye
Ruihang Lai
Junru Shao
Tianqi Chen
Luis Ceze
167
99
0
11 Jul 2022
On the Robustness and Anomaly Detection of Sparse Neural Networks
Morgane Ayle
Bertrand Charpentier
John Rachwan
Daniel Zügner
Simon Geisler
Stephan Günnemann
AAML
91
3
0
09 Jul 2022
SInGE: Sparsity via Integrated Gradients Estimation of Neuron Relevance
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
90
9
0
08 Jul 2022
Towards Semantic Communication Protocols: A Probabilistic Logic Perspective
Sejin Seo
Jihong Park
Seung-Woo Ko
Jinho Choi
M. Bennis
Seong-Lyun Kim
82
23
0
08 Jul 2022
Pruning Early Exit Networks
Alperen Görmez
Erdem Koyuncu
43
5
0
08 Jul 2022
Network Binarization via Contrastive Learning
Yuzhang Shang
Dan Xu
Ziliang Zong
Liqiang Nie
Yan Yan
AAML, MQ
96
22
0
06 Jul 2022
Task-Oriented Sensing, Computation, and Communication Integration for Multi-Device Edge AI
Dingzhu Wen
Peixi Liu
Guangxu Zhu
Yuanming Shi
Jie Xu
Yonina C. Eldar
Shuguang Cui
80
118
0
03 Jul 2022
Enabling Harmonious Human-Machine Interaction with Visual-Context Augmented Dialogue System: A Review
Hao Wang
Bin Guo
Y. Zeng
Yasan Ding
Chen Qiu
Ying Zhang
Li Yao
Zhiwen Yu
79
2
0
02 Jul 2022
VEDLIoT: Very Efficient Deep Learning in IoT
M. Kaiser
R. Griessl
N. Kucza
C. Haumann
L. Tigges
...
James Ménétrey
K. Gugala
P. Zierhoffer
E. Knauss
Hans-Martin Heyn
23
8
0
01 Jul 2022
DRESS: Dynamic REal-time Sparse Subnets
Zhongnan Qu
Syed Shakib Sarwar
Xin Dong
Yuecheng Li
Huseyin Ekin Sumbul
B. D. Salvo
3DH
69
1
0
01 Jul 2022
Studying the impact of magnitude pruning on contrastive learning methods
Francesco Corti
R. Entezari
Sara Hooker
D. Bacciu
O. Saukh
56
5
0
01 Jul 2022
DarKnight: An Accelerated Framework for Privacy and Integrity Preserving Deep Learning Using Trusted Hardware
H. Hashemi
Yongqin Wang
M. Annavaram
FedML
74
61
0
30 Jun 2022
On-Device Training Under 256KB Memory
Ji Lin
Ligeng Zhu
Wei-Ming Chen
Wei-Chen Wang
Chuang Gan
Song Han
MQ
144
213
0
30 Jun 2022
QUIDAM: A Framework for Quantization-Aware DNN Accelerator and Model Co-Exploration
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Ting-Wu Chin
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
44
3
0
30 Jun 2022
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
Vitaliy Chiley
Vithursan Thangarasa
Abhay Gupta
Anshul Samar
Joel Hestness
D. DeCoste
90
8
0
28 Jun 2022