ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Generative Model for Models: Rapid DNN Customization for Diverse Tasks
  and Resource Constraints
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Yixuan Li
Hao Wen
Yunxin Liu
96
1
0
29 Aug 2023
Uncovering the Hidden Cost of Model Compression
Uncovering the Hidden Cost of Model Compression
Diganta Misra
Muawiz Chaudhary
Agam Goyal
Bharat Runwal
Pin-Yu Chen
VLM
97
0
0
29 Aug 2023
Low-bit Quantization for Deep Graph Neural Networks with
  Smoothness-aware Message Propagation
Low-bit Quantization for Deep Graph Neural Networks with Smoothness-aware Message Propagation
Shuang Wang
B. Eravcı
Rustam Guliyev
Hakan Ferhatosmanoglu
GNNMQ
70
9
0
29 Aug 2023
Maestro: Uncovering Low-Rank Structures via Trainable Decomposition
Maestro: Uncovering Low-Rank Structures via Trainable Decomposition
Samuel Horváth
Stefanos Laskaridis
Shashank Rajput
Hongyi Wang
BDL
90
4
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
141
21
0
27 Aug 2023
Homological Convolutional Neural Networks
Homological Convolutional Neural Networks
Antonio Briola
Yuanrong Wang
Silvia Bartolucci
T. Aste
LMTD
87
7
0
26 Aug 2023
MST-compression: Compressing and Accelerating Binary Neural Networks
  with Minimum Spanning Tree
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
Quang Hieu Vo
Linh-Tam Tran
Sung-Ho Bae
Lokwon Kim
Choong Seon Hong
MQ
94
1
0
26 Aug 2023
REFT: Resource-Efficient Federated Training Framework for Heterogeneous
  and Resource-Constrained Environments
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
Humaid Ahmed Desai
Amr B. Hilal
Hoda Eldardiry
67
0
0
25 Aug 2023
Federated Learning in IoT: a Survey from a Resource-Constrained
  Perspective
Federated Learning in IoT: a Survey from a Resource-Constrained Perspective
Ishmeet Kaur
87
3
0
25 Aug 2023
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Bryan Bo Cao
Lawrence O'Gorman
Michael J. Coss
Shubham Jain
62
2
0
24 Aug 2023
Multi-stage feature decorrelation constraints for improving CNN
  classification performance
Multi-stage feature decorrelation constraints for improving CNN classification performance
Qiuyu Zhu
Hao Wang
Xuewen Zu
Chengfei Liu
42
0
0
24 Aug 2023
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy
  Measurement
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy Measurement
S. Rajput
Tim Widmayer
Ziyuan Shang
M. Kechagia
Federica Sarro
Tushar Sharma
106
4
0
23 Aug 2023
Sampling From Autoencoders' Latent Space via Quantization And
  Probability Mass Function Concepts
Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts
Aymene Mohammed Bouayed
Adrian Iaccovelli
D. Naccache
57
0
0
21 Aug 2023
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep
  Neural Networks
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
Kaixin Xu
Zhe Wang
Xue Geng
Jie Lin
Min-man Wu
Xiaoli Li
Weisi Lin
77
15
0
21 Aug 2023
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Brijesh Vora
Kartik Patwari
Syed Mahbub Hafiz
Zubair Shafiq
Chen-Nee Chuah
AAML
79
2
0
16 Aug 2023
A Survey on Model Compression for Large Language Models
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
143
233
0
15 Aug 2023
Ada-QPacknet -- adaptive pruning with bit width reduction as an
  efficient continual learning method without forgetting
Ada-QPacknet -- adaptive pruning with bit width reduction as an efficient continual learning method without forgetting
Marcin Pietroñ
Dominik Zurek
Kamil Faber
Roberto Corizzo
CLL
73
2
0
14 Aug 2023
Estimator Meets Equilibrium Perspective: A Rectified Straight Through
  Estimator for Binary Neural Networks Training
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
Xiao-Ming Wu
Dian Zheng
Zuhao Liu
Weishi Zheng
MQ
118
18
0
13 Aug 2023
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of
  Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Seyedarmin Azizi
M. Nazemi
A. Fayyazi
Massoud Pedram
MQ
64
5
0
12 Aug 2023
SSL-Auth: An Authentication Framework by Fragile Watermarking for
  Pre-trained Encoders in Self-supervised Learning
SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning
Xiaobei Li
Changchun Yin
Liyue Zhu
Xiaogang Xu
Liming Fang
Run Wang
Chenhao Lin
AAML
86
1
0
09 Aug 2023
Resource Constrained Model Compression via Minimax Optimization for
  Spiking Neural Networks
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks
Jue Chen
Huan Yuan
Jianchao Tan
Bin Chen
Chengru Song
Di Zhang
71
4
0
09 Aug 2023
Lossy and Lossless (L$^2$) Post-training Model Size Compression
Lossy and Lossless (L2^22) Post-training Model Size Compression
Yumeng Shi
Shihao Bai
Xiuying Wei
Ruihao Gong
Jianlei Yang
53
3
0
08 Aug 2023
D-Score: A Synapse-Inspired Approach for Filter Pruning
D-Score: A Synapse-Inspired Approach for Filter Pruning
Doyoung Park
Jinsoo Kim
Ji-Min Nam
Jooyoung Chang
S. Park
62
0
0
08 Aug 2023
Pruning a neural network using Bayesian inference
Pruning a neural network using Bayesian inference
Sunil Mathew
D. Rowe
34
0
0
04 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
60
1
0
02 Aug 2023
An Introduction to Bi-level Optimization: Foundations and Applications
  in Signal Processing and Machine Learning
An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning
Yihua Zhang
Prashant Khanduri
Ioannis C. Tsaknakis
Yuguang Yao
Min-Fong Hong
Sijia Liu
AI4CE
129
31
0
01 Aug 2023
Evaluating Spiking Neural Network On Neuromorphic Platform For Human
  Activity Recognition
Evaluating Spiking Neural Network On Neuromorphic Platform For Human Activity Recognition
Sizhen Bian
Michele Magno
55
6
0
01 Aug 2023
Improving Generalization of Adversarial Training via Robust Critical
  Fine-Tuning
Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
Kaijie Zhu
Jindong Wang
Xixu Hu
Xingxu Xie
G. Yang
AAML
76
25
0
01 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of
  Precision Redundancy
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy
Shibo Jie
Haoqing Wang
Zhiwei Deng
76
34
0
31 Jul 2023
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Saizhuo Wang
Hang Yuan
Leon Zhou
L. Ni
H. Shum
Jian Guo
67
25
0
31 Jul 2023
Stable Adam Optimization for 16-bit Neural Networks Training
Juyoung Yun
25
1
0
30 Jul 2023
Incrementally-Computable Neural Networks: Efficient Inference for
  Dynamic Inputs
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
Or Sharir
Anima Anandkumar
60
0
0
27 Jul 2023
Object-based Probabilistic Similarity Evidence of Sparse Latent Features
  from Fully Convolutional Networks
Object-based Probabilistic Similarity Evidence of Sparse Latent Features from Fully Convolutional Networks
Cyril Juliani
25
0
0
25 Jul 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights
  Generation
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
68
3
0
25 Jul 2023
An Estimator for the Sensitivity to Perturbations of Deep Neural
  Networks
An Estimator for the Sensitivity to Perturbations of Deep Neural Networks
Naman Maheshwari
Nicholas Malaya
Scott A. Moe
J. Kulkarni
S. Gurumurthi
AAML
42
0
0
24 Jul 2023
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against
  Model Inversion Attacks
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks
Shiwei Ding
Lan Zhang
Miao Pan
Xiaoyong Yuan
AAML
87
6
0
20 Jul 2023
Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
Yong-Nam Oh
Jaeho Lee
Christopher G. Brinton
Yo-Seb Jeon
MQ
102
8
0
20 Jul 2023
EMQ: Evolving Training-free Proxies for Automated Mixed Precision
  Quantization
EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
Peijie Dong
Lujun Li
Zimian Wei
Xin-Yi Niu
Zhiliang Tian
H. Pan
MQ
84
31
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
108
3
0
20 Jul 2023
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the
  Data-Scarce Edge
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge
Young D. Kwon
Rui Li
Stylianos I. Venieris
Jagmohan Chauhan
Nicholas D. Lane
Cecilia Mascolo
72
9
0
19 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global
  Self-Attention
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
73
2
0
18 Jul 2023
Neural Network Pruning as Spectrum Preserving Process
Neural Network Pruning as Spectrum Preserving Process
S. Yao
Dantong Yu
I. Koutis
CVBM
32
1
0
18 Jul 2023
UPSCALE: Unconstrained Channel Pruning
UPSCALE: Unconstrained Channel Pruning
Alvin Wan
Hanxiang Hao
K. Patnaik
Yueyang Xu
Omer Hadad
David Guera
Zhile Ren
Qi Shan
80
4
0
17 Jul 2023
Revisiting Implicit Models: Sparsity Trade-offs Capability in
  Weight-tied Model for Vision Tasks
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks
Haobo Song
Soumajit Majumder
Tao R. Lin
VLM
93
0
0
16 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
131
75
0
16 Jul 2023
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for
  Gaze Estimation
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for Gaze Estimation
Pietro Bonazzi
Thomas Rüegg
Sizhen Bian
Yawei Li
Michele Magno
91
12
0
15 Jul 2023
Learning Sparse Neural Networks with Identity Layers
Learning Sparse Neural Networks with Identity Layers
Mingjian Ni
Guangyao Chen
Xiawu Zheng
Peixi Peng
Liuliang Yuan
Yonghong Tian
68
0
0
14 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for
  Ultra-Low-Power Edge Systems
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
72
9
0
12 Jul 2023
Search-time Efficient Device Constraints-Aware Neural Architecture
  Search
Search-time Efficient Device Constraints-Aware Neural Architecture Search
Oshin Dutta
Tanu Kanvar
Sumeet Agarwal
64
3
0
10 Jul 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication
  Kernels
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
63
0
0
08 Jul 2023
Previous
123...121314...686970
Next