1510.00149
Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Yixuan Li
Hao Wen
Yunxin Liu
96
1
0
29 Aug 2023
Uncovering the Hidden Cost of Model Compression
Diganta Misra
Muawiz Chaudhary
Agam Goyal
Bharat Runwal
Pin-Yu Chen
VLM
97
0
0
29 Aug 2023
Low-bit Quantization for Deep Graph Neural Networks with Smoothness-aware Message Propagation
Shuang Wang
B. Eravcı
Rustam Guliyev
Hakan Ferhatosmanoglu
GNN
MQ
70
9
0
29 Aug 2023
Maestro: Uncovering Low-Rank Structures via Trainable Decomposition
Samuel Horváth
Stefanos Laskaridis
Shashank Rajput
Hongyi Wang
BDL
90
4
0
28 Aug 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
141
21
0
27 Aug 2023
Homological Convolutional Neural Networks
Antonio Briola
Yuanrong Wang
Silvia Bartolucci
T. Aste
LMTD
87
7
0
26 Aug 2023
MST-compression: Compressing and Accelerating Binary Neural Networks with Minimum Spanning Tree
Quang Hieu Vo
Linh-Tam Tran
Sung-Ho Bae
Lokwon Kim
Choong Seon Hong
MQ
94
1
0
26 Aug 2023
REFT: Resource-Efficient Federated Training Framework for Heterogeneous and Resource-Constrained Environments
Humaid Ahmed Desai
Amr B. Hilal
Hoda Eldardiry
67
0
0
25 Aug 2023
Federated Learning in IoT: a Survey from a Resource-Constrained Perspective
Ishmeet Kaur
87
3
0
25 Aug 2023
Data-Side Efficiencies for Lightweight Convolutional Neural Networks
Bryan Bo Cao
Lawrence O'Gorman
Michael J. Coss
Shubham Jain
62
2
0
24 Aug 2023
Multi-stage feature decorrelation constraints for improving CNN classification performance
Qiuyu Zhu
Hao Wang
Xuewen Zu
Chengfei Liu
42
0
0
24 Aug 2023
Enhancing Energy-Awareness in Deep Learning through Fine-Grained Energy Measurement
S. Rajput
Tim Widmayer
Ziyuan Shang
M. Kechagia
Federica Sarro
Tushar Sharma
106
4
0
23 Aug 2023
Sampling From Autoencoders' Latent Space via Quantization And Probability Mass Function Concepts
Aymene Mohammed Bouayed
Adrian Iaccovelli
D. Naccache
57
0
0
21 Aug 2023
Efficient Joint Optimization of Layer-Adaptive Weight Pruning in Deep Neural Networks
Kaixin Xu
Zhe Wang
Xue Geng
Jie Lin
Min-man Wu
Xiaoli Li
Weisi Lin
77
15
0
21 Aug 2023
Benchmarking Adversarial Robustness of Compressed Deep Learning Models
Brijesh Vora
Kartik Patwari
Syed Mahbub Hafiz
Zubair Shafiq
Chen-Nee Chuah
AAML
79
2
0
16 Aug 2023
A Survey on Model Compression for Large Language Models
Xunyu Zhu
Jian Li
Yong Liu
Can Ma
Weiping Wang
143
233
0
15 Aug 2023
Ada-QPacknet -- adaptive pruning with bit width reduction as an efficient continual learning method without forgetting
Marcin Pietroń
Dominik Zurek
Kamil Faber
Roberto Corizzo
CLL
73
2
0
14 Aug 2023
Estimator Meets Equilibrium Perspective: A Rectified Straight Through Estimator for Binary Neural Networks Training
Xiao-Ming Wu
Dian Zheng
Zuhao Liu
Weishi Zheng
MQ
118
18
0
13 Aug 2023
Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation
Seyedarmin Azizi
M. Nazemi
A. Fayyazi
Massoud Pedram
MQ
64
5
0
12 Aug 2023
SSL-Auth: An Authentication Framework by Fragile Watermarking for Pre-trained Encoders in Self-supervised Learning
Xiaobei Li
Changchun Yin
Liyue Zhu
Xiaogang Xu
Liming Fang
Run Wang
Chenhao Lin
AAML
86
1
0
09 Aug 2023
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks
Jue Chen
Huan Yuan
Jianchao Tan
Bin Chen
Chengru Song
Di Zhang
71
4
0
09 Aug 2023
Lossy and Lossless (L$^2$) Post-training Model Size Compression
Yumeng Shi
Shihao Bai
Xiuying Wei
Ruihao Gong
Jianlei Yang
53
3
0
08 Aug 2023
D-Score: A Synapse-Inspired Approach for Filter Pruning
Doyoung Park
Jinsoo Kim
Ji-Min Nam
Jooyoung Chang
S. Park
62
0
0
08 Aug 2023
Pruning a neural network using Bayesian inference
Sunil Mathew
D. Rowe
34
0
0
04 Aug 2023
Survey on Computer Vision Techniques for Internet-of-Things Devices
Ishmeet Kaur
Adwaita Janardhan Jadhav
AI4CE
60
1
0
02 Aug 2023
An Introduction to Bi-level Optimization: Foundations and Applications in Signal Processing and Machine Learning
Yihua Zhang
Prashant Khanduri
Ioannis C. Tsaknakis
Yuguang Yao
Min-Fong Hong
Sijia Liu
AI4CE
129
31
0
01 Aug 2023
Evaluating Spiking Neural Network On Neuromorphic Platform For Human Activity Recognition
Sizhen Bian
Michele Magno
55
6
0
01 Aug 2023
Improving Generalization of Adversarial Training via Robust Critical Fine-Tuning
Kaijie Zhu
Jindong Wang
Xixu Hu
Xingxu Xie
G. Yang
AAML
76
25
0
01 Aug 2023
Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy
Shibo Jie
Haoqing Wang
Zhiwei Deng
76
34
0
31 Jul 2023
Alpha-GPT: Human-AI Interactive Alpha Mining for Quantitative Investment
Saizhuo Wang
Hang Yuan
Leon Zhou
L. Ni
H. Shum
Jian Guo
67
25
0
31 Jul 2023
Stable Adam Optimization for 16-bit Neural Networks Training
Juyoung Yun
25
1
0
30 Jul 2023
Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs
Or Sharir
Anima Anandkumar
60
0
0
27 Jul 2023
Object-based Probabilistic Similarity Evidence of Sparse Latent Features from Fully Convolutional Networks
Cyril Juliani
25
0
0
25 Jul 2023
Mitigating Memory Wall Effects in CNN Engines with On-the-Fly Weights Generation
Stylianos I. Venieris
Javier Fernandez-Marques
Nicholas D. Lane
MQ
68
3
0
25 Jul 2023
An Estimator for the Sensitivity to Perturbations of Deep Neural Networks
Naman Maheshwari
Nicholas Malaya
Scott A. Moe
J. Kulkarni
S. Gurumurthi
AAML
42
0
0
24 Jul 2023
PATROL: Privacy-Oriented Pruning for Collaborative Inference Against Model Inversion Attacks
Shiwei Ding
Lan Zhang
Miao Pan
Xiaoyong Yuan
AAML
87
6
0
20 Jul 2023
Communication-Efficient Split Learning via Adaptive Feature-Wise Compression
Yong-Nam Oh
Jaeho Lee
Christopher G. Brinton
Yo-Seb Jeon
MQ
102
8
0
20 Jul 2023
EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
Peijie Dong
Lujun Li
Zimian Wei
Xin-Yi Niu
Zhiliang Tian
H. Pan
MQ
84
31
0
20 Jul 2023
Approximate Computing Survey, Part II: Application-Specific & Architectural Approximation Techniques and Applications
Vasileios Leon
Muhammad Abdullah Hanif
Giorgos Armeniakos
Xun Jiao
Mohamed Bennai
K. Pekmestzi
Dimitrios Soudris
108
3
0
20 Jul 2023
TinyTrain: Resource-Aware Task-Adaptive Sparse Training of DNNs at the Data-Scarce Edge
Young D. Kwon
Rui Li
Stylianos I. Venieris
Jagmohan Chauhan
Nicholas D. Lane
Cecilia Mascolo
72
9
0
19 Jul 2023
Light-Weight Vision Transformer with Parallel Local and Global Self-Attention
Nikolas Ebert
Laurenz Reichardt
D. Stricker
Oliver Wasenmüller
ViT
73
2
0
18 Jul 2023
Neural Network Pruning as Spectrum Preserving Process
S. Yao
Dantong Yu
I. Koutis
CVBM
32
1
0
18 Jul 2023
UPSCALE: Unconstrained Channel Pruning
Alvin Wan
Hanxiang Hao
K. Patnaik
Yueyang Xu
Omer Hadad
David Guera
Zhile Ren
Qi Shan
80
4
0
17 Jul 2023
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks
Haobo Song
Soumajit Majumder
Tao R. Lin
VLM
93
0
0
16 Jul 2023
A Survey of Techniques for Optimizing Transformer Inference
Krishna Teja Chitty-Venkata
Sparsh Mittal
M. Emani
V. Vishwanath
Arun Somani
131
75
0
16 Jul 2023
TinyTracker: Ultra-Fast and Ultra-Low-Power Edge Vision In-Sensor for Gaze Estimation
Pietro Bonazzi
Thomas Rüegg
Sizhen Bian
Yawei Li
Michele Magno
91
12
0
15 Jul 2023
Learning Sparse Neural Networks with Identity Layers
Mingjian Ni
Guangyao Chen
Xiawu Zheng
Peixi Peng
Liuliang Yuan
Yonghong Tian
68
0
0
14 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
72
9
0
12 Jul 2023
Search-time Efficient Device Constraints-Aware Neural Architecture Search
Oshin Dutta
Tanu Kanvar
Sumeet Agarwal
64
3
0
10 Jul 2023
Rosko: Row Skipping Outer Products for Sparse Matrix Multiplication Kernels
Vikas Natesh
Andrew Sabot
H. T. Kung
Mark Ting
63
0
0
08 Jul 2023