Papers
Communities
Organizations
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
PointDistiller: Structured Knowledge Distillation Towards Efficient and Compact 3D Detection
Linfeng Zhang
Runpei Dong
Hung-Shuo Tai
Kaisheng Ma
3DPC
154
50
0
23 May 2022
Parameter-Efficient Sparsity for Large Language Models Fine-Tuning
Yuchao Li
Fuli Luo
Chuanqi Tan
Mengdi Wang
Songfang Huang
Shen Li
Junjie Bai
MQ
121
34
0
23 May 2022
QADAM: Quantization-Aware DNN Accelerator Modeling for Pareto-Optimality
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
60
2
0
20 May 2022
Energy-efficient Deployment of Deep Learning Applications on Cortex-M based Microcontrollers using Deep Compression
M. Deutel
Philipp Woller
Christopher Mutschler
Jürgen Teich
119
4
0
20 May 2022
Service Delay Minimization for Federated Learning over Mobile Devices
Rui Chen
Dian Shi
Xiaoqi Qin
Dongjie Liu
Miao Pan
Shuguang Cui
FedML
90
34
0
19 May 2022
QAPPA: Quantization-Aware Power, Performance, and Area Modeling of DNN Accelerators
A. Inci
Siri Garudanagiri Virupaksha
Aman Jain
Venkata Vivek Thallam
Ruizhou Ding
Diana Marculescu
MQ
54
5
0
17 May 2022
Effect of Batch Normalization on Noise Resistant Property of Deep Learning Models
Omobayode Fagbohungbe
Lijun Qian
75
10
0
15 May 2022
A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
Babak Rokh
A. Azarpeyvand
Alireza Khanteymoori
MQ
135
103
0
14 May 2022
ImageSig: A signature transform for ultra-lightweight image recognition
Mohamed Ramzy Ibrahim
Terry Lyons
VLM
143
7
0
13 May 2022
Knowledge Distillation Meets Open-Set Semi-Supervised Learning
Jing Yang
Xiatian Zhu
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
92
10
0
13 May 2022
Fast Conditional Network Compression Using Bayesian HyperNetworks
Phuoc Nguyen
T. Tran
Ky Le
Sunil R. Gupta
Santu Rana
Dang Nguyen
Trong Nguyen
S. Ryan
Svetha Venkatesh
BDL
54
7
0
13 May 2022
Blueprint Separable Residual Network for Efficient Image Super-Resolution
Zheyu Li
Yingqi Liu
Xiangyu Chen
Haoming Cai
Jinjin Gu
Yu Qiao
Chao Dong
98
141
0
12 May 2022
Target Aware Network Architecture Search and Compression for Efficient Knowledge Transfer
S. H. Shabbeer Basha
Debapriya Tula
Sravan Kumar Vinakota
S. Dubey
64
3
0
12 May 2022
Sparseloop: An Analytical Approach To Sparse Tensor Accelerator Modeling
Yannan Nellie Wu
Po-An Tsai
A. Parashar
Vivienne Sze
J. Emer
77
62
0
12 May 2022
Tiny Robot Learning: Challenges and Directions for Machine Learning in Resource-Constrained Robots
Sabrina M. Neuman
Brian Plancher
Bardienus P. Duisterhof
Srivatsan Krishnan
Colby R. Banbury
...
Shvetank Prakash
Jason J. Jabbour
Aleksandra Faust
Guido de Croon
Vijay Janapa Reddi
96
39
0
11 May 2022
Revisiting Random Channel Pruning for Neural Network Compression
Yawei Li
Kamil Adamczewski
Wen Li
Shuhang Gu
Radu Timofte
Luc Van Gool
122
86
0
11 May 2022
NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results
Yawei Li
Peng Sun
Radu Timofte
Luc Van Gool
Fang Kong
...
Deng-Guang Zhou
Kun Zeng
Han-Yuan Lin
Xinyu Chen
Jin-Tao Fang
SupR
83
78
0
11 May 2022
Task-specific Compression for Multi-task Language Models using Attribution-based Pruning
Nakyeong Yang
Yunah Jang
Hwanhee Lee
Seohyeong Jung
Kyomin Jung
40
9
0
09 May 2022
A Survey on AI Sustainability: Emerging Trends on Learning Algorithms and Research Challenges
Zhenghua Chen
Min-man Wu
Alvin Chan
Xiaoli Li
Yew-Soon Ong
71
7
0
08 May 2022
Impact of Learning Rate on Noise Resistant Property of Deep Learning Models
Omobayode Fagbohungbe
Lijun Qian
60
3
0
08 May 2022
Impact of L1 Batch Normalization on Analog Noise Resistant Property of Deep Learning Models
Omobayode Fagbohungbe
Lijun Qian
62
0
0
07 May 2022
Automatic Block-wise Pruning with Auxiliary Gating Structures for Deep Convolutional Neural Networks
Zhaofeng Si
H. Qi
Xiaoyu Song
CVBM
87
0
0
07 May 2022
Online Model Compression for Federated Learning with Large Models
Tien-Ju Yang
Yonghui Xiao
Giovanni Motta
F. Beaufays
Rajiv Mathews
Mingqing Chen
FedML
MQ
90
8
0
06 May 2022
EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers
Junting Pan
Adrian Bulat
Fuwen Tan
Xiatian Zhu
Łukasz Dudziak
Hongsheng Li
Georgios Tzimiropoulos
Brais Martínez
ViT
104
198
0
06 May 2022
Green Accelerated Hoeffding Tree
E. García-Martín
Albert Bifet
Niklas Lavesson
Rikard König
Henrik Linusson
47
7
0
06 May 2022
Machine Learning Operations (MLOps): Overview, Definition, and Architecture
Dominik Kreuzberger
Niklas Kühl
Sebastian Hirschl
VLM
AI4CE
108
361
0
04 May 2022
Compact Neural Networks via Stacking Designed Basic Units
Weichao Lan
Y. Cheung
Juyong Jiang
60
0
0
03 May 2022
Lite Pose: Efficient Architecture Design for 2D Human Pose Estimation
Yihan Wang
Zhekai Zhang
Han Cai
Wei-Ming Chen
Song Han
3DH
123
75
0
03 May 2022
Jack and Masters of all Trades: One-Pass Learning Sets of Model Sets From Large Pre-Trained Models
Han Xiang Choong
Yew-Soon Ong
Abhishek Gupta
Caishun Chen
Ray Lim
75
5
0
02 May 2022
Schrödinger's FP: Dynamic Adaptation of Floating-Point Containers for Deep Learning Training
Milovs Nikolić
Enrique Torres Sanchez
Jia-Hui Wang
Ali Hadi Zadeh
Mostafa Mahmoud
Ameer Abdelhadi
Kareem Ibrahim
Andreas Moshovos
MQ
74
1
0
28 Apr 2022
Rate-Constrained Remote Contextual Bandits
Francesco Pase
Deniz Gündüz
M. Zorzi
67
8
0
26 Apr 2022
Federated Progressive Sparsification (Purge, Merge, Tune)+
Dimitris Stripelis
Umang Gupta
Greg Ver Steeg
J. Ambite
FedML
66
11
0
26 Apr 2022
Attentive Fine-Grained Structured Sparsity for Image Restoration
Junghun Oh
Heewon Kim
Seungjun Nah
Chee Hong
Jonghyun Choi
Kyoung Mu Lee
136
20
0
26 Apr 2022
PVNAS: 3D Neural Architecture Search with Point-Voxel Convolution
Zhijian Liu
Haotian Tang
Shengyu Zhao
Kevin Shao
Song Han
3DPC
72
40
0
25 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
116
117
0
25 Apr 2022
A Tale of Two Models: Constructing Evasive Attacks on Edge Models
Wei Hao
Aahil Awatramani
Jia-Bin Hu
Chengzhi Mao
Pin-Chun Chen
Eyal Cidon
Asaf Cidon
Junfeng Yang
AAML
95
4
0
22 Apr 2022
Multiply-and-Fire (MNF): An Event-driven Sparse Neural Network Accelerator
Miao Yu
Tingting Xiang
Venkata Pavan Kumar Miriyala
Trevor E. Carlson
46
1
0
20 Apr 2022
End-to-End Sensitivity-Based Filter Pruning
Z. Babaiee
Lucas Liebenwein
Ramin Hasani
Daniela Rus
Radu Grosu
AAML
66
1
0
15 Apr 2022
Joint Coreset Construction and Quantization for Distributed Machine Learning
Hanlin Lu
Changchang Liu
Shiqiang Wang
T. He
Vijay Narayanan
Kevin S. Chan
Stephen Pasteris
39
2
0
13 Apr 2022
DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization
Chaoli Wang
J. Han
96
38
0
13 Apr 2022
HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition
J. Yoon
Beom Jun Woo
N. Kim
80
13
0
13 Apr 2022
Neural Network Pruning by Cooperative Coevolution
Haopu Shang
Jia-Liang Wu
Wenjing Hong
Chaojun Qian
VLM
63
23
0
12 Apr 2022
Compact Model Training by Low-Rank Projection with Energy Transfer
K. Guo
Zhenquan Lin
Xiaofen Xing
Fang Liu
Xiangmin Xu
83
2
0
12 Apr 2022
Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning
Daniel Becking
H. Kirchhoffer
G. Tech
Paul Haase
Karsten Müller
H. Schwarz
Wojciech Samek
FedML
64
4
0
09 Apr 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
100
11
0
06 Apr 2022
Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network
Byung-Kwan Lee
Junho Kim
Y. Ro
AAML
59
20
0
06 Apr 2022
A Survey on Dropout Methods and Experimental Verification in Recommendation
Yongqian Li
Weizhi Ma
C. L. Philip Chen
Hao Fei
Yiqun Liu
Shaoping Ma
Yue Yang
92
11
0
05 Apr 2022
Aligned Weight Regularizers for Pruning Pretrained Neural Networks
J. Ó. Neill
Sourav Dutta
H. Assem
VLM
68
2
0
04 Apr 2022
Soft Threshold Ternary Networks
Weixiang Xu
Xiangyu He
Tianli Zhao
Qinghao Hu
Peisong Wang
Jian Cheng
MQ
74
7
0
04 Apr 2022
Monarch: Expressive Structured Matrices for Efficient and Accurate Training
Tri Dao
Beidi Chen
N. Sohoni
Arjun D Desai
Michael Poli
Jessica Grogan
Alexander Liu
Aniruddh Rao
Atri Rudra
Christopher Ré
143
97
0
01 Apr 2022
Previous
1
2
3
...
22
23
24
...
68
69
70
Next