1510.00149
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression
Ziwei Wang
Jiwen Lu
Han Xiao
Shengyu Liu
Jie Zhou
OffRL
71
1
0
13 Apr 2023
Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution
Z. Su
Jiehua Zhang
Tianpeng Liu
Zhen Liu
Shuanghui Zhang
M. Pietikäinen
Li Liu
70
2
0
13 Apr 2023
EcoFed: Efficient Communication for DNN Partitioning-based Federated Learning
Di Wu
R. Ullah
Philip Rodgers
Peter Kilpatrick
I. Spence
Blesson Varghese
FedML
119
1
0
11 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
66
0
0
11 Apr 2023
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
51
15
0
11 Apr 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Tao Lei
Junwen Bai
Siddhartha Brahma
Joshua Ainslie
Kenton Lee
...
Vincent Zhao
Yuexin Wu
Yue Liu
Yu Zhang
Ming-Wei Chang
BDL
AI4CE
115
64
0
11 Apr 2023
Model Sparsity Can Simplify Machine Unlearning
Jinghan Jia
Jiancheng Liu
Parikshit Ram
Yuguang Yao
Gaowen Liu
Yang Liu
Pranay Sharma
Sijia Liu
MU
206
130
0
11 Apr 2023
Graph Enabled Cross-Domain Knowledge Transfer
S. Yao
41
0
0
07 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
91
6
0
06 Apr 2023
Learning to Learn with Indispensable Connections
Sambhavi Tiwari
Manas Gogoi
Shekhar Verma
Krishna Pratap Singh
CLL
85
1
0
06 Apr 2023
HNeRV: A Hybrid Neural Representation for Videos
Hao Chen
M. Gwilliam
Ser-Nam Lim
Abhinav Shrivastava
73
77
1
05 Apr 2023
Efficient human-in-loop deep learning model training with iterative refinement and statistical result validation
Manuel Zahn
Douglas P. Perrin
16
1
0
03 Apr 2023
Optimizing data-flow in Binary Neural Networks
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
83
6
0
03 Apr 2023
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
Yuhang Li
Tamar Geller
Youngeun Kim
Priyadarshini Panda
109
41
0
02 Apr 2023
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
Abhinav Kumar
Miguel A. Guirao Aguilera
R. Tourani
Satyajayant Misra
AAML
98
0
0
31 Mar 2023
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Search and Recommendation Models on Commodity CPU Hardware
Nicholas Meisburger
V. Lakshman
Benito Geordie
Joshua Engels
David Torres Ramos
...
Benjamin Meisburger
Shubh Gupta
Yashwanth Adunukota
Tharun Medini
Anshumali Shrivastava
104
2
0
30 Mar 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
217
48
0
30 Mar 2023
Distributed Neural Representation for Reactive in situ Visualization
Qi Wu
J. Insley
V. Mateevitsi
S. Rizzi
M. Papka
Kwan-Liu Ma
80
2
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
159
98
0
27 Mar 2023
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression
Denis Kuznedelev
Soroush Tabesh
Kimia Noorbakhsh
Elias Frantar
Sara Beery
Eldar Kurtic
Dan Alistarh
MQ
VLM
82
2
0
25 Mar 2023
PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration
Richard Petri
Grace Li Zhang
Yiran Chen
Ulf Schlichtmann
Bing Li
29
6
0
24 Mar 2023
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
Jemin Lee
Yongin Kwon
Sihyeong Park
Misun Yu
Jeman Park
Hwanjun Song
ViT
MQ
90
6
0
22 Mar 2023
Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training
Xinwei Ou
Zhangxin Chen
Ce Zhu
Yipeng Liu
81
5
0
22 Mar 2023
Performance-aware Approximation of Global Channel Pruning for Multitask CNNs
Hancheng Ye
Bo Zhang
Tao Chen
Jiayuan Fan
Bin Wang
74
19
0
21 Mar 2023
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
Yuexiao Ma
Huixia Li
Xiawu Zheng
Xuefeng Xiao
Rui Wang
Shilei Wen
Xin Pan
Yong Li
Rongrong Ji
MQ
74
12
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
121
5
0
21 Mar 2023
Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing
Nived Rajaraman
Devvrit
Aryan Mokhtari
Kannan Ramchandran
76
0
0
20 Mar 2023
ExplainFix: Explainable Spatially Fixed Deep Networks
Alex Gaudio
Christos Faloutsos
A. Smailagic
P. Costa
A. Campilho
FAtt
73
3
0
18 Mar 2023
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models
Yucheng Ding
Chaoyue Niu
Fan Wu
Shaojie Tang
Chengfei Lyu
Guihai Chen
70
6
0
18 Mar 2023
Unleashing the Potential of Spiking Neural Networks by Dynamic Confidence
Chen Li
Edward Jones
Steve Furber
79
19
0
17 Mar 2023
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Jiamian Wang
Huan Wang
Yulun Zhang
Yun Fu
Zhiqiang Tao
SupR
68
2
0
16 Mar 2023
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
74
8
0
16 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
88
2
0
15 Mar 2023
R2 Loss: Range Restriction Loss for Model Compression and Quantization
Arnav Kundu
Chungkuk Yoo
Srijan Mishra
Minsik Cho
Saurabh N. Adya
MQ
69
1
0
14 Mar 2023
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
Maorong Wang
L. Xiao
T. Yamasaki
KELM
MoE
46
1
0
14 Mar 2023
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network Evaluations for Fetus Orientations, Fetal Planes, and Anatomical Features
B. Prabakaran
Paul Hamelmann
Erik Ostrowski
Mohamed Bennai
62
12
0
14 Mar 2023
Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Kaiqi Zhao
Animesh Jain
Ming Zhao
69
11
0
14 Mar 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
143
34
0
13 Mar 2023
Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning
Yunhao Cao
Peiqin Sun
Shuchang Zhou
51
4
0
13 Mar 2023
OTOV2: Automatic, Generic, User-Friendly
Tianyi Chen
Luming Liang
Tian Ding
Zhihui Zhu
Ilya Zharkov
VLM
MQ
113
36
0
13 Mar 2023
Complement Sparsification: Low-Overhead Model Pruning for Federated Learning
Xiaopeng Jiang
Cristian Borcea
FedML
85
17
0
10 Mar 2023
Sparse and Local Networks for Hypergraph Reasoning
Guangxuan Xiao
L. Kaelbling
Jiajun Wu
Jiayuan Mao
NAI
ReLM
LRM
94
1
0
09 Mar 2023
A Privacy Preserving System for Movie Recommendations Using Federated Learning
David Neumann
Andreas Lutz
Karsten Müller
Wojciech Samek
59
12
0
07 Mar 2023
An Edge-based WiFi Fingerprinting Indoor Localization Using Convolutional Neural Network and Convolutional Auto-Encoder
Amin Kargar-Barzi
Ebrahim Farahmand
Nooshin Taheri Chatrudi
A. Mahani
M. Shafique
57
8
0
07 Mar 2023
Training-Free Acceleration of ViTs with Delayed Spatial Merging
J. Heo
Seyedarmin Azizi
A. Fayyazi
Massoud Pedram
126
3
0
04 Mar 2023
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Christian Westbrook
S. Pasricha
AAML
74
3
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
87
28
0
03 Mar 2023
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
122
1
0
03 Mar 2023
TopSpark: A Timestep Optimization Methodology for Energy-Efficient Spiking Neural Networks on Autonomous Mobile Agents
Rachmad Vidya Wicaksana Putra
Mohamed Bennai
80
13
0
03 Mar 2023
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
139
4
0
02 Mar 2023