Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
David Qiu
David Rim
Shaojin Ding
Oleg Rybakov
Yanzhang He
MQ
77
4
0
24 May 2023
PruMUX: Augmenting Data Multiplexing with Model Compression
Yushan Su
Vishvak Murahari
Karthik Narasimhan
Keqin Li
70
3
0
24 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
214
0
0
23 May 2023
Layer-adaptive Structured Pruning Guided by Latency
Siyuan Pan
Linna Zhang
Jie Zhang
Xiaoshuang Li
Liang Hou
Xiaobing Tu
66
0
0
23 May 2023
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study
Muzhou Yu
Linfeng Zhang
Kaisheng Ma
68
2
0
22 May 2023
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers
Julian Moosmann
Marco Giordano
Christian Vogt
Michele Magno
MQ
ObjD
60
20
0
22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
71
27
0
22 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
66
22
0
21 May 2023
Self-Distillation with Meta Learning for Knowledge Graph Completion
Yunshui Li
Junhao Liu
Chengming Li
Min Yang
81
5
0
20 May 2023
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
70
19
0
18 May 2023
PDP: Parameter-free Differentiable Pruning is All You Need
Minsik Cho
Saurabh N. Adya
Devang Naik
VLM
67
12
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGen
DiffM
152
254
0
17 May 2023
Analyzing Compression Techniques for Computer Vision
Maniratnam Mandal
Imran Khan
82
1
0
14 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
115
0
0
13 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data
Zhao Song
Mingquan Ye
79
4
0
13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition
Suhaila M. Shakiah
Rupak Vignesh Swaminathan
Hieu Duy Nguyen
Raviteja Chinta
Tariq Afzal
Nathan Susanj
Athanasios Mouchtaris
Grant P. Strimel
Ariya Rastrow
56
1
0
12 May 2023
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems
Yeshwanth Venkatesha
Youngeun Kim
Hyoungseob Park
Priyadarshini Panda
FedML
45
4
0
11 May 2023
Post-training Model Quantization Using GANs for Synthetic Data Generation
Athanasios Masouris
Mansi Sharma
Adrian Boguszewski
Alexander Kozlov
Zhuo Wu
Raymond Lo
MQ
60
0
0
10 May 2023
VEDLIoT -- Next generation accelerated AIoT systems and applications
Kevin Mika
R. Griessl
N. Kucza
F. Porrmann
M. Kaiser
...
Mario Porrmann
Hans-Martin Heyn
E. Knauss
Yufei Mao
Franz Meierhofer
68
3
0
09 May 2023
DietCNN: Multiplication-free Inference for Quantized CNNs
Swarnava Dey
P. Dasgupta
P. Chakrabarti
MQ
116
1
0
09 May 2023
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Lingjiao Chen
Matei A. Zaharia
James Zou
LLMAG
189
251
0
09 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
57
0
0
08 May 2023
Compressing audio CNNs with graph centrality based filter pruning
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
42
2
0
05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
61
4
0
04 May 2023
Input Layer Binarization with Bit-Plane Encoding
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
62
6
0
04 May 2023
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions
Lin Chen
Shitong Wu
Wen-Long Ye
Huihui Wu
Wen-Ying Zhang
Hao Wu
Bo Bai
16
6
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
158
23
0
04 May 2023
Dynamic Sparse Training with Structured Sparsity
Mike Lasby
A. Golubeva
Utku Evci
Mihai Nica
Yani Andrew Ioannou
186
23
0
03 May 2023
A Digital Twin Empowered Lightweight Model Sharing Scheme for Multi-Robot Systems
Kai Xiong
Zhihong Wang
S. Leng
Jianhua He
44
9
0
03 May 2023
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge Platforms
Ziyang Zhang
Huan Li
Yang Zhao
Changyao Lin
Jie Liu
64
3
0
01 May 2023
CORSD: Class-Oriented Relational Self Distillation
Muzhou Yu
S. Tan
Kailu Wu
Runpei Dong
Linfeng Zhang
Kaisheng Ma
41
0
0
28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models
D. Honegger
Konstantin Schurholt
Damian Borth
88
4
0
26 Apr 2023
Optimizing Deep Learning Models For Raspberry Pi
Sa Ameen
Kangaranmulle Siriwardana
Theodoros Theodoridis
VLM
34
7
0
25 Apr 2023
Multiplierless In-filter Computing for tinyML Platforms
Abhishek Ramdas Nair
P. Nath
S. Chakrabartty
Chetan Singh Thakur
34
1
0
24 Apr 2023
The Case for Hierarchical Deep Learning Inference at the Network Edge
Ghina Al-Atat
Andrea Fresa
Adarsh Prasad Behera
Vishnu Narayanan Moothedath
James Gross
J. Champati
75
8
0
23 Apr 2023
Deep Convolutional Tables: Deep Learning without Convolutions
S. Dekel
Y. Keller
Aharon Bar-Hillel
3DV
93
0
0
23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model
Zhepeng Wang
Jinyang Li
Zhirui Hu
Blake Gage
Elizabeth Iwasawa
Weiwen Jiang
103
11
0
23 Apr 2023
Identifying Appropriate Intellectual Property Protection Mechanisms for Machine Learning Models: A Systematization of Watermarking, Fingerprinting, Model Access, and Attacks
Isabell Lederer
Rudolf Mayer
Andreas Rauber
98
19
0
22 Apr 2023
Securing Neural Networks with Knapsack Optimization
Yakir Gorski
Amir Jevnisek
S. Avidan
AAML
51
0
0
20 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
138
108
0
20 Apr 2023
Knowledge Distillation Under Ideal Joint Classifier Assumption
Huayu Li
Xiwen Chen
G. Ditzler
Janet Roveda
Ao Li
51
1
0
19 Apr 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
54
0
0
19 Apr 2023
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
63
18
0
19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption
Wouter Legiest
Jan-Pieter DÁnvers
Furkan Turan
Michiel Van Beirendonck
Ingrid Verbauwhede
MQ
62
6
0
19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Ruihao Gong
Jian Ren
Zhengang Li
MQ
78
36
0
18 Apr 2023
Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks
Chenqiu Zhao
Guanfang Dong
Shupei Zhang
Zijie Tan
Anup Basu
101
2
0
17 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
215
3
0
17 Apr 2023
SalientGrads: Sparse Models for Communication Efficient and Data Aware Distributed Federated Training
Riyasat Ohib
Bishal Thapaliya
Pratyush Gaggenapalli
Qingbin Liu
Vince D. Calhoun
Sergey Plis
FedML
69
2
0
15 Apr 2023
Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model
Dingcheng Yang
Wenjian Yu
Zihao Xiao
Jiaqi Luo
AAML
DiffM
62
5
0
14 Apr 2023
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving Services
Dewant Katare
Diego Perino
J. Nurmi
M. Warnier
Marijn Janssen
Aaron Yi Ding
132
40
0
13 Apr 2023
Previous
1
2
3
...
14
15
16
...
68
69
70
Next