ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models
David Qiu
David Rim
Shaojin Ding
Oleg Rybakov
Yanzhang He
MQ
77
4
0
24 May 2023
PruMUX: Augmenting Data Multiplexing with Model Compression
PruMUX: Augmenting Data Multiplexing with Model Compression
Yushan Su
Vishvak Murahari
Karthik Narasimhan
Keqin Li
70
3
0
24 May 2023
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
Combining Multi-Objective Bayesian Optimization with Reinforcement Learning for TinyML
M. Deutel
G. Kontes
Christopher Mutschler
Jürgen Teich
214
0
0
23 May 2023
Layer-adaptive Structured Pruning Guided by Latency
Layer-adaptive Structured Pruning Guided by Latency
Siyuan Pan
Linna Zhang
Jie Zhang
Xiaoshuang Li
Liang Hou
Xiaobing Tu
66
0
0
23 May 2023
Revisiting Data Augmentation in Model Compression: An Empirical and
  Comprehensive Study
Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study
Muzhou Yu
Linfeng Zhang
Kaisheng Ma
68
2
0
22 May 2023
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object
  Detection Network for Low Power Microcontrollers
TinyissimoYOLO: A Quantized, Low-Memory Footprint, TinyML Object Detection Network for Low Power Microcontrollers
Julian Moosmann
Marco Giordano
Christian Vogt
Michele Magno
MQObjD
60
20
0
22 May 2023
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical
  Structured Sparsity
HighLight: Efficient and Flexible DNN Acceleration with Hierarchical Structured Sparsity
Yannan Nellie Wu
Po-An Tsai
Saurav Muralidharan
A. Parashar
Vivienne Sze
J. Emer
71
27
0
22 May 2023
Integer or Floating Point? New Outlooks for Low-Bit Quantization on
  Large Language Models
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models
Yijia Zhang
Lingran Zhao
Shijie Cao
Wenqiang Wang
Ting Cao
Fan Yang
Mao Yang
Shanghang Zhang
Ningyi Xu
MQ
66
22
0
21 May 2023
Self-Distillation with Meta Learning for Knowledge Graph Completion
Self-Distillation with Meta Learning for Knowledge Graph Completion
Yunshui Li
Junhao Liu
Chengming Li
Min Yang
81
5
0
20 May 2023
Efficient Prompting via Dynamic In-Context Learning
Efficient Prompting via Dynamic In-Context Learning
Wangchunshu Zhou
Yuchen Eleanor Jiang
Ryan Cotterell
Mrinmaya Sachan
70
19
0
18 May 2023
PDP: Parameter-free Differentiable Pruning is All You Need
PDP: Parameter-free Differentiable Pruning is All You Need
Minsik Cho
Saurabh N. Adya
Devang Naik
VLM
67
12
0
18 May 2023
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized
  Attention
FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention
Guangxuan Xiao
Tianwei Yin
William T. Freeman
F. Durand
Song Han
VGenDiffM
152
254
0
17 May 2023
Analyzing Compression Techniques for Computer Vision
Analyzing Compression Techniques for Computer Vision
Maniratnam Mandal
Imran Khan
82
1
0
14 May 2023
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
TIPS: Topologically Important Path Sampling for Anytime Neural Networks
Guihong Li
Kartikeya Bhardwaj
Yuedong Yang
R. Marculescu
AAML
115
0
0
13 May 2023
Efficient Asynchronize Stochastic Gradient Algorithm with Structured
  Data
Efficient Asynchronize Stochastic Gradient Algorithm with Structured Data
Zhao Song
Mingquan Ye
79
4
0
13 May 2023
Accelerator-Aware Training for Transducer-Based Speech Recognition
Accelerator-Aware Training for Transducer-Based Speech Recognition
Suhaila M. Shakiah
Rupak Vignesh Swaminathan
Hieu Duy Nguyen
Raviteja Chinta
Tariq Afzal
Nathan Susanj
Athanasios Mouchtaris
Grant P. Strimel
Ariya Rastrow
56
1
0
12 May 2023
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated
  Learning Systems
Divide-and-Conquer the NAS puzzle in Resource Constrained Federated Learning Systems
Yeshwanth Venkatesha
Youngeun Kim
Hyoungseob Park
Priyadarshini Panda
FedML
45
4
0
11 May 2023
Post-training Model Quantization Using GANs for Synthetic Data
  Generation
Post-training Model Quantization Using GANs for Synthetic Data Generation
Athanasios Masouris
Mansi Sharma
Adrian Boguszewski
Alexander Kozlov
Zhuo Wu
Raymond Lo
MQ
60
0
0
10 May 2023
VEDLIoT -- Next generation accelerated AIoT systems and applications
VEDLIoT -- Next generation accelerated AIoT systems and applications
Kevin Mika
R. Griessl
N. Kucza
F. Porrmann
M. Kaiser
...
Mario Porrmann
Hans-Martin Heyn
E. Knauss
Yufei Mao
Franz Meierhofer
68
3
0
09 May 2023
DietCNN: Multiplication-free Inference for Quantized CNNs
DietCNN: Multiplication-free Inference for Quantized CNNs
Swarnava Dey
P. Dasgupta
P. Chakrabarti
MQ
116
1
0
09 May 2023
FrugalGPT: How to Use Large Language Models While Reducing Cost and
  Improving Performance
FrugalGPT: How to Use Large Language Models While Reducing Cost and Improving Performance
Lingjiao Chen
Matei A. Zaharia
James Zou
LLMAG
189
251
0
09 May 2023
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task
  Adaptation
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation
J. Heo
S. Azizi
A. Fayyazi
Massoud Pedram
57
0
0
08 May 2023
Compressing audio CNNs with graph centrality based filter pruning
Compressing audio CNNs with graph centrality based filter pruning
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
42
2
0
05 May 2023
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device
  Learning
CAMEL: Co-Designing AI Models and Embedded DRAMs for Efficient On-Device Learning
Sai Qian Zhang
Thierry Tambe
Nestor Cuevas
Gu-Yeon Wei
David Brooks
61
4
0
04 May 2023
Input Layer Binarization with Bit-Plane Encoding
Input Layer Binarization with Bit-Plane Encoding
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
62
6
0
04 May 2023
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate
  Functions
A Constrained BA Algorithm for Rate-Distortion and Distortion-Rate Functions
Lin Chen
Shitong Wu
Wen-Long Ye
Huihui Wu
Wen-Ying Zhang
Hao Wu
Bo Bai
16
6
0
04 May 2023
Cuttlefish: Low-Rank Model Training without All the Tuning
Cuttlefish: Low-Rank Model Training without All the Tuning
Hongyi Wang
Saurabh Agarwal
Pongsakorn U-chupala
Yoshiki Tanaka
Eric P. Xing
Dimitris Papailiopoulos
OffRL
158
23
0
04 May 2023
Dynamic Sparse Training with Structured Sparsity
Dynamic Sparse Training with Structured Sparsity
Mike Lasby
A. Golubeva
Utku Evci
Mihai Nica
Yani Andrew Ioannou
186
23
0
03 May 2023
A Digital Twin Empowered Lightweight Model Sharing Scheme for
  Multi-Robot Systems
A Digital Twin Empowered Lightweight Model Sharing Scheme for Multi-Robot Systems
Kai Xiong
Zhihong Wang
S. Leng
Jianhua He
44
9
0
03 May 2023
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge
  Platforms
BCEdge: SLO-Aware DNN Inference Services with Adaptive Batching on Edge Platforms
Ziyang Zhang
Huan Li
Yang Zhao
Changyao Lin
Jie Liu
64
3
0
01 May 2023
CORSD: Class-Oriented Relational Self Distillation
CORSD: Class-Oriented Relational Self Distillation
Muzhou Yu
S. Tan
Kailu Wu
Runpei Dong
Linfeng Zhang
Kaisheng Ma
41
0
0
28 Apr 2023
Sparsified Model Zoo Twins: Investigating Populations of Sparsified
  Neural Network Models
Sparsified Model Zoo Twins: Investigating Populations of Sparsified Neural Network Models
D. Honegger
Konstantin Schurholt
Damian Borth
88
4
0
26 Apr 2023
Optimizing Deep Learning Models For Raspberry Pi
Optimizing Deep Learning Models For Raspberry Pi
Sa Ameen
Kangaranmulle Siriwardana
Theodoros Theodoridis
VLM
34
7
0
25 Apr 2023
Multiplierless In-filter Computing for tinyML Platforms
Multiplierless In-filter Computing for tinyML Platforms
Abhishek Ramdas Nair
P. Nath
S. Chakrabartty
Chetan Singh Thakur
34
1
0
24 Apr 2023
The Case for Hierarchical Deep Learning Inference at the Network Edge
The Case for Hierarchical Deep Learning Inference at the Network Edge
Ghina Al-Atat
Andrea Fresa
Adarsh Prasad Behera
Vishnu Narayanan Moothedath
James Gross
J. Champati
75
8
0
23 Apr 2023
Deep Convolutional Tables: Deep Learning without Convolutions
Deep Convolutional Tables: Deep Learning without Convolutions
S. Dekel
Y. Keller
Aharon Bar-Hillel
3DV
93
0
0
23 Apr 2023
QuMoS: A Framework for Preserving Security of Quantum Machine Learning
  Model
QuMoS: A Framework for Preserving Security of Quantum Machine Learning Model
Zhepeng Wang
Jinyang Li
Zhirui Hu
Blake Gage
Elizabeth Iwasawa
Weiwen Jiang
103
11
0
23 Apr 2023
Identifying Appropriate Intellectual Property Protection Mechanisms for
  Machine Learning Models: A Systematization of Watermarking, Fingerprinting,
  Model Access, and Attacks
Identifying Appropriate Intellectual Property Protection Mechanisms for Machine Learning Models: A Systematization of Watermarking, Fingerprinting, Model Access, and Attacks
Isabell Lederer
Rudolf Mayer
Andreas Rauber
98
19
0
22 Apr 2023
Securing Neural Networks with Knapsack Optimization
Securing Neural Networks with Knapsack Optimization
Yakir Gorski
Amir Jevnisek
S. Avidan
AAML
51
0
0
20 Apr 2023
Radar-Camera Fusion for Object Detection and Semantic Segmentation in
  Autonomous Driving: A Comprehensive Review
Radar-Camera Fusion for Object Detection and Semantic Segmentation in Autonomous Driving: A Comprehensive Review
Shanliang Yao
Runwei Guan
Xiaoyu Huang
Zhuoxiao Li
Xiangyu Sha
...
Eng Gee Lim
H. Seo
Ka Lok Man
Xiaohui Zhu
Yutao Yue
138
108
0
20 Apr 2023
Knowledge Distillation Under Ideal Joint Classifier Assumption
Knowledge Distillation Under Ideal Joint Classifier Assumption
Huayu Li
Xiwen Chen
G. Ditzler
Janet Roveda
Ao Li
51
1
0
19 Apr 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
Adaptive Scheduling for Edge-Assisted DNN Serving
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
54
0
0
19 Apr 2023
Model Pruning Enables Localized and Efficient Federated Learning for
  Yield Forecasting and Data Sharing
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
63
18
0
19 Apr 2023
Neural Network Quantisation for Faster Homomorphic Encryption
Neural Network Quantisation for Faster Homomorphic Encryption
Wouter Legiest
Jan-Pieter DÁnvers
Furkan Turan
Michiel Van Beirendonck
Ingrid Verbauwhede
MQ
62
6
0
19 Apr 2023
Outlier Suppression+: Accurate quantization of large language models by
  equivalent and optimal shifting and scaling
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling
Xiuying Wei
Yunchen Zhang
Yuhang Li
Xiangguo Zhang
Ruihao Gong
Jian Ren
Zhengang Li
MQ
78
36
0
18 Apr 2023
Frequency Regularization: Restricting Information Redundancy of
  Convolutional Neural Networks
Frequency Regularization: Restricting Information Redundancy of Convolutional Neural Networks
Chenqiu Zhao
Guanfang Dong
Shupei Zhang
Zijie Tan
Anup Basu
101
2
0
17 Apr 2023
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Evil from Within: Machine Learning Backdoors through Hardware Trojans
Alexander Warnecke
Julian Speith
Janka Möller
Konrad Rieck
C. Paar
AAML
215
3
0
17 Apr 2023
SalientGrads: Sparse Models for Communication Efficient and Data Aware
  Distributed Federated Training
SalientGrads: Sparse Models for Communication Efficient and Data Aware Distributed Federated Training
Riyasat Ohib
Bishal Thapaliya
Pratyush Gaggenapalli
Qingbin Liu
Vince D. Calhoun
Sergey Plis
FedML
69
2
0
15 Apr 2023
Generating Adversarial Examples with Better Transferability via Masking
  Unimportant Parameters of Surrogate Model
Generating Adversarial Examples with Better Transferability via Masking Unimportant Parameters of Surrogate Model
Dingcheng Yang
Wenjian Yu
Zihao Xiao
Jiaqi Luo
AAMLDiffM
62
5
0
14 Apr 2023
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving
  Services
A Survey on Approximate Edge AI for Energy Efficient Autonomous Driving Services
Dewant Katare
Diego Perino
J. Nurmi
M. Warnier
Marijn Janssen
Aaron Yi Ding
132
40
0
13 Apr 2023
Previous
123...141516...686970
Next