ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Learning Accurate Performance Predictors for Ultrafast Automated Model
  Compression
Learning Accurate Performance Predictors for Ultrafast Automated Model Compression
Ziwei Wang
Jiwen Lu
Han Xiao
Shengyu Liu
Jie Zhou
OffRL
71
1
0
13 Apr 2023
Boosting Convolutional Neural Networks with Middle Spectrum Grouped
  Convolution
Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution
Z. Su
Jiehua Zhang
Tianpeng Liu
Zhen Liu
Shuanghui Zhang
M. Pietikäinen
Li Liu
70
2
0
13 Apr 2023
EcoFed: Efficient Communication for DNN Partitioning-based Federated
  Learning
EcoFed: Efficient Communication for DNN Partitioning-based Federated Learning
Di Wu
R. Ullah
Philip Rodgers
Peter Kilpatrick
I. Spence
Blesson Varghese
FedML
119
1
0
11 Apr 2023
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Scale-Space Hypernetworks for Efficient Biomedical Imaging
Jose Javier Gonzalez Ortiz
John Guttag
Adrian Dalca
66
0
0
11 Apr 2023
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML
  Acceleration
SamurAI: A Versatile IoT Node With Event-Driven Wake-Up and Embedded ML Acceleration
I. Miro-Panadès
Benoît Tain
J. Christmann
David Coriat
R. Lemaire
...
Jean-Marc Philippe
Y. Thonnart
A. Valentian
Frédéric Heitzmann
F. Clermidy
51
15
0
11 Apr 2023
Conditional Adapters: Parameter-efficient Transfer Learning with Fast
  Inference
Conditional Adapters: Parameter-efficient Transfer Learning with Fast Inference
Tao Lei
Junwen Bai
Siddhartha Brahma
Joshua Ainslie
Kenton Lee
...
Vincent Zhao
Yuexin Wu
Yue Liu
Yu Zhang
Ming-Wei Chang
BDLAI4CE
115
64
0
11 Apr 2023
Model Sparsity Can Simplify Machine Unlearning
Model Sparsity Can Simplify Machine Unlearning
Jinghan Jia
Jiancheng Liu
Parikshit Ram
Yuguang Yao
Gaowen Liu
Yang Liu
Pranay Sharma
Sijia Liu
MU
206
130
0
11 Apr 2023
Graph Enabled Cross-Domain Knowledge Transfer
Graph Enabled Cross-Domain Knowledge Transfer
S. Yao
41
0
0
07 Apr 2023
Tensor Slicing and Optimization for Multicore NPUs
Tensor Slicing and Optimization for Multicore NPUs
R. Sousa
M. Pereira
Yongin Kwon
Taeho Kim
Namsoon Jung
Chang Soo Kim
Michael Frank
Guido Araujo
91
6
0
06 Apr 2023
Learning to Learn with Indispensable Connections
Learning to Learn with Indispensable Connections
Sambhavi Tiwari
Manas Gogoi
Shekhar Verma
Krishna Pratap Singh
CLL
85
1
0
06 Apr 2023
HNeRV: A Hybrid Neural Representation for Videos
HNeRV: A Hybrid Neural Representation for Videos
Hao Chen
M. Gwilliam
Ser-Nam Lim
Abhinav Shrivastava
73
77
1
05 Apr 2023
Efficient human-in-loop deep learning model training with iterative
  refinement and statistical result validation
Efficient human-in-loop deep learning model training with iterative refinement and statistical result validation
Manuel Zahn
Douglas P. Perrin
16
1
0
03 Apr 2023
Optimizing data-flow in Binary Neural Networks
Optimizing data-flow in Binary Neural Networks
Lorenzo Vorabbi
Davide Maltoni
Stefano Santi
MQ
83
6
0
03 Apr 2023
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
SEENN: Towards Temporal Spiking Early-Exit Neural Networks
Yuhang Li
Tamar Geller
Youngeun Kim
Priyadarshini Panda
109
41
0
02 Apr 2023
A Generative Framework for Low-Cost Result Validation of Machine
  Learning-as-a-Service Inference
A Generative Framework for Low-Cost Result Validation of Machine Learning-as-a-Service Inference
Abhinav Kumar
Miguel A. Guirao Aguilera
R. Tourani
Satyajayant Misra
AAML
98
0
0
31 Mar 2023
BOLT: An Automated Deep Learning Framework for Training and Deploying
  Large-Scale Search and Recommendation Models on Commodity CPU Hardware
BOLT: An Automated Deep Learning Framework for Training and Deploying Large-Scale Search and Recommendation Models on Commodity CPU Hardware
Nicholas Meisburger
V. Lakshman
Benito Geordie
Joshua Engels
David Torres Ramos
...
Benjamin Meisburger
Shubh Gupta
Yashwanth Adunukota
Tharun Medini
Anshumali Shrivastava
104
2
0
30 Mar 2023
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution
  Vision Transformer
SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer
Xuanyao Chen
Zhijian Liu
Haotian Tang
Li Yi
Hang Zhao
Song Han
ViT
217
48
0
30 Mar 2023
Distributed Neural Representation for Reactive in situ Visualization
Distributed Neural Representation for Reactive in situ Visualization
Qi Wu
J. Insley
V. Mateevitsi
S. Rizzi
M. Papka
Kwan-Liu Ma
80
2
0
28 Mar 2023
SwiftFormer: Efficient Additive Attention for Transformer-based
  Real-time Mobile Vision Applications
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
Abdelrahman M. Shaker
Muhammad Maaz
H. Rasheed
Salman Khan
Ming-Hsuan Yang
Fahad Shahbaz Khan
ViT
159
98
0
27 Mar 2023
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware
  Compression
Vision Models Can Be Efficiently Specialized via Few-Shot Task-Aware Compression
Denis Kuznedelev
Soroush Tabesh
Kimia Noorbakhsh
Elias Frantar
Sara Beery
Eldar Kurtic
Dan Alistarh
MQVLM
82
2
0
25 Mar 2023
PowerPruning: Selecting Weights and Activations for Power-Efficient
  Neural Network Acceleration
PowerPruning: Selecting Weights and Activations for Power-Efficient Neural Network Acceleration
Richard Petri
Grace Li Zhang
Yiran Chen
Ulf Schlichtmann
Bing Li
29
6
0
24 Mar 2023
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with
  Bridge Block Reconstruction for IoT Systems
Q-HyViT: Post-Training Quantization of Hybrid Vision Transformers with Bridge Block Reconstruction for IoT Systems
Jemin Lee
Yongin Kwon
Sihyeong Park
Misun Yu
Jeman Park
Hwanjun Song
ViTMQ
90
6
0
22 Mar 2023
Low Rank Optimization for Efficient Deep Learning: Making A Balance
  between Compact Architecture and Fast Training
Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training
Xinwei Ou
Zhangxin Chen
Ce Zhu
Yipeng Liu
81
5
0
22 Mar 2023
Performance-aware Approximation of Global Channel Pruning for Multitask
  CNNs
Performance-aware Approximation of Global Channel Pruning for Multitask CNNs
Hancheng Ye
Bo Zhang
Tao Chen
Jiayuan Fan
Bin Wang
74
19
0
21 Mar 2023
Solving Oscillation Problem in Post-Training Quantization Through a
  Theoretical Perspective
Solving Oscillation Problem in Post-Training Quantization Through a Theoretical Perspective
Yuexiao Ma
Huixia Li
Xiawu Zheng
Xuefeng Xiao
Rui Wang
Shilei Wen
Xin Pan
Yong Li
Rongrong Ji
MQ
74
12
0
21 Mar 2023
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training
  Efficiency
Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency
Vithursan Thangarasa
Shreyas Saxena
Abhay Gupta
Sean Lie
121
5
0
21 Mar 2023
Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing
Greedy Pruning with Group Lasso Provably Generalizes for Matrix Sensing
Nived Rajaraman
Devvrit
Aryan Mokhtari
Kannan Ramchandran
76
0
0
20 Mar 2023
ExplainFix: Explainable Spatially Fixed Deep Networks
ExplainFix: Explainable Spatially Fixed Deep Networks
Alex Gaudio
Christos Faloutsos
A. Smailagic
P. Costa
A. Campilho
FAtt
73
3
0
18 Mar 2023
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision
  Models
DC-CCL: Device-Cloud Collaborative Controlled Learning for Large Vision Models
Yucheng Ding
Chaoyue Niu
Fan Wu
Shaojie Tang
Chengfei Lyu
Guihai Chen
70
6
0
18 Mar 2023
Unleashing the Potential of Spiking Neural Networks by Dynamic
  Confidence
Unleashing the Potential of Spiking Neural Networks by Dynamic Confidence
Chen Li
Edward Jones
Steve Furber
79
19
0
17 Mar 2023
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Iterative Soft Shrinkage Learning for Efficient Image Super-Resolution
Jiamian Wang
Huan Wang
Yulun Zhang
Yun Fu
Zhiqiang Tao
SupR
68
2
0
16 Mar 2023
A High-Performance Accelerator for Super-Resolution Processing on
  Embedded GPU
A High-Performance Accelerator for Super-Resolution Processing on Embedded GPU
W. Zhao
Qi Sun
Yang Bai
Wenbo Li
Haisheng Zheng
Bei Yu
Martin D. F. Wong
SupR
74
8
0
16 Mar 2023
Gated Compression Layers for Efficient Always-On Models
Gated Compression Layers for Efficient Always-On Models
Haiguang Li
T. Thormundsson
I. Poupyrev
N. Gillian
88
2
0
15 Mar 2023
R2 Loss: Range Restriction Loss for Model Compression and Quantization
R2 Loss: Range Restriction Loss for Model Compression and Quantization
Arnav Kundu
Chungkuk Yoo
Srijan Mishra
Minsik Cho
Saurabh N. Adya
MQ
69
1
0
14 Mar 2023
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
MetaMixer: A Regularization Strategy for Online Knowledge Distillation
Maorong Wang
L. Xiao
T. Yamasaki
KELMMoE
46
1
0
14 Mar 2023
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network
  Evaluations for Fetus Orientations, Fetal Planes, and Anatomical Features
FPUS23: An Ultrasound Fetus Phantom Dataset with Deep Neural Network Evaluations for Fetus Orientations, Fetal Planes, and Anatomical Features
B. Prabakaran
Paul Hamelmann
Erik Ostrowski
Mohamed Bennai
62
12
0
14 Mar 2023
Automatic Attention Pruning: Improving and Automating Model Pruning
  using Attentions
Automatic Attention Pruning: Improving and Automating Model Pruning using Attentions
Kaiqi Zhao
Animesh Jain
Ming Zhao
69
11
0
14 Mar 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse
  Edge Environments
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
143
34
0
13 Mar 2023
Three Guidelines You Should Know for Universally Slimmable
  Self-Supervised Learning
Three Guidelines You Should Know for Universally Slimmable Self-Supervised Learning
Yunhao Cao
Peiqin Sun
Shuchang Zhou
51
4
0
13 Mar 2023
OTOV2: Automatic, Generic, User-Friendly
OTOV2: Automatic, Generic, User-Friendly
Tianyi Chen
Luming Liang
Tian Ding
Zhihui Zhu
Ilya Zharkov
VLMMQ
113
36
0
13 Mar 2023
Complement Sparsification: Low-Overhead Model Pruning for Federated
  Learning
Complement Sparsification: Low-Overhead Model Pruning for Federated Learning
Xiaopeng Jiang
Cristian Borcea
FedML
85
17
0
10 Mar 2023
Sparse and Local Networks for Hypergraph Reasoning
Sparse and Local Networks for Hypergraph Reasoning
Guangxuan Xiao
L. Kaelbling
Jiajun Wu
Jiayuan Mao
NAIReLMLRM
94
1
0
09 Mar 2023
A Privacy Preserving System for Movie Recommendations Using Federated
  Learning
A Privacy Preserving System for Movie Recommendations Using Federated Learning
David Neumann
Andreas Lutz
Karsten Müller
Wojciech Samek
59
12
0
07 Mar 2023
An Edge-based WiFi Fingerprinting Indoor Localization Using
  Convolutional Neural Network and Convolutional Auto-Encoder
An Edge-based WiFi Fingerprinting Indoor Localization Using Convolutional Neural Network and Convolutional Auto-Encoder
Amin Kargar-Barzi
Ebrahim Farahmand
Nooshin Taheri Chatrudi
A. Mahani
M. Shafique
57
8
0
07 Mar 2023
Training-Free Acceleration of ViTs with Delayed Spatial Merging
Training-Free Acceleration of ViTs with Delayed Spatial Merging
J. Heo
Seyedarmin Azizi
A. Fayyazi
Massoud Pedram
126
3
0
04 Mar 2023
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Adversarial Attacks on Machine Learning in Embedded and IoT Platforms
Christian Westbrook
S. Pasricha
AAML
74
3
0
03 Mar 2023
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!
Shiwei Liu
Tianlong Chen
Zhenyu Zhang
Xuxi Chen
Tianjin Huang
Ajay Jaiswal
Zhangyang Wang
87
28
0
03 Mar 2023
Rotation Invariant Quantization for Model Compression
Rotation Invariant Quantization for Model Compression
Dor-Joseph Kampeas
Yury Nahshan
Hanoch Kremer
Gil Lederman
Shira Zaloshinski
Zheng Li
E. Haleva
MQ
122
1
0
03 Mar 2023
TopSpark: A Timestep Optimization Methodology for Energy-Efficient
  Spiking Neural Networks on Autonomous Mobile Agents
TopSpark: A Timestep Optimization Methodology for Energy-Efficient Spiking Neural Networks on Autonomous Mobile Agents
Rachmad Vidya Wicaksana Putra
Mohamed Bennai
80
13
0
03 Mar 2023
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker
  Verification
Distilling Multi-Level X-vector Knowledge for Small-footprint Speaker Verification
Xuechen Liu
Md. Sahidullah
Tomi Kinnunen
139
4
0
02 Mar 2023
Previous
123...151617...686970
Next