ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1510.00149
  4. Cited By
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
v1v2v3v4v5 (latest)

Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS
ArXiv (abs)PDFHTML

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
Title
Learning to Grow Pretrained Models for Efficient Transformer Training
Learning to Grow Pretrained Models for Efficient Transformer Training
Peihao Wang
Yikang Shen
Lucas Torroba Hennigen
P. Greengard
Leonid Karlinsky
Rogerio Feris
David D. Cox
Zhangyang Wang
Yoon Kim
75
56
0
02 Mar 2023
EdgeServe: A Streaming System for Decentralized Model Serving
EdgeServe: A Streaming System for Decentralized Model Serving
Ted Shaowang
Sanjay Krishnan
52
1
0
02 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
Structured Pruning for Deep Convolutional Neural Networks: A survey
Yang He
Lingao Xiao
3DPC
120
146
0
01 Mar 2023
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
GRAN: Ghost Residual Attention Network for Single Image Super Resolution
Axi Niu
Pei Wang
Yu Zhu
Jinqiu Sun
Qingsen Yan
Yanning Zhang
SupR
81
8
0
28 Feb 2023
Leveraging Angular Distributions for Improved Knowledge Distillation
Leveraging Angular Distributions for Improved Knowledge Distillation
Eunyeong Jeon
Hongjun Choi
Ankita Shukla
Pavan Turaga
25
8
0
27 Feb 2023
Efficient Multitask Learning on Resource-Constrained Systems
Efficient Multitask Learning on Resource-Constrained Systems
Yubo Luo
Le Zhang
Zhenyu Wang
S. Nirjon
60
9
0
25 Feb 2023
A Unified Framework for Soft Threshold Pruning
A Unified Framework for Soft Threshold Pruning
Yanqing Chen
Zhengyu Ma
Wei Fang
Xiawu Zheng
Zhaofei Yu
Yonghong Tian
137
21
0
25 Feb 2023
Debiased Distillation by Transplanting the Last Layer
Debiased Distillation by Transplanting the Last Layer
Jiwoon Lee
Jaeho Lee
60
3
0
22 Feb 2023
Device Tuning for Multi-Task Large Model
Device Tuning for Multi-Task Large Model
Penghao Jiang
Xuanchen Hou
Y. Zhou
42
0
0
21 Feb 2023
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Oriented Object Detection in Optical Remote Sensing Images using Deep Learning: A Survey
Kunlin Wang
Zi Wang
Zhang Li
Ang Su
Xichao Teng
Minhao Liu
Qifeng Yu
Qifeng Yu
ObjD
219
9
0
21 Feb 2023
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained
  Transformers
HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers
Chen Liang
Haoming Jiang
Zheng Li
Xianfeng Tang
Bin Yin
Tuo Zhao
VLM
136
25
0
19 Feb 2023
Rethinking Data-Free Quantization as a Zero-Sum Game
Rethinking Data-Free Quantization as a Zero-Sum Game
Biao Qian
Yang Wang
Richang Hong
Meng Wang
MQ
70
18
0
19 Feb 2023
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the
  Edge
Moby: Empowering 2D Models for Efficient Point Cloud Analytics on the Edge
Jingzong Li
Yik Hong Cai
Libin Liu
Yushun Mao
Chun Jason Xue
Hongchang Xu
56
4
0
18 Feb 2023
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile
  Acceleration on CPUs
VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Geonhwa Jeong
S. Damani
Abhimanyu Bambhaniya
Eric Qin
C. Hughes
S. Subramoney
Hyesoon Kim
T. Krishna
MoE
95
26
0
17 Feb 2023
Hardware-aware training for large-scale and diverse deep learning
  inference workloads using in-memory computing-based accelerators
Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators
Malte J. Rasch
C. Mackin
Manuel Le Gallo
An Chen
A. Fasoli
...
P. Narayanan
H. Tsai
G. Burr
Abu Sebastian
Vijay Narayanan
69
96
0
16 Feb 2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Zhichao Lu
Chuntao Ding
Felix Juefei Xu
Vishnu Boddeti
Shangguang Wang
Yun Yang
86
17
0
15 Feb 2023
Towards Optimal Compression: Joint Pruning and Quantization
Towards Optimal Compression: Joint Pruning and Quantization
Ben Zandonati
Glenn Bucagu
Adrian Alan Pol
M. Pierini
Olya Sirkin
Tal Kopetz
MQ
97
3
0
15 Feb 2023
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Workload-Balanced Pruning for Sparse Spiking Neural Networks
Ruokai Yin
Youngeun Kim
Yuhang Li
Abhishek Moitra
Nitin Satpute
Anna Hambitzer
Priyadarshini Panda
98
21
0
13 Feb 2023
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Simple Hardware-Efficient Long Convolutions for Sequence Modeling
Daniel Y. Fu
Elliot L. Epstein
Eric N. D. Nguyen
A. Thomas
Michael Zhang
Tri Dao
Atri Rudra
Christopher Ré
69
55
0
13 Feb 2023
Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural
  Networks with Neuromorphic Data
Sneaky Spikes: Uncovering Stealthy Backdoor Attacks in Spiking Neural Networks with Neuromorphic Data
Gorka Abad
Oguzhan Ersoy
S. Picek
A. Urbieta
AAML
60
19
0
13 Feb 2023
Autoselection of the Ensemble of Convolutional Neural Networks with
  Second-Order Cone Programming
Autoselection of the Ensemble of Convolutional Neural Networks with Second-Order Cone Programming
Buse Çisil Güldoğuş
Abdullah Nazhat Abdullah
Muhammad Ammar Ali
Süreyya Özögür-Akyüz
78
0
0
12 Feb 2023
Pruning Deep Neural Networks from a Sparsity Perspective
Pruning Deep Neural Networks from a Sparsity Perspective
Enmao Diao
G. Wang
Jiawei Zhan
Yuhong Yang
Jie Ding
Vahid Tarokh
82
32
0
11 Feb 2023
Offsite-Tuning: Transfer Learning without Full Model
Offsite-Tuning: Transfer Learning without Full Model
Guangxuan Xiao
Ji Lin
Song Han
90
76
0
09 Feb 2023
SparseProp: Efficient Sparse Backpropagation for Faster Training of
  Neural Networks
SparseProp: Efficient Sparse Backpropagation for Faster Training of Neural Networks
Mahdi Nikdan
Tommaso Pegolotti
Eugenia Iofinova
Eldar Kurtic
Dan Alistarh
87
11
0
09 Feb 2023
Towards Fairer and More Efficient Federated Learning via
  Multidimensional Personalized Edge Models
Towards Fairer and More Efficient Federated Learning via Multidimensional Personalized Edge Models
Yingchun Wang
Jingcai Guo
Jie Zhang
Song Guo
Weizhan Zhang
Qinghua Zheng
FedML
106
11
0
09 Feb 2023
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement
  Learning
Data Quality-aware Mixed-precision Quantization via Hybrid Reinforcement Learning
Yingchun Wang
Jingcai Guo
Song Guo
Weizhan Zhang
MQ
82
21
0
09 Feb 2023
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models
PFGM++: Unlocking the Potential of Physics-Inspired Generative Models
Yilun Xu
Ziming Liu
Yonglong Tian
Shangyuan Tong
Max Tegmark
Tommi Jaakkola
AI4CEDiffM
99
72
0
08 Feb 2023
CRAFT: Criticality-Aware Fault-Tolerance Enhancement Techniques for
  Emerging Memories-Based Deep Neural Networks
CRAFT: Criticality-Aware Fault-Tolerance Enhancement Techniques for Emerging Memories-Based Deep Neural Networks
Thai-Hoang Nguyen
Muhammad Imran
Jaehyuk Choi
Joongseob Yang
21
3
0
08 Feb 2023
What Matters In The Structured Pruning of Generative Language Models?
What Matters In The Structured Pruning of Generative Language Models?
Michael Santacroce
Zixin Wen
Yelong Shen
Yuan-Fang Li
91
34
0
07 Feb 2023
Ten Lessons We Have Learned in the New "Sparseland": A Short Handbook
  for Sparse Neural Network Researchers
Ten Lessons We Have Learned in the New "Sparseland": A Short Handbook for Sparse Neural Network Researchers
Shiwei Liu
Zhangyang Wang
131
32
0
06 Feb 2023
Towards Implementing Energy-aware Data-driven Intelligence for Smart
  Health Applications on Mobile Platforms
Towards Implementing Energy-aware Data-driven Intelligence for Smart Health Applications on Mobile Platforms
G. D. Samaraweera
Hung Nguyen
Hadi Zanddizari
Behnam Zeinali
Jerome Chang
68
0
0
01 Feb 2023
UPop: Unified and Progressive Pruning for Compressing Vision-Language
  Transformers
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
Dachuan Shi
Chaofan Tao
Ying Jin
Zhendong Yang
Chun Yuan
Jiaqi Wang
VLMViT
133
39
0
31 Jan 2023
Self-Compressing Neural Networks
Self-Compressing Neural Networks
Szabolcs Cséfalvay
J. Imber
56
3
0
30 Jan 2023
DepGraph: Towards Any Structural Pruning
DepGraph: Towards Any Structural Pruning
Gongfan Fang
Xinyin Ma
Mingli Song
Michael Bi Mi
Xinchao Wang
GNN
183
276
0
30 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
65
14
0
29 Jan 2023
Understanding INT4 Quantization for Transformer Models: Latency Speedup,
  Composability, and Failure Cases
Understanding INT4 Quantization for Transformer Models: Latency Speedup, Composability, and Failure Cases
Xiaoxia Wu
Cheng-rong Li
Reza Yazdani Aminabadi
Z. Yao
Yuxiong He
MQ
77
25
0
27 Jan 2023
SWARM Parallelism: Training Large Models Can Be Surprisingly
  Communication-Efficient
SWARM Parallelism: Training Large Models Can Be Surprisingly Communication-Efficient
Max Ryabinin
Tim Dettmers
Michael Diskin
Alexander Borzunov
MoE
111
38
0
27 Jan 2023
Voting from Nearest Tasks: Meta-Vote Pruning of Pre-trained Models for
  Downstream Tasks
Voting from Nearest Tasks: Meta-Vote Pruning of Pre-trained Models for Downstream Tasks
Haiyan Zhao
Tianyi Zhou
Guodong Long
Jing Jiang
Chengqi Zhang
83
0
0
27 Jan 2023
Open Problems in Applied Deep Learning
Open Problems in Applied Deep Learning
M. Raissi
AI4CE
117
2
0
26 Jan 2023
BiBench: Benchmarking and Analyzing Network Binarization
BiBench: Benchmarking and Analyzing Network Binarization
Haotong Qin
Mingyuan Zhang
Yifu Ding
Aoyu Li
Zhongang Cai
Ziwei Liu
Feng Yu
Xianglong Liu
MQAAML
110
37
0
26 Jan 2023
Low-Rank Winograd Transformation for 3D Convolutional Neural Networks
Low-Rank Winograd Transformation for 3D Convolutional Neural Networks
Ziran Qin
Mingbao Lin
Weiyao Lin
3DPC
77
3
0
26 Jan 2023
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Rewarded meta-pruning: Meta Learning with Rewards for Channel Pruning
Athul Shibu
Abhishek Kumar
Heechul Jung
Dong-Gyu Lee
82
1
0
26 Jan 2023
PowerQuant: Automorphism Search for Non-Uniform Quantization
PowerQuant: Automorphism Search for Non-Uniform Quantization
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
63
17
0
24 Jan 2023
Ensemble Transfer Learning for Multilingual Coreference Resolution
Ensemble Transfer Learning for Multilingual Coreference Resolution
T. Lai
Heng Ji
50
1
0
22 Jan 2023
Accelerating and Compressing Deep Neural Networks for Massive MIMO CSI
  Feedback
Accelerating and Compressing Deep Neural Networks for Massive MIMO CSI Feedback
O. Erak
H. Abou-zeid
91
5
0
20 Jan 2023
Getting Away with More Network Pruning: From Sparsity to Geometry and
  Linear Regions
Getting Away with More Network Pruning: From Sparsity to Geometry and Linear Regions
Junyang Cai
Khai-Nguyen Nguyen
Nishant Shrestha
Aidan Good
Ruisen Tu
Xin Yu
Shandian Zhe
Thiago Serra
MLT
102
10
0
19 Jan 2023
Quantum HyperNetworks: Training Binary Neural Networks in Quantum Superposition
Quantum HyperNetworks: Training Binary Neural Networks in Quantum Superposition
Juan Carrasquilla
Mohamed Hibat-Allah
E. Inack
Alireza Makhzani
Kirill Neklyudov
Graham Taylor
G. Torlai
MQ
84
9
0
19 Jan 2023
Scaling Deep Networks with the Mesh Adaptive Direct Search algorithm
Scaling Deep Networks with the Mesh Adaptive Direct Search algorithm
Dounia Lakhmiri
Mahdi Zolnouri
V. Nia
C. Tribes
Sébastien Le Digabel
56
0
0
17 Jan 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of
  Quantized CNNs
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
97
2
0
15 Jan 2023
GOHSP: A Unified Framework of Graph and Optimization-based Heterogeneous
  Structured Pruning for Vision Transformer
GOHSP: A Unified Framework of Graph and Optimization-based Heterogeneous Structured Pruning for Vision Transformer
Miao Yin
Burak Uzkent
Yilin Shen
Hongxia Jin
Bo Yuan
ViT
89
16
0
13 Jan 2023
Previous
123...161718...686970
Next