Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding

1 October 2015
Song Han
Huizi Mao
W. Dally
    3DGS

Papers citing "Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"

50 / 3,481 papers shown
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints
Francesco Corti
Balz Maag
Joachim Schauer
U. Pferschy
O. Saukh
106
2
0
22 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
99
10
0
20 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
43
4
0
19 Nov 2023
LifeLearner: Hardware-Aware Meta Continual Learning System for Embedded Computing Platforms
Young D. Kwon
Jagmohan Chauhan
Hong Jia
Stylianos I. Venieris
Cecilia Mascolo
89
12
0
19 Nov 2023
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
MQ
61
8
0
18 Nov 2023
Deep Coherence Learning: An Unsupervised Deep Beamformer for High Quality Single Plane Wave Imaging in Medical Ultrasound
Hyunwoo Cho
Seongjun Park
Jinbum Kang
Yangmo Yoo
OOD
21
3
0
18 Nov 2023
ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation
Zhuang Yan
Zhenzhe Zheng
Yunfeng Shao
Bingshuai Li
Fan Wu
Guihai Chen
79
5
0
18 Nov 2023
Improved TokenPose with Sparsity
Anning Li
ViT
80
0
0
16 Nov 2023
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks
Ting-Yun Chang
Jesse Thomason
Robin Jia
100
19
0
15 Nov 2023
FedCode: Communication-Efficient Federated Learning via Transferring Codebooks
Saeed Khalilian Gourtani
Vasileios Tsouvalas
T. Ozcelebi
N. Meratnia
FedML
105
5
0
15 Nov 2023
Boolean Variation and Boolean Logic BackPropagation
Van Minh Nguyen
93
2
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
43
0
0
12 Nov 2023
5G Positioning Advancements with AI/ML
Mohammad Alawieh
Georgios Kontes
39
5
0
10 Nov 2023
Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments
Calvin Tanama
Kunyu Peng
Zdravko Marinov
Rainer Stiefelhagen
Alina Roitberg
70
1
0
10 Nov 2023
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Daniel Y. Fu
Hermann Kumbong
Eric N. D. Nguyen
Christopher Ré
VLM
100
30
0
10 Nov 2023
Compressed and Sparse Models for Non-Convex Decentralized Learning
Andrew Campbell
Hang Liu
Leah Woldemariam
Anna Scaglione
53
0
0
09 Nov 2023
Adaptive Compression-Aware Split Learning and Inference for Enhanced Network Efficiency
Akrit Mudvari
Antero Vainio
Iason Ofeidis
Sasu Tarkoma
Leandros Tassiulas
81
3
0
09 Nov 2023
Game Theory Solutions in Sensor-Based Human Activity Recognition: A Review
M. Shayesteh
Behrooz Sharokhzadeh
B. Masoumi
30
3
0
09 Nov 2023
Exploiting Neural-Network Statistics for Low-Power DNN Inference
Lennart Bamberg
Ardalan Najafi
Alberto García-Ortiz
29
1
0
09 Nov 2023
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models
Rocktim Jyoti Das
Mingjie Sun
Liqun Ma
Zhiqiang Shen
VLM
83
18
0
08 Nov 2023
Mini but Mighty: Finetuning ViTs with Mini Adapters
Imad Eddine Marouf
Enzo Tartaglione
Stéphane Lathuilière
75
5
0
07 Nov 2023
Machine learning's own Industrial Revolution
Yuan Luo
Song Han
Jingjing Liu
AI4CE
108
0
0
04 Nov 2023
AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang
Sicheng Zhang
Shijie Cao
Dayou Du
Jianyu Wei
Ting Cao
Ningyi Xu
MQ
60
5
0
03 Nov 2023
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
Haibao Yu
Yingjuan Tang
Enze Xie
Jilei Mao
Ping Luo
Zaiqing Nie
3DPC
105
27
0
03 Nov 2023
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO
Julian Moosmann
Pietro Bonazzi
Yawei Li
Sizhen Bian
Philipp Mayer
Luca Benini
Michele Magno
117
13
0
02 Nov 2023
Efficient LLM Inference on CPUs
Haihao Shen
Hanwen Chang
Bo Dong
Yu Luo
Hengyu Meng
MQ
75
19
0
01 Nov 2023
Federated Topic Model and Model Pruning Based on Variational Autoencoder
Chengjie Ma
Yawen Li
M. Liang
Ang Li
FedML
29
1
0
01 Nov 2023
Importance Estimation with Random Gradient for Neural Network Pruning
Suman Sapkota
Binod Bhattarai
100
1
0
31 Oct 2023
PriPrune: Quantifying and Preserving Privacy in Pruned Federated Learning
Tianyue Chu
Mengwei Yang
Nikolaos Laoutaris
A. Markopoulou
87
7
0
30 Oct 2023
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity
Haitao Xu
Songwei Liu
Yuyang Xu
Shuai Wang
Jiashi Li
Chenqian Yan
Liangqiang Li
Lean Fu
Xin Pan
Fangmin Chen
MQ
46
0
0
30 Oct 2023
Efficient IoT Inference via Context-Awareness
Mohammad Mehdi Rastikerdar
Jin Huang
Shiwei Fang
Hui Guan
Deepak Ganesan
103
0
0
29 Oct 2023
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Yilong Zhao
Chien-Yu Lin
Kan Zhu
Zihao Ye
Lequn Chen
Wenlei Bao
Luis Ceze
Arvind Krishnamurthy
Tianqi Chen
Baris Kasikci
MQ
154
150
0
29 Oct 2023
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
Terence Jie Chua
Wen-li Yu
Junfeng Zhao
Kwok-Yan Lam
FedML
61
5
0
26 Oct 2023
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Zichang Liu
Jue Wang
Tri Dao
Dinesh Manocha
Binhang Yuan
...
Anshumali Shrivastava
Ce Zhang
Yuandong Tian
Christopher Ré
Beidi Chen
BDL
126
221
0
26 Oct 2023
How Robust is Federated Learning to Communication Error? A Comparison Study Between Uplink and Downlink Channels
Linping Qu
Shenghui Song
Chi-Ying Tsui
Yuyi Mao
64
2
0
25 Oct 2023
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
Yun Li
Lin Niu
Xipeng Zhang
Kai Liu
Jianchen Zhu
Zhanhui Kang
MoE
90
14
0
24 Oct 2023
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery
Tianyi Chen
Tianyu Ding
Badal Yadav
Ilya Zharkov
Luming Liang
113
32
0
24 Oct 2023
Federated learning compression designed for lightweight communications
Lucas Grativol Ribeiro
Mathieu Léonardon
Guillaume Muller
Virginie Fresse
Matthieu Arzel
FedML
77
3
0
23 Oct 2023
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
LRM KELM
100
15
0
23 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
85
0
0
21 Oct 2023
Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection
Jianwei Li
Weizhi Gao
Qi Lei
Dongkuan Xu
68
2
0
19 Oct 2023
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Chongyu Fan
Jiancheng Liu
Yihua Zhang
Eric Wong
Dennis Wei
Sijia Liu
MU
143
150
0
19 Oct 2023
Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi-DNN Workloads
Hongxiang Fan
Stylianos I. Venieris
Alexandros Kouris
Nicholas D. Lane
88
8
0
17 Oct 2023
RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Zhicheng Cai
Xiaohan Ding
Qiu Shen
Xun Cao
73
20
0
16 Oct 2023
The Road to On-board Change Detection: A Lightweight Patch-Level Change Detection Network via Exploring the Potential of Pruning and Pooling
Lihui Xue
Zhihao Wang
Xueqian Wang
Gang Li
95
1
0
16 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
130
19
0
15 Oct 2023
Edge-InversionNet: Enabling Efficient Inference of InversionNet on Edge Devices
Zhepeng Wang
Isaacshubhanand Putla
Weiwen Jiang
Youzuo Lin
64
2
0
14 Oct 2023
Prompt Backdoors in Visual Prompt Learning
Hai Huang
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
VLM VPVLM AAML SILM
76
2
0
11 Oct 2023
Efficient machine-learning surrogates for large-scale geological carbon and energy storage
T. Kadeethum
Stephen J Verzi
Hongkyu Yoon
AI4CE
62
2
0
11 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
127
311
0
10 Oct 2023