Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.00149
Cited By
v1
v2
v3
v4
v5 (latest)
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
1 October 2015
Song Han
Huizi Mao
W. Dally
3DGS
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding"
50 / 3,481 papers shown
Title
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints
Francesco Corti
Balz Maag
Joachim Schauer
U. Pferschy
O. Saukh
106
2
0
22 Nov 2023
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
M. Lê
Pierre Wolinski
Julyan Arbel
99
10
0
20 Nov 2023
Tensor-Aware Energy Accounting
Timur Babakol
Yu David Liu
43
4
0
19 Nov 2023
LifeLearner: Hardware-Aware Meta Continual Learning System for Embedded Computing Platforms
Young D. Kwon
Jagmohan Chauhan
Hong Jia
Stylianos I. Venieris
Cecilia Mascolo
89
12
0
19 Nov 2023
Low-Precision Floating-Point for Efficient On-Board Deep Neural Network Processing
Cédric Gernigon
Silviu-Ioan Filip
Olivier Sentieys
Clément Coggiola
Mickael Bruno
MQ
61
8
0
18 Nov 2023
Deep Coherence Learning: An Unsupervised Deep Beamformer for High Quality Single Plane Wave Imaging in Medical Ultrasound
Hyunwoo Cho
Seongjun Park
Jinbum Kang
Yangmo Yoo
OOD
21
3
0
18 Nov 2023
ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation
Zhuang Yan
Zhenzhe Zheng
Yunfeng Shao
Bingshuai Li
Fan Wu
Guihai Chen
79
5
0
18 Nov 2023
Improved TokenPose with Sparsity
Anning Li
ViT
80
0
0
16 Nov 2023
Do Localization Methods Actually Localize Memorized Data in LLMs? A Tale of Two Benchmarks
Ting-Yun Chang
Jesse Thomason
Robin Jia
100
19
0
15 Nov 2023
FedCode: Communication-Efficient Federated Learning via Transferring Codebooks
Saeed Khalilian Gourtani
Vasileios Tsouvalas
T. Ozcelebi
N. Meratnia
FedML
105
5
0
15 Nov 2023
Boolean Variation and Boolean Logic BackPropagation
Van Minh Nguyen
93
2
0
13 Nov 2023
Training A Multi-stage Deep Classifier with Feedback Signals
Chao Xu
Yu Yang
Rong Wang
Guan Wang
Bojia Lin
43
0
0
12 Nov 2023
5G Positioning Advancements with AI/ML
Mohammad Alawieh
Georgios Kontes
39
5
0
10 Nov 2023
Quantized Distillation: Optimizing Driver Activity Recognition Models for Resource-Constrained Environments
Calvin Tanama
Kunyu Peng
Zdravko Marinov
Rainer Stiefelhagen
Alina Roitberg
70
1
0
10 Nov 2023
FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores
Daniel Y. Fu
Hermann Kumbong
Eric N. D. Nguyen
Christopher Ré
VLM
100
30
0
10 Nov 2023
Compressed and Sparse Models for Non-Convex Decentralized Learning
Andrew Campbell
Hang Liu
Leah Woldemariam
Anna Scaglione
53
0
0
09 Nov 2023
Adaptive Compression-Aware Split Learning and Inference for Enhanced Network Efficiency
Akrit Mudvari
Antero Vainio
Iason Ofeidis
Sasu Tarkoma
Leandros Tassiulas
81
3
0
09 Nov 2023
Game Theory Solutions in Sensor-Based Human Activity Recognition: A Review
M. Shayesteh
Behrooz Sharokhzadeh
B. Masoumi
30
3
0
09 Nov 2023
Exploiting Neural-Network Statistics for Low-Power DNN Inference
Lennart Bamberg
Ardalan Najafi
Alberto García-Ortiz
29
1
0
09 Nov 2023
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models
Rocktim Jyoti Das
Mingjie Sun
Liqun Ma
Zhiqiang Shen
VLM
83
18
0
08 Nov 2023
Mini but Mighty: Finetuning ViTs with Mini Adapters
Imad Eddine Marouf
Enzo Tartaglione
Stéphane Lathuilière
75
5
0
07 Nov 2023
Machine learning's own Industrial Revolution
Yuan Luo
Song Han
Jingjing Liu
AI4CE
108
0
0
04 Nov 2023
AFPQ: Asymmetric Floating Point Quantization for LLMs
Yijia Zhang
Sicheng Zhang
Shijie Cao
Dayou Du
Jianyu Wei
Ting Cao
Ningyi Xu
MQ
60
5
0
03 Nov 2023
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection
Haibao Yu
Yingjuan Tang
Enze Xie
Jilei Mao
Ping Luo
Zaiqing Nie
3DPC
105
27
0
03 Nov 2023
Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO
Julian Moosmann
Pietro Bonazzi
Yawei Li
Sizhen Bian
Philipp Mayer
Luca Benini
Michele Magno
117
13
0
02 Nov 2023
Efficient LLM Inference on CPUs
Haihao Shen
Hanwen Chang
Bo Dong
Yu Luo
Hengyu Meng
MQ
75
19
0
01 Nov 2023
Federated Topic Model and Model Pruning Based on Variational Autoencoder
Chengjie Ma
Yawen Li
M. Liang
Ang Li
FedML
29
1
0
01 Nov 2023
Importance Estimation with Random Gradient for Neural Network Pruning
Suman Sapkota
Binod Bhattarai
100
1
0
31 Oct 2023
PriPrune: Quantifying and Preserving Privacy in Pruned Federated Learning
Tianyue Chu
Mengwei Yang
Nikolaos Laoutaris
A. Markopoulou
87
7
0
30 Oct 2023
SparseByteNN: A Novel Mobile Inference Acceleration Framework Based on Fine-Grained Group Sparsity
Haitao Xu
Songwei Liu
Yuyang Xu
Shuai Wang
Jiashi Li
Chenqian Yan
Liangqiang Li
Lean Fu
Xin Pan
Fangmin Chen
MQ
46
0
0
30 Oct 2023
Efficient IoT Inference via Context-Awareness
Mohammad Mehdi Rastikerdar
Jin Huang
Shiwei Fang
Hui Guan
Deepak Ganesan
103
0
0
29 Oct 2023
Atom: Low-bit Quantization for Efficient and Accurate LLM Serving
Yilong Zhao
Chien-Yu Lin
Kan Zhu
Zihao Ye
Lequn Chen
Wenlei Bao
Luis Ceze
Arvind Krishnamurthy
Tianqi Chen
Baris Kasikci
MQ
154
150
0
29 Oct 2023
FedPEAT: Convergence of Federated Learning, Parameter-Efficient Fine Tuning, and Emulator Assisted Tuning for Artificial Intelligence Foundation Models with Mobile Edge Computing
Terence Jie Chua
Wen-li Yu
Junfeng Zhao
Kwok-Yan Lam
FedML
61
5
0
26 Oct 2023
Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Zichang Liu
Jue Wang
Tri Dao
Dinesh Manocha
Binhang Yuan
...
Anshumali Shrivastava
Ce Zhang
Yuandong Tian
Christopher Ré
Beidi Chen
BDL
126
221
0
26 Oct 2023
How Robust is Federated Learning to Communication Error? A Comparison Study Between Uplink and Downlink Channels
Linping Qu
Shenghui Song
Chi-Ying Tsui
Yuyi Mao
64
2
0
25 Oct 2023
E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity
Yun Li
Lin Niu
Xipeng Zhang
Kai Liu
Jianchen Zhu
Zhanhui Kang
MoE
90
14
0
24 Oct 2023
LoRAShear: Efficient Large Language Model Structured Pruning and Knowledge Recovery
Tianyi Chen
Tianyu Ding
Badal Yadav
Ilya Zharkov
Luming Liang
113
32
0
24 Oct 2023
Federated learning compression designed for lightweight communications
Lucas Grativol Ribeiro
Mathieu Léonardon
Guillaume Muller
Virginie Fresse
Matthieu Arzel
FedML
77
3
0
23 Oct 2023
Large Search Model: Redefining Search Stack in the Era of LLMs
Liang Wang
Nan Yang
Xiaolong Huang
Linjun Yang
Rangan Majumder
Furu Wei
LRM
KELM
100
15
0
23 Oct 2023
One is More: Diverse Perspectives within a Single Network for Efficient DRL
Yiqin Tan
Ling Pan
Longbo Huang
OffRL
85
0
0
21 Oct 2023
Breaking through Deterministic Barriers: Randomized Pruning Mask Generation and Selection
Jianwei Li
Weizhi Gao
Qi Lei
Dongkuan Xu
68
2
0
19 Oct 2023
SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation
Chongyu Fan
Jiancheng Liu
Yihua Zhang
Eric Wong
Dennis Wei
Sijia Liu
MU
143
150
0
19 Oct 2023
Sparse-DySta: Sparsity-Aware Dynamic and Static Scheduling for Sparse Multi-DNN Workloads
Hongxiang Fan
Stylianos I. Venieris
Alexandros Kouris
Nicholas D. Lane
88
8
0
17 Oct 2023
RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets
Zhicheng Cai
Xiaohan Ding
Qiu Shen
Xun Cao
73
20
0
16 Oct 2023
The Road to On-board Change Detection: A Lightweight Patch-Level Change Detection Network via Exploring the Potential of Pruning and Pooling
Lihui Xue
Zhihao Wang
Xueqian Wang
Gang Li
95
1
0
16 Oct 2023
Chameleon: a Heterogeneous and Disaggregated Accelerator System for Retrieval-Augmented Language Models
Wenqi Jiang
Marco Zeller
R. Waleffe
Torsten Hoefler
Gustavo Alonso
130
19
0
15 Oct 2023
Edge-InversionNet: Enabling Efficient Inference of InversionNet on Edge Devices
Zhepeng Wang
Isaacshubhanand Putla
Weiwen Jiang
Youzuo Lin
64
2
0
14 Oct 2023
Prompt Backdoors in Visual Prompt Learning
Hai Huang
Zhengyu Zhao
Michael Backes
Yun Shen
Yang Zhang
VLM
VPVLM
AAML
SILM
76
2
0
11 Oct 2023
Efficient machine-learning surrogates for large-scale geological carbon and energy storage
T. Kadeethum
Stephen J Verzi
Hongkyu Yoon
AI4CE
62
2
0
11 Oct 2023
Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
Mengzhou Xia
Tianyu Gao
Zhiyuan Zeng
Danqi Chen
127
311
0
10 Oct 2023
Previous
1
2
3
...
10
11
12
...
68
69
70
Next