2101.09671
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
24 January 2021
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
Papers citing
"Pruning and Quantization for Deep Neural Network Acceleration: A Survey"
50 / 202 papers shown
Title
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Dustin Wright
Christian Igel
Gabrielle Samuel
Raghavendra Selvan
29
15
0
05 Sep 2023
SAF-IS: a Spatial Annotation Free Framework for Instance Segmentation of Surgical Tools
Luca Sestini
Benoit Rosa
Elena De Momi
G. Ferrigno
N. Padoy
31
0
0
04 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
34
20
0
27 Aug 2023
EFaR 2023: Efficient Face Recognition Competition
J. Kolf
Fadi Boutros
Jurek Elliesen
Markus Theuerkauf
Naser Damer
...
D. Nunes
Ahmad Hassanpour
Pankaj Khatiwada
A. Toor
Bian Yang
CVBM
MQ
24
13
0
08 Aug 2023
Advancing Frame-Dropping in Multi-Object Tracking-by-Detection Systems Through Event-Based Detection Triggering
M. Henning
M. Buchholz
Klaus C. J. Dietmayer
19
0
0
01 Aug 2023
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization
Edward Fish
Umberto Michieli
Mete Ozay
MQ
30
4
0
24 Jul 2023
Neural Image Compression: Generalization, Robustness, and Spectral Biases
Kelsey Lieberman
James Diffenderfer
Charles Godfrey
B. Kailkhura
24
4
0
17 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
25
8
0
12 Jul 2023
Distilling Universal and Joint Knowledge for Cross-Domain Model Compression on Time Series Data
Qing Xu
Min-man Wu
Xiaoli Li
K. Mao
Zhenghua Chen
19
5
0
07 Jul 2023
Cloud-Native Computing: A Survey from the Perspective of Services
Shuiguang Deng
Hailiang Zhao
Binbin Huang
Cheng Zhang
Feiyi Chen
Yinuo Deng
Jianwei Yin
Schahram Dustdar
Albert Y. Zomaya
AI4TS
33
17
0
26 Jun 2023
Binary domain generalization for sparsifying binary neural networks
Riccardo Schiavone
Francesco Galati
Maria A. Zuluaga
MQ
19
0
0
23 Jun 2023
Towards Exascale CFD Simulations Using the Discontinuous Galerkin Solver FLEXI
Marcel P. Blind
Min Gao
Daniel Kempf
Patrick Kopper
Marius Kurz
A. Schwarz
Andrea Beck
AI4CE
20
4
0
22 Jun 2023
Neural Network Compression using Binarization and Few Full-Precision Weights
F. M. Nardini
Cosimo Rulli
Salvatore Trani
Rossano Venturini
MQ
19
1
0
15 Jun 2023
Resource Efficient Neural Networks Using Hessian Based Pruning
J. Chong
Manas Gupta
Lihui Chen
22
2
0
12 Jun 2023
E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks
Arshdeep Singh
Haohe Liu
Mark D. Plumbley
VLM
14
4
0
30 May 2023
Compressing audio CNNs with graph centrality based filter pruning
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
9
2
0
05 May 2023
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
22
16
0
19 Apr 2023
The Impact of Frame-Dropping on Performance and Energy Consumption for Multi-Object Tracking
M. Henning
M. Buchholz
Klaus C. J. Dietmayer
13
1
0
17 Apr 2023
Patch-wise Features for Blur Image Classification
Sri Charan Kattamuru
Kshitij Agrawal
S. Adhikari
Abhishek Bose
Hemant Misra
24
1
0
06 Apr 2023
Efficient CNNs via Passive Filter Pruning
Arshdeep Singh
Mark D. Plumbley
21
1
0
05 Apr 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
Xiang-yu Zhu
Renrui Zhang
Bowei He
A-Long Zhou
Dong Wang
Bingyan Zhao
Peng Gao
VLM
32
79
0
03 Apr 2023
Bayesian neural networks via MCMC: a Python-based tutorial
Rohitash Chandra
Royce Chen
Joshua Simmons
BDL
31
10
0
02 Apr 2023
Common Subexpression-based Compression and Multiplication of Sparse Constant Matrices
Emre Bilgili
A. Yurdakul
15
0
0
26 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
41
1
0
25 Mar 2023
FS-Real: Towards Real-World Cross-Device Federated Learning
Daoyuan Chen
Dawei Gao
Yuexiang Xie
Xuchen Pan
Zitao Li
Yaliang Li
Bolin Ding
Jingren Zhou
117
26
0
23 Mar 2023
An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry
Wenxin Jiang
Nicholas Synovic
Matt Hyatt
Taylor R. Schorlemmer
R. Sethi
Yung-Hsiang Lu
George K. Thiruvathukal
James C. Davis
27
64
0
05 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
Yang He
Lingao Xiao
3DPC
30
117
0
01 Mar 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
34
13
0
16 Feb 2023
On Achieving Privacy-Preserving State-of-the-Art Edge Intelligence
Daphnee Chabal
Dolly Sapra
Z. Mann
22
3
0
10 Feb 2023
DepGraph: Towards Any Structural Pruning
Gongfan Fang
Xinyin Ma
Mingli Song
Michael Bi Mi
Xinchao Wang
GNN
91
257
0
30 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
21
12
0
29 Jan 2023
Optimized learned entropy coding parameters for practical neural-based image and video compression
A. Said
Reza Pourreza
H. Le
MQ
30
2
0
20 Jan 2023
Causal Recurrent Variational Autoencoder for Medical Time Series Generation
Hongming Li
Shujian Yu
José C. Príncipe
CML
BDL
MedIm
28
47
0
16 Jan 2023
Comparative Study of Parameter Selection for Enhanced Edge Inference for a Multi-Output Regression model for Head Pose Estimation
A. Lindamulage
N. Kodagoda
Shyam Reyal
Pradeepa Samarasinghe
P. Yogarajah
CVBM
13
0
0
28 Dec 2022
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Danyang Liu
Xue Liu
22
0
0
24 Dec 2022
Efficient Speech Representation Learning with Low-Bit Quantization
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Abdel-rahman Mohamed
MQ
20
9
0
14 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
96
68
0
14 Dec 2022
Structured Pruning Adapters
Lukas Hedegaard
Aman Alok
Juby Jose
Alexandros Iosifidis
35
10
0
17 Nov 2022
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
Higher-order mutual information reveals synergistic sub-networks for multi-neuron importance
Kenzo Clauw
S. Stramaglia
Daniele Marinazzo
SSL
FAtt
30
6
0
01 Nov 2022
Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic Resonance Imaging Using Deep Learning Models: A Review
M. Jafari
A. Shoeibi
Marjane Khodatars
Navid Ghassemi
Parisa Moridian
...
Yu-Dong Zhang
Shui-Hua Wang
Juan M Gorriz
Hamid Alinejad-Rokny
U. Acharya
30
0
0
26 Oct 2022
Towards Global Neural Network Abstractions with Locally-Exact Reconstruction
Edoardo Manino
I. Bessa
Lucas C. Cordeiro
21
1
0
21 Oct 2022
Deep Learning for Iris Recognition: A Survey
Kien X. Nguyen
Hugo Proença
F. Alonso-Fernandez
VLM
3DV
20
49
0
12 Oct 2022
Energy Consumption of Neural Networks on NVIDIA Edge Boards: an Empirical Model
Seyyidahmed Lahmer
A. Khoshsirat
M. Rossi
Andrea Zanella
8
11
0
04 Oct 2022
Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile
Renzo Andri
Beatrice Bussolino
A. Cipolletta
Lukas Cavigelli
Zhe Wang
MQ
26
13
0
26 Sep 2022
Learning to Simulate Realistic LiDARs
Benoît Guillard
Sai H. Vemprala
Jayesh K. Gupta
O. Mikšík
Vibhav Vineet
Pascal Fua
Ashish Kapoor
3DPC
11
18
0
22 Sep 2022
Optimizing Connectivity through Network Gradients for Restricted Boltzmann Machines
A. C. N. D. Oliveira
Daniel R. Figueiredo
22
0
0
14 Sep 2022
Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
FAtt
AAML
24
19
0
07 Sep 2022
Reducing Computational Complexity of Neural Networks in Optical Channel Equalization: From Concepts to Implementation
Pedro J. Freire
A. Napoli
D. A. Ron
B. Spinnler
M. Anderson
W. Schairer
T. Bex
N. Costa
S. Turitsyn
Jaroslaw E. Prilepsky
27
28
0
26 Aug 2022
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar
Sidak Pal Singh
Dan Alistarh
MQ
20
216
0
24 Aug 2022