ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2101.09671
  4. Cited By
Pruning and Quantization for Deep Neural Network Acceleration: A Survey

Pruning and Quantization for Deep Neural Network Acceleration: A Survey

24 January 2021
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
    MQ
ArXivPDFHTML

Papers citing "Pruning and Quantization for Deep Neural Network Acceleration: A Survey"

50 / 202 papers shown
Title
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Efficiency is Not Enough: A Critical Perspective of Environmentally Sustainable AI
Dustin Wright
Christian Igel
Gabrielle Samuel
Raghavendra Selvan
29
15
0
05 Sep 2023
SAF-IS: a Spatial Annotation Free Framework for Instance Segmentation of
  Surgical Tools
SAF-IS: a Spatial Annotation Free Framework for Instance Segmentation of Surgical Tools
Luca Sestini
Benoit Rosa
Elena De Momi
G. Ferrigno
N. Padoy
31
0
0
04 Sep 2023
Computation-efficient Deep Learning for Computer Vision: A Survey
Computation-efficient Deep Learning for Computer Vision: A Survey
Yulin Wang
Yizeng Han
Chaofei Wang
Shiji Song
Qi Tian
Gao Huang
VLM
34
20
0
27 Aug 2023
EFaR 2023: Efficient Face Recognition Competition
EFaR 2023: Efficient Face Recognition Competition
J. Kolf
Fadi Boutros
Jurek Elliesen
Markus Theuerkauf
Naser Damer
...
D. Nunes
Ahmad Hassanpour
Pankaj Khatiwada
A. Toor
Bian Yang
CVBM
MQ
24
13
0
08 Aug 2023
Advancing Frame-Dropping in Multi-Object Tracking-by-Detection Systems
  Through Event-Based Detection Triggering
Advancing Frame-Dropping in Multi-Object Tracking-by-Detection Systems Through Event-Based Detection Triggering
M. Henning
M. Buchholz
Klaus C. J. Dietmayer
19
0
0
01 Aug 2023
A Model for Every User and Budget: Label-Free and Personalized
  Mixed-Precision Quantization
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization
Edward Fish
Umberto Michieli
Mete Ozay
MQ
30
4
0
24 Jul 2023
Neural Image Compression: Generalization, Robustness, and Spectral
  Biases
Neural Image Compression: Generalization, Robustness, and Spectral Biases
Kelsey Lieberman
James Diffenderfer
Charles Godfrey
B. Kailkhura
24
4
0
17 Jul 2023
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for
  Ultra-Low-Power Edge Systems
Flexible and Fully Quantized Ultra-Lightweight TinyissimoYOLO for Ultra-Low-Power Edge Systems
Julian Moosmann
H. Mueller
Nicky Zimmerman
Georg Rutishauser
Luca Benini
Michele Magno
25
8
0
12 Jul 2023
Distilling Universal and Joint Knowledge for Cross-Domain Model
  Compression on Time Series Data
Distilling Universal and Joint Knowledge for Cross-Domain Model Compression on Time Series Data
Qing Xu
Min-man Wu
Xiaoli Li
K. Mao
Zhenghua Chen
19
5
0
07 Jul 2023
Cloud-Native Computing: A Survey from the Perspective of Services
Cloud-Native Computing: A Survey from the Perspective of Services
Shuiguang Deng
Hailiang Zhao
Binbin Huang
Cheng Zhang
Feiyi Chen
Yinuo Deng
Jianwei Yin
Schahram Dustdar
Albert Y. Zomaya
AI4TS
33
17
0
26 Jun 2023
Binary domain generalization for sparsifying binary neural networks
Binary domain generalization for sparsifying binary neural networks
Riccardo Schiavone
Francesco Galati
Maria A. Zuluaga
MQ
19
0
0
23 Jun 2023
Towards Exascale CFD Simulations Using the Discontinuous Galerkin Solver
  FLEXI
Towards Exascale CFD Simulations Using the Discontinuous Galerkin Solver FLEXI
Marcel P. Blind
Min Gao
Daniel Kempf
Patrick Kopper
Marius Kurz
A. Schwarz
Andrea Beck
AI4CE
20
4
0
22 Jun 2023
Neural Network Compression using Binarization and Few Full-Precision
  Weights
Neural Network Compression using Binarization and Few Full-Precision Weights
F. M. Nardini
Cosimo Rulli
Salvatore Trani
Rossano Venturini
MQ
19
1
0
15 Jun 2023
Resource Efficient Neural Networks Using Hessian Based Pruning
Resource Efficient Neural Networks Using Hessian Based Pruning
J. Chong
Manas Gupta
Lihui Chen
22
2
0
12 Jun 2023
E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural
  Networks
E-PANNs: Sound Recognition Using Efficient Pre-trained Audio Neural Networks
Arshdeep Singh
Haohe Liu
Mark D. Plumbley
VLM
14
4
0
30 May 2023
Compressing audio CNNs with graph centrality based filter pruning
Compressing audio CNNs with graph centrality based filter pruning
James A. King
Ashutosh Kumar Singh
Mark D. Plumbley
GNN
9
2
0
05 May 2023
Model Pruning Enables Localized and Efficient Federated Learning for
  Yield Forecasting and Data Sharing
Model Pruning Enables Localized and Efficient Federated Learning for Yield Forecasting and Data Sharing
An-dong Li
Milan Markovic
P. Edwards
Georgios Leontidis
FedML
22
16
0
19 Apr 2023
The Impact of Frame-Dropping on Performance and Energy Consumption for
  Multi-Object Tracking
The Impact of Frame-Dropping on Performance and Energy Consumption for Multi-Object Tracking
M. Henning
M. Buchholz
Klaus C. J. Dietmayer
13
1
0
17 Apr 2023
Patch-wise Features for Blur Image Classification
Patch-wise Features for Blur Image Classification
Sri Charan Kattamuru
Kshitij Agrawal
S. Adhikari
Abhishek Bose
Hemant Misra
24
1
0
06 Apr 2023
Efficient CNNs via Passive Filter Pruning
Efficient CNNs via Passive Filter Pruning
Arshdeep Singh
Mark D. Plumbley
21
1
0
05 Apr 2023
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior
  Refinement
Not All Features Matter: Enhancing Few-shot CLIP with Adaptive Prior Refinement
Xiang-yu Zhu
Renrui Zhang
Bowei He
A-Long Zhou
Dong Wang
Bingyan Zhao
Peng Gao
VLM
32
79
0
03 Apr 2023
Bayesian neural networks via MCMC: a Python-based tutorial
Bayesian neural networks via MCMC: a Python-based tutorial
Rohitash Chandra
Royce Chen
Joshua Simmons
BDL
31
10
0
02 Apr 2023
Common Subexpression-based Compression and Multiplication of Sparse
  Constant Matrices
Common Subexpression-based Compression and Multiplication of Sparse Constant Matrices
Emre Bilgili
A. Yurdakul
15
0
0
26 Mar 2023
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging
  Heterogeneous Memory Architectures
Energy-efficient Task Adaptation for NLP Edge Inference Leveraging Heterogeneous Memory Architectures
Zirui Fu
Aleksandre Avaliani
M. Donato
41
1
0
25 Mar 2023
FS-Real: Towards Real-World Cross-Device Federated Learning
FS-Real: Towards Real-World Cross-Device Federated Learning
Daoyuan Chen
Dawei Gao
Yuexiang Xie
Xuchen Pan
Zitao Li
Yaliang Li
Bolin Ding
Jingren Zhou
117
26
0
23 Mar 2023
An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep
  Learning Model Registry
An Empirical Study of Pre-Trained Model Reuse in the Hugging Face Deep Learning Model Registry
Wenxin Jiang
Nicholas Synovic
Matt Hyatt
Taylor R. Schorlemmer
R. Sethi
Yung-Hsiang Lu
George K. Thiruvathukal
James C. Davis
27
64
0
05 Mar 2023
Structured Pruning for Deep Convolutional Neural Networks: A survey
Structured Pruning for Deep Convolutional Neural Networks: A survey
Yang He
Lingao Xiao
3DPC
30
117
0
01 Mar 2023
A Comprehensive Review and a Taxonomy of Edge Machine Learning:
  Requirements, Paradigms, and Techniques
A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques
Wenbin Li
Hakim Hacid
Ebtesam Almazrouei
Merouane Debbah
34
13
0
16 Feb 2023
On Achieving Privacy-Preserving State-of-the-Art Edge Intelligence
On Achieving Privacy-Preserving State-of-the-Art Edge Intelligence
Daphnee Chabal
Dolly Sapra
Z. Mann
22
3
0
10 Feb 2023
DepGraph: Towards Any Structural Pruning
DepGraph: Towards Any Structural Pruning
Gongfan Fang
Xinyin Ma
Mingli Song
Michael Bi Mi
Xinchao Wang
GNN
91
257
0
30 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
21
12
0
29 Jan 2023
Optimized learned entropy coding parameters for practical neural-based
  image and video compression
Optimized learned entropy coding parameters for practical neural-based image and video compression
A. Said
Reza Pourreza
H. Le
MQ
30
2
0
20 Jan 2023
Causal Recurrent Variational Autoencoder for Medical Time Series
  Generation
Causal Recurrent Variational Autoencoder for Medical Time Series Generation
Hongming Li
Shujian Yu
José C. Príncipe
CML
BDL
MedIm
28
47
0
16 Jan 2023
Comparative Study of Parameter Selection for Enhanced Edge Inference for
  a Multi-Output Regression model for Head Pose Estimation
Comparative Study of Parameter Selection for Enhanced Edge Inference for a Multi-Output Regression model for Head Pose Estimation
A. Lindamulage
N. Kodagoda
Shyam Reyal
Pradeepa Samarasinghe
P. Yogarajah
CVBM
13
0
0
28 Dec 2022
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Pruning On-the-Fly: A Recoverable Pruning Method without Fine-tuning
Danyang Liu
Xue Liu
22
0
0
24 Dec 2022
Efficient Speech Representation Learning with Low-Bit Quantization
Efficient Speech Representation Learning with Low-Bit Quantization
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Abdel-rahman Mohamed
MQ
20
9
0
14 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference
  Metric
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
96
68
0
14 Dec 2022
Structured Pruning Adapters
Structured Pruning Adapters
Lukas Hedegaard
Aman Alok
Juby Jose
Alexandros Iosifidis
35
10
0
17 Nov 2022
AskewSGD : An Annealed interval-constrained Optimisation method to train
  Quantized Neural Networks
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
Higher-order mutual information reveals synergistic sub-networks for
  multi-neuron importance
Higher-order mutual information reveals synergistic sub-networks for multi-neuron importance
Kenzo Clauw
S. Stramaglia
Daniele Marinazzo
SSL
FAtt
30
6
0
01 Nov 2022
Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic
  Resonance Imaging Using Deep Learning Models: A Review
Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic Resonance Imaging Using Deep Learning Models: A Review
M. Jafari
A. Shoeibi
Marjane Khodatars
Navid Ghassemi
Parisa Moridian
...
Yu-Dong Zhang
Shui-Hua Wang
Juan M Gorriz
Hamid Alinejad-Rokny
U. Acharya
30
0
0
26 Oct 2022
Towards Global Neural Network Abstractions with Locally-Exact
  Reconstruction
Towards Global Neural Network Abstractions with Locally-Exact Reconstruction
Edoardo Manino
I. Bessa
Lucas C. Cordeiro
21
1
0
21 Oct 2022
Deep Learning for Iris Recognition: A Survey
Deep Learning for Iris Recognition: A Survey
Kien X. Nguyen
Hugo Proencca
F. Alonso-Fernandez
VLM
3DV
20
49
0
12 Oct 2022
Energy Consumption of Neural Networks on NVIDIA Edge Boards: an
  Empirical Model
Energy Consumption of Neural Networks on NVIDIA Edge Boards: an Empirical Model
Seyyidahmed Lahmer
A. Khoshsirat
M. Rossi
Andrea Zanella
8
11
0
04 Oct 2022
Going Further With Winograd Convolutions: Tap-Wise Quantization for
  Efficient Inference on 4x4 Tile
Going Further With Winograd Convolutions: Tap-Wise Quantization for Efficient Inference on 4x4 Tile
Renzo Andri
Beatrice Bussolino
A. Cipolletta
Lukas Cavigelli
Zhe Wang
MQ
26
13
0
26 Sep 2022
Learning to Simulate Realistic LiDARs
Learning to Simulate Realistic LiDARs
Benoît Guillard
Sai H. Vemprala
Jayesh K. Gupta
O. Mikšík
Vibhav Vineet
Pascal Fua
Ashish Kapoor
3DPC
11
18
0
22 Sep 2022
Optimizing Connectivity through Network Gradients for Restricted
  Boltzmann Machines
Optimizing Connectivity through Network Gradients for Restricted Boltzmann Machines
A. C. N. D. Oliveira
Daniel R. Figueiredo
22
0
0
14 Sep 2022
Interpretations Steered Network Pruning via Amortized Inferred Saliency
  Maps
Interpretations Steered Network Pruning via Amortized Inferred Saliency Maps
Alireza Ganjdanesh
Shangqian Gao
Heng-Chiao Huang
FAtt
AAML
24
19
0
07 Sep 2022
Reducing Computational Complexity of Neural Networks in Optical Channel
  Equalization: From Concepts to Implementation
Reducing Computational Complexity of Neural Networks in Optical Channel Equalization: From Concepts to Implementation
Pedro J. Freire
A. Napoli
D. A. Ron
B. Spinnler
M. Anderson
W. Schairer
T. Bex
N. Costa
S. Turitsyn
Jaroslaw E. Prilepsky
27
28
0
26 Aug 2022
Optimal Brain Compression: A Framework for Accurate Post-Training
  Quantization and Pruning
Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning
Elias Frantar
Sidak Pal Singh
Dan Alistarh
MQ
20
216
0
24 Aug 2022
Previous
12345
Next