Low-bit Quantization of Neural Networks for Efficient Inference

18 February 2019
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
    MQ

Papers citing "Low-bit Quantization of Neural Networks for Efficient Inference"

50 / 182 papers shown
Attention Round for Post-Training Quantization
Huabin Diao
Gongyang Li
Shaoyun Xu
Yuexing Hao
MQ
13
10
0
07 Jul 2022
QuantFace: Towards Lightweight Face Recognition by Synthetic Data Low-bit Quantization
Fadi Boutros
Naser Damer
Arjan Kuijper
CVBM
MQ
22
37
0
21 Jun 2022
Channel-wise Mixed-precision Assignment for DNN Inference on Constrained Edge Nodes
Matteo Risso
Alessio Burrello
Luca Benini
Enrico Macii
M. Poncino
Daniele Jahier Pagliari
MQ
16
11
0
17 Jun 2022
FCN-Pose: A Pruned and Quantized CNN for Robot Pose Estimation for Constrained Devices
M. Dantas
I. R. R. Silva
A. T. O. Filho
Gibson B. N. Barbosa
Daniel Bezerra
D. Sadok
J. Kelner
M. Marquezini
Ricardo F. D. Silva
17
1
0
26 May 2022
Fast matrix multiplication for binary and ternary CNNs on ARM CPU
A. Trusov
E. Limonova
D. Nikolaev
V. Arlazarov
MQ
27
5
0
18 May 2022
A Comprehensive Survey on Model Quantization for Deep Neural Networks in Image Classification
Babak Rokh
A. Azarpeyvand
Alireza Khanteymoori
MQ
30
84
0
14 May 2022
Adaptive Differential Filters for Fast and Communication-Efficient Federated Learning
Daniel Becking
H. Kirchhoffer
G. Tech
Paul Haase
Karsten Müller
H. Schwarz
Wojciech Samek
FedML
29
4
0
09 Apr 2022
SPIQ: Data-Free Per-Channel Static Input Quantization
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
16
18
0
28 Mar 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
20
6
0
22 Mar 2022
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
Xiuying Wei
Ruihao Gong
Yuhang Li
Xianglong Liu
F. Yu
MQ
VLM
19
166
0
11 Mar 2022
Patch Similarity Aware Data-Free Quantization for Vision Transformers
Zhikai Li
Liping Ma
Mengjuan Chen
Junrui Xiao
Qingyi Gu
MQ
ViT
17
44
0
04 Mar 2022
Energy-Efficient Respiratory Anomaly Detection in Premature Newborn Infants
A. Paul
Md. Abu Saleh Tajin
Anup Das
W. Mongan
K. Dandekar
29
11
0
21 Feb 2022
SQuant: On-the-Fly Data-Free Quantization via Diagonal Hessian Approximation
Cong Guo
Yuxian Qiu
Jingwen Leng
Xiaotian Gao
Chen Zhang
Yunxin Liu
Fan Yang
Yuhao Zhu
Minyi Guo
MQ
72
70
0
14 Feb 2022
F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
Qing Jin
Jian Ren
Richard Zhuang
Sumant Hanumante
Zhengang Li
Zhiyu Chen
Yanzhi Wang
Kai-Min Yang
Sergey Tulyakov
MQ
24
48
0
10 Feb 2022
Quantune: Post-training Quantization of Convolutional Neural Networks using Extreme Gradient Boosting for Fast Deployment
Jemin Lee
Misun Yu
Yongin Kwon
Teaho Kim
MQ
19
17
0
10 Feb 2022
Post-training Quantization for Neural Networks with Provable Guarantees
Jinjie Zhang
Yixuan Zhou
Rayan Saab
MQ
23
32
0
26 Jan 2022
TerViT: An Efficient Ternary Vision Transformer
Sheng Xu
Yanjing Li
Teli Ma
Bo-Wen Zeng
Baochang Zhang
Peng Gao
Jinhu Lv
ViT
23
11
0
20 Jan 2022
Q-ViT: Fully Differentiable Quantization for Vision Transformer
Zhexin Li
Tong Yang
Peisong Wang
Jian Cheng
ViT
MQ
33
41
0
19 Jan 2022
N3H-Core: Neuron-designed Neural Network Accelerator via FPGA-based Heterogeneous Computing Cores
Yu Gong
Zhihang Xu
Zhezhi He
Weifeng Zhang
Xiaobing Tu
Xiaoyao Liang
Li Jiang
25
13
0
15 Dec 2021
A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics
Prasen Kumar Sharma
Arun Abraham
V. N. Rajendiran
MQ
25
7
0
06 Dec 2021
FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
Yang Lin
Tianyu Zhang
Peiqin Sun
Zheng Li
Shuchang Zhou
ViT
MQ
21
147
0
27 Nov 2021
PTQ4ViT: Post-training quantization for vision transformers with twin uniform quantization
Zhihang Yuan
Chenhao Xue
Yiqi Chen
Qiang Wu
Guangyu Sun
ViT
MQ
17
130
0
24 Nov 2021
Variability-Aware Training and Self-Tuning of Highly Quantized DNNs for Analog PIM
Zihao Deng
Michael Orshansky
MQ
42
6
0
11 Nov 2021
Edge-Cloud Polarization and Collaboration: A Comprehensive Survey for AI
Jiangchao Yao
Shengyu Zhang
Yang Yao
Feng Wang
Jianxin Ma
...
Kun Kuang
Chao-Xiang Wu
Fei Wu
Jingren Zhou
Hongxia Yang
24
91
0
11 Nov 2021
Self-Compression in Bayesian Neural Networks
Giuseppina Carannante
Dimah Dera
Ghulam Rasool
N. Bouaynaya
UQCV
BDL
33
5
0
10 Nov 2021
MQBench: Towards Reproducible and Deployable Model Quantization Benchmark
Yuhang Li
Mingzhu Shen
Jian Ma
Yan Ren
Mingxin Zhao
Qi Zhang
Ruihao Gong
F. Yu
Junjie Yan
MQ
35
49
0
05 Nov 2021
Qu-ANTI-zation: Exploiting Quantization Artifacts for Achieving Adversarial Outcomes
Sanghyun Hong
Michael-Andrei Panaitescu-Liess
Yigitcan Kaya
Tudor Dumitras
MQ
60
13
0
26 Oct 2021
Applications and Techniques for Fast Machine Learning in Science
A. Deiana
Nhan Tran
Joshua C. Agar
Michaela Blott
G. D. Guglielmo
...
Ashish Sharma
S. Summers
Pietro Vischia
J. Vlimant
Olivia Weng
14
71
0
25 Oct 2021
PTQ-SL: Exploring the Sub-layerwise Post-training Quantization
Zhihang Yuan
Yiqi Chen
Chenhao Xue
Chenguang Zhang
Qiankun Wang
Guangyu Sun
MQ
11
3
0
15 Oct 2021
Compact CNN Models for On-device Ocular-based User Recognition in Mobile Devices
Ali Almadan
A. Rattani
CVBM
10
9
0
11 Oct 2021
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
25
133
0
27 Sep 2021
Diverse Sample Generation: Pushing the Limit of Generative Data-free Quantization
Haotong Qin
Yifu Ding
Xiangguo Zhang
Jiakai Wang
Xianglong Liu
Jiwen Lu
DiffM
MQ
21
49
0
01 Sep 2021
Full-Cycle Energy Consumption Benchmark for Low-Carbon Computer Vision
Bo-wen Li
Xinyang Jiang
Donglin Bai
Yuge Zhang
Ningxin Zheng
Xuanyi Dong
Lu Liu
Yuqing Yang
Dongsheng Li
14
10
0
30 Aug 2021
Auto-Split: A General Framework of Collaborative Edge-Cloud AI
Amin Banitalebi-Dehkordi
Naveen Vedula
J. Pei
Fei Xia
Lanjun Wang
Yong Zhang
22
89
0
30 Aug 2021
Machine Learning Advances aiding Recognition and Classification of Indian Monuments and Landmarks
A. Paul
S. Ghose
K. Aggarwal
Niketha Nethaji
Shivam Pal
Arnab Dutta Purkayastha
20
9
0
29 Jul 2021
Post-Training Quantization for Vision Transformer
Zhenhua Liu
Yunhe Wang
Kai Han
Siwei Ma
Wen Gao
ViT
MQ
56
326
0
27 Jun 2021
Towards Efficient Full 8-bit Integer DNN Online Training on Resource-limited Devices without Batch Normalization
Yukuan Yang
Xiaowei Chi
Lei Deng
Tianyi Yan
Feng Gao
Guoqi Li
MQ
23
6
0
27 May 2021
Post-Training Sparsity-Aware Quantization
Gil Shomron
F. Gabbay
Samer Kurzum
U. Weiser
MQ
39
33
0
23 May 2021
Automated Backend-Aware Post-Training Quantization
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
37
2
0
27 Mar 2021
RCT: Resource Constrained Training for Edge AI
Tian Huang
Tao Luo
Ming Yan
Qiufeng Wang
Rick Siow Mong Goh
33
8
0
26 Mar 2021
Diversifying Sample Generation for Accurate Data-Free Quantization
Xiangguo Zhang
Haotong Qin
Yifu Ding
Ruihao Gong
Qing Yan
Renshuai Tao
Yuhang Li
F. Yu
Xianglong Liu
MQ
56
94
0
01 Mar 2021
On the Effects of Quantisation on Model Uncertainty in Bayesian Neural Networks
Martin Ferianc
Partha P. Maji
Matthew Mattina
Miguel R. D. Rodrigues
UQCV
BDL
22
9
0
22 Feb 2021
Confounding Tradeoffs for Neural Network Quantization
Sahaj Garg
Anirudh Jain
Joe Lou
Mitchell Nahmias
MQ
21
17
0
12 Feb 2021
BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction
Yuhang Li
Ruihao Gong
Xu Tan
Yang Yang
Peng Hu
Qi Zhang
F. Yu
Wei Wang
Shi Gu
MQ
24
416
0
10 Feb 2021
Fixed-point Quantization of Convolutional Neural Networks for Quantized Inference on Embedded Platforms
Rishabh Goyal
Joaquin Vanschoren
V. V. Acht
S. Nijssen
MQ
27
23
0
03 Feb 2021
Rethinking Floating Point Overheads for Mixed Precision DNN Accelerators
Hamzah Abdel-Aziz
Ali Shafiee
J. Shin
A. Pedram
Joseph Hassoun
MQ
42
10
0
27 Jan 2021
Generative Zero-shot Network Quantization
Xiangyu He
Qinghao Hu
Peisong Wang
Jian Cheng
GAN
MQ
28
23
0
21 Jan 2021
Hybrid and Non-Uniform quantization methods using retro synthesis data for efficient inference
Gvsl Tej Pratap
R. Kumar
MQ
24
1
0
26 Dec 2020
DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks
Chee Hong
Heewon Kim
Sungyong Baik
Junghun Oh
Kyoung Mu Lee
OOD
SupR
MQ
24
41
0
21 Dec 2020
Exploring Neural Networks Quantization via Layer-Wise Quantization Analysis
Shachar Gluska
Mark Grobman
MQ
16
5
0
15 Dec 2020