ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.05877
  4. Cited By
Quantization and Training of Neural Networks for Efficient
  Integer-Arithmetic-Only Inference

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
    MQ
ArXiv (abs)PDFHTML

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,298 papers shown
Title
Quantized Neural Networks for Low-Precision Accumulation with Guaranteed
  Overflow Avoidance
Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow Avoidance
Ian Colbert
Alessandro Pappalardo
Jakoba Petri-Koenig
MQ
31
4
0
31 Jan 2023
Self-Compressing Neural Networks
Self-Compressing Neural Networks
Szabolcs Cséfalvay
J. Imber
49
3
0
30 Jan 2023
The Hidden Power of Pure 16-bit Floating-Point Neural Networks
The Hidden Power of Pure 16-bit Floating-Point Neural Networks
Juyoung Yun
Byungkon Kang
Zhoulai Fu
MQ
42
1
0
30 Jan 2023
Towards Inference Efficient Deep Ensemble Learning
Towards Inference Efficient Deep Ensemble Learning
Ziyue Li
Kan Ren
Yifan Yang
Xinyang Jiang
Yuqing Yang
Dongsheng Li
BDL
60
14
0
29 Jan 2023
Improved knowledge distillation by utilizing backward pass knowledge in
  neural networks
Improved knowledge distillation by utilizing backward pass knowledge in neural networks
A. Jafari
Mehdi Rezagholizadeh
A. Ghodsi
37
1
0
27 Jan 2023
PowerQuant: Automorphism Search for Non-Uniform Quantization
PowerQuant: Automorphism Search for Non-Uniform Quantization
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
63
17
0
24 Jan 2023
Tailor: Altering Skip Connections for Resource-Efficient Inference
Tailor: Altering Skip Connections for Resource-Efficient Inference
Olivia Weng
Gabriel Marcano
Vladimir Loncar
Alireza Khodamoradi
Nojan Sheybani
Andres Meza
F. Koushanfar
K. Denolf
Javier Mauricio Duarte
Ryan Kastner
99
13
0
18 Jan 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of
  Quantized CNNs
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
97
2
0
15 Jan 2023
Transceiver Cooperative Learning-aided Semantic Communications Against
  Mismatched Background Knowledge Bases
Transceiver Cooperative Learning-aided Semantic Communications Against Mismatched Background Knowledge Bases
Yanhuan Wang
Shuaishuai Guo
23
7
0
09 Jan 2023
On the Convergence of Stochastic Gradient Descent in Low-precision
  Number Formats
On the Convergence of Stochastic Gradient Descent in Low-precision Number Formats
M. Cacciola
A. Frangioni
M. Asgharian
Alireza Ghaffari
V. Nia
86
4
0
04 Jan 2023
SAFEMYRIDES: Application of Decentralized Control Edge-Computing to
  Ridesharing Monitoring Services
SAFEMYRIDES: Application of Decentralized Control Edge-Computing to Ridesharing Monitoring Services
S. Elnagar
Manoj A. Thomas
Kweku-Muata A. Osei-Bryson
61
0
0
02 Jan 2023
FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep
  Neural Networks
FlatENN: Train Flat for Enhanced Fault Tolerance of Quantized Deep Neural Networks
Akul Malhotra
S. Gupta
35
0
0
29 Dec 2022
Quality at the Tail of Machine Learning Inference
Quality at the Tail of Machine Learning Inference
Zhengxin Yang
Wanling Gao
Chunjie Luo
Lei Wang
Fei Tang
Xu Wen
Jianfeng Zhan
76
1
0
25 Dec 2022
FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos
FFNeRV: Flow-Guided Frame-Wise Neural Representations for Videos
J. Lee
Daniel Rho
J. Ko
Eunbyung Park
76
45
0
23 Dec 2022
Training Integer-Only Deep Recurrent Neural Networks
Training Integer-Only Deep Recurrent Neural Networks
V. Nia
Eyyub Sari
Vanessa Courville
M. Asgharian
MQ
98
2
0
22 Dec 2022
Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network
  in Edge Computing
Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network in Edge Computing
Tian Dong
Ziyuan Zhang
Han Qiu
Tianwei Zhang
Hewu Li
T. Wang
AAML
84
6
0
22 Dec 2022
Walking Noise: On Layer-Specific Robustness of Neural Architectures
  against Noisy Computations and Associated Characteristic Learning Dynamics
Walking Noise: On Layer-Specific Robustness of Neural Architectures against Noisy Computations and Associated Characteristic Learning Dynamics
Hendrik Borras
Bernhard Klein
Holger Fröning
AAML
64
1
0
20 Dec 2022
Redistribution of Weights and Activations for AdderNet Quantization
Redistribution of Weights and Activations for AdderNet Quantization
Ying Nie
Kai Han
Haikang Diao
Chuanjian Liu
Enhua Wu
Yunhe Wang
MQ
96
6
0
20 Dec 2022
The case for 4-bit precision: k-bit Inference Scaling Laws
The case for 4-bit precision: k-bit Inference Scaling Laws
Tim Dettmers
Luke Zettlemoyer
MQ
112
234
0
19 Dec 2022
Atrous Space Bender U-Net (ASBU-Net/LogiNet)
Atrous Space Bender U-Net (ASBU-Net/LogiNet)
Anurag Bansal
O. Ostap
Miguel Maestre Trueba
Kristopher Perry
SSeg
95
1
0
16 Dec 2022
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of
  Vision Transformers
RepQ-ViT: Scale Reparameterization for Post-Training Quantization of Vision Transformers
Zhikai Li
Junrui Xiao
Lianwei Yang
Qingyi Gu
MQ
84
90
0
16 Dec 2022
NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device
  Super-Resolution
NAWQ-SR: A Hybrid-Precision NPU Engine for Efficient On-Device Super-Resolution
Stylianos I. Venieris
Mario Almeida
Royson Lee
Nicholas D. Lane
SupR
65
4
0
15 Dec 2022
Towards Hardware-Specific Automatic Compression of Neural Networks
Towards Hardware-Specific Automatic Compression of Neural Networks
Torben Krieger
Bernhard Klein
Holger Fröning
MQ
66
2
0
15 Dec 2022
Efficient Speech Representation Learning with Low-Bit Quantization
Efficient Speech Representation Learning with Low-Bit Quantization
Ching-Feng Yeh
Wei-Ning Hsu
Paden Tomasello
Abdel-rahman Mohamed
MQ
49
10
0
14 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference
  Metric
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
187
71
0
14 Dec 2022
Continuation KD: Improved Knowledge Distillation through the Lens of
  Continuation Optimization
Continuation KD: Improved Knowledge Distillation through the Lens of Continuation Optimization
A. Jafari
I. Kobyzev
Mehdi Rezagholizadeh
Pascal Poupart
A. Ghodsi
VLM
75
5
0
12 Dec 2022
Error-aware Quantization through Noise Tempering
Error-aware Quantization through Noise Tempering
Zheng Wang
Juncheng Billy Li
Shuhui Qu
Florian Metze
Emma Strubell
MQ
50
2
0
11 Dec 2022
QVIP: An ILP-based Formal Verification Approach for Quantized Neural
  Networks
QVIP: An ILP-based Formal Verification Approach for Quantized Neural Networks
Yedi Zhang
Zhe Zhao
Fu Song
Hao Fei
Tao Chen
Jun Sun
69
18
0
10 Dec 2022
Integration of a systolic array based hardware accelerator into a DNN
  operator auto-tuning framework
Integration of a systolic array based hardware accelerator into a DNN operator auto-tuning framework
Federico Nicolás Peccia
Oliver Bringmann
52
5
0
06 Dec 2022
QEBVerif: Quantization Error Bound Verification of Neural Networks
QEBVerif: Quantization Error Bound Verification of Neural Networks
Yedi Zhang
Fu Song
Jun Sun
MQ
99
12
0
06 Dec 2022
QFT: Post-training quantization via fast joint finetuning of all degrees
  of freedom
QFT: Post-training quantization via fast joint finetuning of all degrees of freedom
Alexander Finkelstein
Ella Fuchs
Idan Tal
Mark Grobman
Niv Vosco
Eldad Meller
MQ
79
7
0
05 Dec 2022
Make RepVGG Greater Again: A Quantization-aware Approach
Make RepVGG Greater Again: A Quantization-aware Approach
Xiangxiang Chu
Liang Li
Bo Zhang
MQ
136
51
0
03 Dec 2022
On-device Training: A First Overview on Existing Systems
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
142
17
0
01 Dec 2022
Pex: Memory-efficient Microcontroller Deep Learning through Partial
  Execution
Pex: Memory-efficient Microcontroller Deep Learning through Partial Execution
Edgar Liberis
Nicholas D. Lane
90
3
0
30 Nov 2022
Compressing Volumetric Radiance Fields to 1 MB
Compressing Volumetric Radiance Fields to 1 MB
Lingzhi Li
Zhen Shen
Zhongshu Wang
Li Shen
Liefeng Bo
79
67
0
29 Nov 2022
Quantization-aware Interval Bound Propagation for Training Certifiably
  Robust Quantized Neural Networks
Quantization-aware Interval Bound Propagation for Training Certifiably Robust Quantized Neural Networks
Mathias Lechner
Dorde Zikelic
K. Chatterjee
T. Henzinger
Daniela Rus
AAML
64
4
0
29 Nov 2022
Post-training Quantization on Diffusion Models
Post-training Quantization on Diffusion Models
Yuzhang Shang
Zhihang Yuan
Bin Xie
Bingzhe Wu
Yan Yan
DiffMMQ
154
182
0
28 Nov 2022
Join the High Accuracy Club on ImageNet with A Binary Neural Network
  Ticket
Join the High Accuracy Club on ImageNet with A Binary Neural Network Ticket
Nianhui Guo
Joseph Bethge
Christoph Meinel
Haojin Yang
MQ
114
20
0
23 Nov 2022
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large
  Language Models
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
Guangxuan Xiao
Ji Lin
Mickael Seznec
Hao Wu
Julien Demouth
Song Han
MQ
261
846
0
18 Nov 2022
CPT-V: A Contrastive Approach to Post-Training Quantization of Vision
  Transformers
CPT-V: A Contrastive Approach to Post-Training Quantization of Vision Transformers
N. Frumkin
Dibakar Gope
Diana Marculescu
ViTMQ
65
1
0
17 Nov 2022
Language models are good pathologists: using attention-based sequence
  reduction and text-pretrained transformers for efficient WSI classification
Language models are good pathologists: using attention-based sequence reduction and text-pretrained transformers for efficient WSI classification
Juan Pisula
Katarzyna Bozek
VLMMedIm
83
3
0
14 Nov 2022
FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on
  General Purpose CPUs
FullPack: Full Vector Utilization for Sub-Byte Quantized Inference on General Purpose CPUs
Hossein Katebi
Navidreza Asadi
M. Goudarzi
MQ
53
0
0
13 Nov 2022
Exploiting the Partly Scratch-off Lottery Ticket for Quantization-Aware
  Training
Exploiting the Partly Scratch-off Lottery Ticket for Quantization-Aware Training
Mingliang Xu
Gongrui Nan
Yuxin Zhang
Yong Li
Rongrong Ji
MQ
59
3
0
12 Nov 2022
ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning
ClassPruning: Speed Up Image Restoration Networks by Dynamic N:M Pruning
Yang Zhou
Yuda Song
Hui Qian
Xin Du
VLM
67
1
0
10 Nov 2022
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022
  challenge: Report
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Jin Zhang
Feng Zhang
G. Yu
...
Mingyang Qian
Huixin Ma
Yanan Li
Xiaotao Wang
Lei Lei
71
10
0
07 Nov 2022
Power Efficient Video Super-Resolution on Mobile NPUs with Deep
  Learning, Mobile AI & AIM 2022 challenge: Report
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Cheng-Ming Chiang
Hsien-Kai Kuo
Yu-Syuan Xu
...
Kele Xu
Li Liu
Zehua Cheng
Wenyi Lian
W. Lian
SupR
93
10
0
07 Nov 2022
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs,
  Mobile AI & AIM 2022 challenge: Report
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Maurizio Denna
Abdelbadie Younes
Ganzorig Gankhuyag
...
Jing Liu
Garas Gendy
Nabil Sabor
J. Hou
Guanghui He
SupRMQ
92
32
0
07 Nov 2022
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI &
  AIM 2022 Challenge: Report
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Grigory Malivenko
Radu Timofte
Lukasz Treszczotko
Xin-ke Chang
...
Dongwon Park
Seongmin Hong
Joonhee Lee
Seunggyu Lee
Sengsub Chun
84
17
0
07 Nov 2022
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI &
  AIM 2022 Challenge: Report
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Radu Timofte
Shuai Liu
Chaoyu Feng
Furui Bai
...
Xin Lou
Wei Zhou
Cong Pang
Haina Qin
Mingxuan Cai
96
24
0
07 Nov 2022
AskewSGD : An Annealed interval-constrained Optimisation method to train
  Quantized Neural Networks
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
89
4
0
07 Nov 2022
Previous
123...111213...242526
Next