ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.05877
  4. Cited By
Quantization and Training of Neural Networks for Efficient
  Integer-Arithmetic-Only Inference

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
    MQ
ArXiv (abs)PDFHTML

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,298 papers shown
Title
Anchor-based Plain Net for Mobile Image Super-Resolution
Anchor-based Plain Net for Mobile Image Super-Resolution
Zongcai Du
Jie Liu
Jie Tang
Gangshan Wu
SupRMQ
61
52
0
20 May 2021
BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer
BatchQuant: Quantized-for-all Architecture Search with Robust Quantizer
Haoping Bai
Mengsi Cao
Ping Huang
Jiulong Shan
MQ
83
34
0
19 May 2021
Fast and Accurate Camera Scene Detection on Smartphones
Fast and Accurate Camera Scene Detection on Smartphones
Angeline Pouget
Sidharth Ramesh
Maximilian Giang
Ramithan Chandrapalan
Toni Tanner
Moritz Prussing
Radu Timofte
Andrey D. Ignatov
3DH
57
5
0
17 May 2021
Fast and Accurate Quantized Camera Scene Detection on Smartphones,
  Mobile AI 2021 Challenge: Report
Fast and Accurate Quantized Camera Scene Detection on Smartphones, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Grigory Malivenko
Radu Timofte
Sheng Chen
Xin Xia
...
K. Lyda
L. Khojoyan
Abhishek Thanki
Sayak Paul
Shahid Siddiqui
MQ
90
20
0
17 May 2021
Fast and Accurate Single-Image Depth Estimation on Mobile Devices,
  Mobile AI 2021 Challenge: Report
Fast and Accurate Single-Image Depth Estimation on Mobile Devices, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Grigory Malivenko
D. Plowman
Samarth Shukla
Radu Timofte
...
Tianpeng Feng
Yang Liu
Chuannan Sheng
Jian Yin
Fausto T. Benavide
MDE
74
36
0
17 May 2021
Real-Time Video Super-Resolution on Smartphones with Deep Learning,
  Mobile AI 2021 Challenge: Report
Real-Time Video Super-Resolution on Smartphones with Deep Learning, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Andrés Romero
Heewon Kim
Radu Timofte
C. Ho
...
Xiumei Wang
Jiaming Guo
Xueyi Zhou
Hao Jia
Youliang Yan
SupR
73
54
0
17 May 2021
Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI
  2021 Challenge: Report
Real-Time Quantized Image Super-Resolution on Mobile NPUs, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Radu Timofte
Maurizio Denna
Abdelrazak Younes
A. Lek
...
Kun Zeng
Peirong Li
Zhi-Hao Liu
Shiqi Xue
Shengpeng Wang
SupRMQ
64
60
0
17 May 2021
Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI
  2021 Challenge: Report
Fast Camera Image Denoising on Mobile GPUs with Deep Learning, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Kim Byeoung-su
Radu Timofte
Angeline Pouget
Fenglong Song
...
Lei Lei
Chaoyu Feng
L. Huang
Z. Lei
Feifei Chen
68
30
0
17 May 2021
Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021
  Challenge: Report
Learned Smartphone ISP on Mobile NPUs with Deep Learning, Mobile AI 2021 Challenge: Report
Andrey D. Ignatov
Cheng-Ming Chiang
Hsien-Kai Kuo
Anastasia Sycheva
Radu Timofte
...
K. Upla
Kiran Raja
Raghavendra Ramachandra
Christoph Busch
Etienne de Stoutz
86
48
0
17 May 2021
Texture Generation with Neural Cellular Automata
Texture Generation with Neural Cellular Automata
A. Mordvintsev
Eyvind Niklasson
E. Randazzo
47
9
0
15 May 2021
Lightweight Compression of Intermediate Neural Network Features for
  Collaborative Intelligence
Lightweight Compression of Intermediate Neural Network Features for Collaborative Intelligence
R. Cohen
Hyomin Choi
Ivan V. Bajić
54
24
0
15 May 2021
High-Performance FPGA-based Accelerator for Bayesian Neural Networks
High-Performance FPGA-based Accelerator for Bayesian Neural Networks
Hongxiang Fan
Martin Ferianc
Miguel R. D. Rodrigues
Hongyu Zhou
Xinyu Niu
Wayne Luk
BDL
58
23
0
12 May 2021
Agatha: Smart Contract for DNN Computation
Agatha: Smart Contract for DNN Computation
Zihan Zheng
Peichen Xie
Xian Zhang
Shuo Chen
Yang Chen
Xiaobing Guo
Guangzhong Sun
Guangyu Sun
Lidong Zhou
GNN
56
12
0
11 May 2021
In-Hindsight Quantization Range Estimation for Quantized Training
In-Hindsight Quantization Range Estimation for Quantized Training
Marios Fournarakis
Markus Nagel
MQ
49
10
0
10 May 2021
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge
  Distillation
KDExplainer: A Task-oriented Attention Model for Explaining Knowledge Distillation
Mengqi Xue
Mingli Song
Xinchao Wang
Ying Chen
Xingen Wang
Xiuming Zhang
55
10
0
10 May 2021
Pareto-Optimal Quantized ResNet Is Mostly 4-bit
Pareto-Optimal Quantized ResNet Is Mostly 4-bit
AmirAli Abdolrashidi
Lisa Wang
Shivani Agrawal
J. Malmaud
Oleg Rybakov
Chas Leichner
Lukasz Lew
MQ
71
36
0
07 May 2021
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model
  Compression
Encoding Weights of Irregular Sparsity for Fixed-to-Fixed Model Compression
Baeseong Park
S. Kwon
Daehwan Oh
Byeongwook Kim
Dongsoo Lee
63
4
0
05 May 2021
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Q-Rater: Non-Convex Optimization for Post-Training Uniform Quantization
Byeongwook Kim
Dongsoo Lee
Yeonju Ro
Yongkweon Jeon
S. Kwon
Baeseong Park
Daehwan Oh
MQ
53
1
0
05 May 2021
Stealthy Backdoors as Compression Artifacts
Stealthy Backdoors as Compression Artifacts
Yulong Tian
Fnu Suya
Fengyuan Xu
David Evans
94
22
0
30 Apr 2021
AttendSeg: A Tiny Attention Condenser Neural Network for Semantic
  Segmentation on the Edge
AttendSeg: A Tiny Attention Condenser Neural Network for Semantic Segmentation on the Edge
Xiaoyue Wen
M. Famouri
Andrew Hryniowski
Alexander Wong
SSeg
60
7
0
29 Apr 2021
Inspect, Understand, Overcome: A Survey of Practical Methods for AI
  Safety
Inspect, Understand, Overcome: A Survey of Practical Methods for AI Safety
Sebastian Houben
Stephanie Abrecht
Maram Akila
Andreas Bär
Felix Brockherde
...
Serin Varghese
Michael Weber
Sebastian J. Wirkert
Tim Wirtz
Matthias Woehrle
AAML
130
58
0
29 Apr 2021
ActNN: Reducing Training Memory Footprint via 2-Bit Activation
  Compressed Training
ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training
Jianfei Chen
Lianmin Zheng
Z. Yao
Dequan Wang
Ion Stoica
Michael W. Mahoney
Joseph E. Gonzalez
MQ
77
75
0
29 Apr 2021
An optical neural network using less than 1 photon per multiplication
An optical neural network using less than 1 photon per multiplication
Tianyu Wang
Shifan Ma
Logan G. Wright
Tatsuhiro Onodera
Brian C. Richard
Peter L. McMahon
105
185
0
27 Apr 2021
HAO: Hardware-aware neural Architecture Optimization for Efficient
  Inference
HAO: Hardware-aware neural Architecture Optimization for Efficient Inference
Zhen Dong
Yizhao Gao
Qijing Huang
J. Wawrzynek
Hayden Kwok-Hay So
Kurt Keutzer
79
37
0
26 Apr 2021
Quantization of Deep Neural Networks for Accurate Edge Computing
Quantization of Deep Neural Networks for Accurate Edge Computing
Wentao Chen
Hailong Qiu
Zhuang Jian
Chutong Zhang
Yu Hu
Qing Lu
Tianchen Wang
Yiyu Shi
Meiping Huang
Xiaowe Xu
96
24
0
25 Apr 2021
Piggyback GAN: Efficient Lifelong Learning for Image Conditioned
  Generation
Piggyback GAN: Efficient Lifelong Learning for Image Conditioned Generation
Mengyao Zhai
Lei Chen
Jiawei He
Megha Nawhal
Frederick Tung
Greg Mori
CLL
67
29
0
24 Apr 2021
Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of
  Quantization on Depthwise Separable Convolutional Networks Through the Eyes
  of Multi-scale Distributional Dynamics
Do All MobileNets Quantize Poorly? Gaining Insights into the Effect of Quantization on Depthwise Separable Convolutional Networks Through the Eyes of Multi-scale Distributional Dynamics
S. Yun
Alexander Wong
MQ
84
27
0
24 Apr 2021
Measuring what Really Matters: Optimizing Neural Networks for TinyML
Measuring what Really Matters: Optimizing Neural Networks for TinyML
Lennart Heim
Andreas Biri
Zhongnan Qu
Lothar Thiele
84
30
0
21 Apr 2021
DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device
DynO: Dynamic Onloading of Deep Neural Networks from Cloud to Device
Mario Almeida
Stefanos Laskaridis
Stylianos I. Venieris
Ilias Leontiadis
Nicholas D. Lane
75
37
0
20 Apr 2021
Distilling Knowledge via Knowledge Review
Distilling Knowledge via Knowledge Review
Pengguang Chen
Shu Liu
Hengshuang Zhao
Jiaya Jia
220
450
0
19 Apr 2021
Filtering Empty Camera Trap Images in Embedded Systems
Filtering Empty Camera Trap Images in Embedded Systems
Fagner Cunha
E. M. Santos
R. Barreto
J. Colonna
73
14
0
18 Apr 2021
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure
  DNN Accelerators
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators
David Stutz
Nandhini Chandramoorthy
Matthias Hein
Bernt Schiele
AAMLMQ
70
18
0
16 Apr 2021
All-You-Can-Fit 8-Bit Flexible Floating-Point Format for Accurate and
  Memory-Efficient Inference of Deep Neural Networks
All-You-Can-Fit 8-Bit Flexible Floating-Point Format for Accurate and Memory-Efficient Inference of Deep Neural Networks
Cheng-Wei Huang
Tim-Wei Chen
Juinn-Dar Huang
MQ
36
6
0
15 Apr 2021
Annealing Knowledge Distillation
Annealing Knowledge Distillation
A. Jafari
Mehdi Rezagholizadeh
Pranav Sharma
A. Ghodsi
98
79
0
14 Apr 2021
Combined Depth Space based Architecture Search For Person
  Re-identification
Combined Depth Space based Architecture Search For Person Re-identification
Hanjun Li
Gaojie Wu
Weishi Zheng
3DPC
83
107
0
09 Apr 2021
Content-Aware GAN Compression
Content-Aware GAN Compression
Yuchen Liu
Zhixin Shu
Yijun Li
Zhe Lin
Federico Perazzi
S. Kung
GAN
73
59
0
06 Apr 2021
TENT: Efficient Quantization of Neural Networks on the tiny Edge with
  Tapered FixEd PoiNT
TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT
H. F. Langroudi
Vedant Karia
Tej Pandit
Dhireesha Kudithipudi
MQ
57
10
0
06 Apr 2021
Faster Convolution Inference Through Using Pre-Calculated Lookup Tables
Faster Convolution Inference Through Using Pre-Calculated Lookup Tables
Grigor Gatchev
V. Mollov
VLM
39
0
0
04 Apr 2021
Inference of Recyclable Objects with Convolutional Neural Networks
Inference of Recyclable Objects with Convolutional Neural Networks
Jaime Caballero
Francisco Vergara
Randal Miranda
José Serracín
HAI
18
3
0
02 Apr 2021
Anytime Dense Prediction with Confidence Adaptivity
Anytime Dense Prediction with Confidence Adaptivity
Zhuang Liu
Zhiqiu Xu
H. Wang
Trevor Darrell
Evan Shelhamer
76
20
0
01 Apr 2021
Training Multi-bit Quantized and Binarized Networks with A Learnable
  Symmetric Quantizer
Training Multi-bit Quantized and Binarized Networks with A Learnable Symmetric Quantizer
Phuoc Pham
J. Abraham
Jaeyong Chung
MQ
81
13
0
01 Apr 2021
Bit-Mixer: Mixed-precision networks with runtime bit-width selection
Bit-Mixer: Mixed-precision networks with runtime bit-width selection
Adrian Bulat
Georgios Tzimiropoulos
MQ
77
27
0
31 Mar 2021
Integer-only Zero-shot Quantization for Efficient Speech Recognition
Integer-only Zero-shot Quantization for Efficient Speech Recognition
Sehoon Kim
A. Gholami
Z. Yao
Nicholas Lee
Patrick Wang
Aniruddha Nrusimha
Bohan Zhai
Tianren Gao
Michael W. Mahoney
Kurt Keutzer
MQ
101
25
0
31 Mar 2021
Slimmable Compressive Autoencoders for Practical Neural Image
  Compression
Slimmable Compressive Autoencoders for Practical Neural Image Compression
Feiyu Yang
Luis Herranz
Yongmei Cheng
M. Mozerov
64
66
0
29 Mar 2021
Zero-shot Adversarial Quantization
Zero-shot Adversarial Quantization
Yuang Liu
Wei Zhang
Jun Wang
MQ
114
79
0
29 Mar 2021
Automated Backend-Aware Post-Training Quantization
Automated Backend-Aware Post-Training Quantization
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
79
2
0
27 Mar 2021
A Practical Survey on Faster and Lighter Transformers
A Practical Survey on Faster and Lighter Transformers
Quentin Fournier
G. Caron
Daniel Aloise
137
105
0
26 Mar 2021
RCT: Resource Constrained Training for Edge AI
RCT: Resource Constrained Training for Edge AI
Tian Huang
Yaoyu Zhang
Ming Yan
Qiufeng Wang
Rick Siow Mong Goh
82
8
0
26 Mar 2021
Distilling a Powerful Student Model via Online Knowledge Distillation
Distilling a Powerful Student Model via Online Knowledge Distillation
Shaojie Li
Mingbao Lin
Yan Wang
Yongjian Wu
Yonghong Tian
Ling Shao
Rongrong Ji
FedML
117
49
0
26 Mar 2021
Dynamic Domain Adaptation for Efficient Inference
Dynamic Domain Adaptation for Efficient Inference
Shuang Li
Jinming Zhang
Wen-hui Ma
Chi Harold Liu
Wei Li
66
13
0
26 Mar 2021
Previous
123...181920...242526
Next