ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1712.05877
  4. Cited By
Quantization and Training of Neural Networks for Efficient
  Integer-Arithmetic-Only Inference

Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference

15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
    MQ
ArXiv (abs)PDFHTML

Papers citing "Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"

50 / 1,298 papers shown
Title
Disentangling Neural Architectures and Weights: A Case Study in
  Supervised Classification
Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification
Nicolo Colombo
Yang Gao
49
2
0
11 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network)
  Compression
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
119
72
0
02 Sep 2020
One Shot 3D Photography
One Shot 3D Photography
Johannes Kopf
Kevin Blackburn-Matzen
Suhib Alsisan
Ocean Quigley
Francis Ge
...
Peizhao Zhang
Zijian He
Peter Vajda
Ayush Saraf
Michael F. Cohen
110
80
0
27 Aug 2020
One Weight Bitwidth to Rule Them All
One Weight Bitwidth to Rule Them All
Ting-Wu Chin
P. Chuang
Vikas Chandra
Diana Marculescu
MQ
67
25
0
22 Aug 2020
Data-Independent Structured Pruning of Neural Networks via Coresets
Data-Independent Structured Pruning of Neural Networks via Coresets
Ben Mussay
Dan Feldman
Samson Zhou
Vladimir Braverman
Margarita Osadchy
80
26
0
19 Aug 2020
Channel-wise Hessian Aware trace-Weighted Quantization of Neural
  Networks
Channel-wise Hessian Aware trace-Weighted Quantization of Neural Networks
Xu Qian
Victor Li
Darren Crews
MQ
51
9
0
19 Aug 2020
Discovering Multi-Hardware Mobile Models via Architecture Search
Discovering Multi-Hardware Mobile Models via Architecture Search
Grace Chu
Okan Arikan
Gabriel Bender
Weijun Wang
Achille Brighton
Pieter-Jan Kindermans
Hanxiao Liu
Berkin Akin
Suyog Gupta
Andrew G. Howard
MQ
91
16
0
18 Aug 2020
Finding Fast Transformers: One-Shot Neural Architecture Search by
  Component Composition
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry Tsai
Jayden Ooi
Chun-Sung Ferng
Hyung Won Chung
Jason Riesa
ViT
80
21
0
15 Aug 2020
SPINN: Synergistic Progressive Inference of Neural Networks over Device
  and Cloud
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud
Stefanos Laskaridis
Stylianos I. Venieris
Mario Almeida
Ilias Leontiadis
Nicholas D. Lane
102
276
0
14 Aug 2020
Weight Equalizing Shift Scaler-Coupled Post-training Quantization
Weight Equalizing Shift Scaler-Coupled Post-training Quantization
Jihun Oh
Sangjeong Lee
Meejeong Park
Pooni Walagaurav
K. Kwon
MQ
69
1
0
13 Aug 2020
Leveraging Automated Mixed-Low-Precision Quantization for tiny edge
  microcontrollers
Leveraging Automated Mixed-Low-Precision Quantization for tiny edge microcontrollers
Manuele Rusci
Marco Fariselli
Alessandro Capotondi
Luca Benini
MQ
65
17
0
12 Aug 2020
FATNN: Fast and Accurate Ternary Neural Networks
FATNN: Fast and Accurate Ternary Neural Networks
Peng Chen
Bohan Zhuang
Chunhua Shen
MQ
50
15
0
12 Aug 2020
Degree-Quant: Quantization-Aware Training for Graph Neural Networks
Degree-Quant: Quantization-Aware Training for Graph Neural Networks
Shyam A. Tailor
Javier Fernandez-Marques
Nicholas D. Lane
GNNMQ
82
145
0
11 Aug 2020
Hardware-Centric AutoML for Mixed-Precision Quantization
Hardware-Centric AutoML for Mixed-Precision Quantization
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
62
15
0
11 Aug 2020
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
Eunhyeok Park
S. Yoo
MQ
59
85
0
11 Aug 2020
Neural Compression and Filtering for Edge-assisted Real-time Object
  Detection in Challenged Networks
Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks
Yoshitomo Matsubara
Marco Levorato
81
55
0
31 Jul 2020
WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic
WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic
Renkun Ni
Hong-Min Chu
Oscar Castañeda
Ping Yeh-Chiang
Christoph Studer
Tom Goldstein
MQ
56
14
0
26 Jul 2020
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient
  On-Device Learning
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning
Han Cai
Chuang Gan
Ligeng Zhu
Song Han
85
53
0
22 Jul 2020
The Effects of Approximate Multiplication on Convolutional Neural
  Networks
The Effects of Approximate Multiplication on Convolutional Neural Networks
Min Soo Kim
A. D. Del Barrio
Hyunjin Kim
N. Bagherzadeh
41
47
0
20 Jul 2020
Search What You Want: Barrier Panelty NAS for Mixed Precision
  Quantization
Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization
Haibao Yu
Qi Han
Jianbo Li
Jianping Shi
Guangliang Cheng
Bin Fan
MQ
81
61
0
20 Jul 2020
HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs
HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs
H. Habi
Roy H. Jennings
Arnon Netzer
MQ
72
66
0
20 Jul 2020
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural
  Networks
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan Dbouk
Hetul Sanghvi
M. Mehendale
Naresh R Shanbhag
MQ
51
9
0
19 Jul 2020
AQD: Towards Accurate Fully-Quantized Object Detection
AQD: Towards Accurate Fully-Quantized Object Detection
Peng Chen
Jing Liu
Bohan Zhuang
Mingkui Tan
Chunhua Shen
MQ
95
9
0
14 Jul 2020
T-Basis: a Compact Representation for Neural Networks
T-Basis: a Compact Representation for Neural Networks
Anton Obukhov
M. Rakhuba
Stamatios Georgoulis
Menelaos Kanakis
Dengxin Dai
Luc Van Gool
114
27
0
13 Jul 2020
AUSN: Approximately Uniform Quantization by Adaptively Superimposing
  Non-uniform Distribution for Deep Neural Networks
AUSN: Approximately Uniform Quantization by Adaptively Superimposing Non-uniform Distribution for Deep Neural Networks
Fangxin Liu
Wenbo Zhao
Yanzhi Wang
Changzhi Dai
Li Jiang
MQ
58
3
0
08 Jul 2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking
  Neural Networks
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks
Jibin Wu
Chenglin Xu
Daquan Zhou
Haizhou Li
Kay Chen Tan
67
118
0
02 Jul 2020
Private Speech Classification with Secure Multiparty Computation
Private Speech Classification with Secure Multiparty Computation
Kyle Bittner
Martine De Cock
Rafael Dowsley
70
1
0
01 Jul 2020
EasyQuant: Post-training Quantization via Scale Optimization
EasyQuant: Post-training Quantization via Scale Optimization
Di Wu
Qingming Tang
Yongle Zhao
Ming Zhang
Ying Fu
Debing Zhang
MQ
84
78
0
30 Jun 2020
Efficient Integer-Arithmetic-Only Convolutional Neural Networks
Efficient Integer-Arithmetic-Only Convolutional Neural Networks
Hengrui Zhao
Dong Liu
Houqiang Li
MQ
46
4
0
21 Jun 2020
Efficient Execution of Quantized Deep Learning Models: A Compiler
  Approach
Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Animesh Jain
Shoubhik Bhattacharya
Masahiro Masuda
Vin Sharma
Yida Wang
MQ
92
34
0
18 Jun 2020
FrostNet: Towards Quantization-Aware Network Architecture Search
FrostNet: Towards Quantization-Aware Network Architecture Search
Taehoon Kim
Y. Yoo
Jihoon Yang
MQ
53
2
0
17 Jun 2020
Quantization of Acoustic Model Parameters in Automatic Speech
  Recognition Framework
Quantization of Acoustic Model Parameters in Automatic Speech Recognition Framework
Amrutha Prasad
P. Motlícek
S. Madikeri
MQ
60
10
0
16 Jun 2020
CNN Acceleration by Low-rank Approximation with Quantized Factors
CNN Acceleration by Low-rank Approximation with Quantized Factors
Nikolay Kozyrskiy
Anh-Huy Phan
MQ
43
3
0
16 Jun 2020
APQ: Joint Search for Network Architecture, Pruning and Quantization
  Policy
APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
Tianzhe Wang
Kuan-Chieh Wang
Han Cai
Ji Lin
Zhijian Liu
Song Han
MQ
85
176
0
15 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and
  Integer Programming
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
122
129
0
14 Jun 2020
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on
  Embedded FPGAs
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs
Zhen Dong
Dequan Wang
Qijing Huang
Yizhao Gao
Yaohui Cai
Tian Li
Bichen Wu
Kurt Keutzer
J. Wawrzynek
ObjD
57
1
0
12 Jun 2020
SECure: A Social and Environmental Certificate for AI Systems
SECure: A Social and Environmental Certificate for AI Systems
Abhishek Gupta
Camylle Lanteigne
Sara Kingsley
61
13
0
11 Jun 2020
Neural Network Activation Quantization with Bitwise Information
  Bottlenecks
Neural Network Activation Quantization with Bitwise Information Bottlenecks
Xichuan Zhou
Kui Liu
Cong Shi
Haijun Liu
Ji Liu
MQ
54
1
0
09 Jun 2020
Automated Design Space Exploration for optimised Deployment of DNN on
  Arm Cortex-A CPUs
Automated Design Space Exploration for optimised Deployment of DNN on Arm Cortex-A CPUs
Miguel de Prado
Andrew Mundy
Rabia Saeed
Maurizo Denna
Nuria Pazos
Luca Benini
72
11
0
09 Jun 2020
Linformer: Self-Attention with Linear Complexity
Linformer: Self-Attention with Linear Complexity
Sinong Wang
Belinda Z. Li
Madian Khabsa
Han Fang
Hao Ma
247
1,720
0
08 Jun 2020
Conditional Neural Architecture Search
Conditional Neural Architecture Search
Sheng-Chun Kao
Arun Ramamurthy
Reed Williams
T. Krishna
28
0
0
06 Jun 2020
Generative Design of Hardware-aware DNNs
Generative Design of Hardware-aware DNNs
Sheng-Chun Kao
Arun Ramamurthy
T. Krishna
MQ
37
2
0
06 Jun 2020
An Overview of Neural Network Compression
An Overview of Neural Network Compression
James OÑeill
AI4CE
160
100
0
05 Jun 2020
Exploring the Potential of Low-bit Training of Convolutional Neural
  Networks
Exploring the Potential of Low-bit Training of Convolutional Neural Networks
Kai Zhong
Xuefei Ning
Guohao Dai
Zhenhua Zhu
Tianchen Zhao
Shulin Zeng
Yu Wang
Huazhong Yang
MQ
79
9
0
04 Jun 2020
Weight Pruning via Adaptive Sparsity Loss
Weight Pruning via Adaptive Sparsity Loss
George Retsinas
Athena Elafrou
G. Goumas
Petros Maragos
64
10
0
04 Jun 2020
A Feature-map Discriminant Perspective for Pruning Deep Neural Networks
A Feature-map Discriminant Perspective for Pruning Deep Neural Networks
Zejiang Hou
S. Kung
35
5
0
28 May 2020
Accelerating Neural Network Inference by Overflow Aware Quantization
Accelerating Neural Network Inference by Overflow Aware Quantization
Hongwei Xie
Shuo Zhang
Huanghao Ding
Yafei Song
Baitao Shao
Conggang Hu
Lingyi Cai
Mingyang Li
MQ
18
0
0
27 May 2020
A Protection against the Extraction of Neural Network Models
A Protection against the Extraction of Neural Network Models
H. Chabanne
Vincent Despiegel
Linda Guiga
FedML
83
5
0
26 May 2020
Conditionally Deep Hybrid Neural Networks Across Edge and Cloud
Conditionally Deep Hybrid Neural Networks Across Edge and Cloud
Yinghan Long
I. Chakraborty
Kaushik Roy
29
4
0
21 May 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
Igor Fedorov
Marko Stamenovic
Carl R. Jensen
Li-Chia Yang
Ari Mandell
Yiming Gan
Matthew Mattina
P. Whatmough
69
98
0
20 May 2020
Previous
123...212223242526
Next