Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1712.05877
Cited By
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
50 / 1,298 papers shown
Title
Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification
Nicolo Colombo
Yang Gao
49
2
0
11 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
119
72
0
02 Sep 2020
One Shot 3D Photography
Johannes Kopf
Kevin Blackburn-Matzen
Suhib Alsisan
Ocean Quigley
Francis Ge
...
Peizhao Zhang
Zijian He
Peter Vajda
Ayush Saraf
Michael F. Cohen
110
80
0
27 Aug 2020
One Weight Bitwidth to Rule Them All
Ting-Wu Chin
P. Chuang
Vikas Chandra
Diana Marculescu
MQ
67
25
0
22 Aug 2020
Data-Independent Structured Pruning of Neural Networks via Coresets
Ben Mussay
Dan Feldman
Samson Zhou
Vladimir Braverman
Margarita Osadchy
80
26
0
19 Aug 2020
Channel-wise Hessian Aware trace-Weighted Quantization of Neural Networks
Xu Qian
Victor Li
Darren Crews
MQ
51
9
0
19 Aug 2020
Discovering Multi-Hardware Mobile Models via Architecture Search
Grace Chu
Okan Arikan
Gabriel Bender
Weijun Wang
Achille Brighton
Pieter-Jan Kindermans
Hanxiao Liu
Berkin Akin
Suyog Gupta
Andrew G. Howard
MQ
91
16
0
18 Aug 2020
Finding Fast Transformers: One-Shot Neural Architecture Search by Component Composition
Henry Tsai
Jayden Ooi
Chun-Sung Ferng
Hyung Won Chung
Jason Riesa
ViT
80
21
0
15 Aug 2020
SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud
Stefanos Laskaridis
Stylianos I. Venieris
Mario Almeida
Ilias Leontiadis
Nicholas D. Lane
102
276
0
14 Aug 2020
Weight Equalizing Shift Scaler-Coupled Post-training Quantization
Jihun Oh
Sangjeong Lee
Meejeong Park
Pooni Walagaurav
K. Kwon
MQ
69
1
0
13 Aug 2020
Leveraging Automated Mixed-Low-Precision Quantization for tiny edge microcontrollers
Manuele Rusci
Marco Fariselli
Alessandro Capotondi
Luca Benini
MQ
65
17
0
12 Aug 2020
FATNN: Fast and Accurate Ternary Neural Networks
Peng Chen
Bohan Zhuang
Chunhua Shen
MQ
50
15
0
12 Aug 2020
Degree-Quant: Quantization-Aware Training for Graph Neural Networks
Shyam A. Tailor
Javier Fernandez-Marques
Nicholas D. Lane
GNN
MQ
82
145
0
11 Aug 2020
Hardware-Centric AutoML for Mixed-Precision Quantization
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
62
15
0
11 Aug 2020
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
Eunhyeok Park
S. Yoo
MQ
59
85
0
11 Aug 2020
Neural Compression and Filtering for Edge-assisted Real-time Object Detection in Challenged Networks
Yoshitomo Matsubara
Marco Levorato
81
55
0
31 Jul 2020
WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic
Renkun Ni
Hong-Min Chu
Oscar Castañeda
Ping Yeh-Chiang
Christoph Studer
Tom Goldstein
MQ
56
14
0
26 Jul 2020
TinyTL: Reduce Activations, Not Trainable Parameters for Efficient On-Device Learning
Han Cai
Chuang Gan
Ligeng Zhu
Song Han
85
53
0
22 Jul 2020
The Effects of Approximate Multiplication on Convolutional Neural Networks
Min Soo Kim
A. D. Del Barrio
Hyunjin Kim
N. Bagherzadeh
41
47
0
20 Jul 2020
Search What You Want: Barrier Panelty NAS for Mixed Precision Quantization
Haibao Yu
Qi Han
Jianbo Li
Jianping Shi
Guangliang Cheng
Bin Fan
MQ
81
61
0
20 Jul 2020
HMQ: Hardware Friendly Mixed Precision Quantization Block for CNNs
H. Habi
Roy H. Jennings
Arnon Netzer
MQ
72
66
0
20 Jul 2020
DBQ: A Differentiable Branch Quantizer for Lightweight Deep Neural Networks
Hassan Dbouk
Hetul Sanghvi
M. Mehendale
Naresh R Shanbhag
MQ
51
9
0
19 Jul 2020
AQD: Towards Accurate Fully-Quantized Object Detection
Peng Chen
Jing Liu
Bohan Zhuang
Mingkui Tan
Chunhua Shen
MQ
95
9
0
14 Jul 2020
T-Basis: a Compact Representation for Neural Networks
Anton Obukhov
M. Rakhuba
Stamatios Georgoulis
Menelaos Kanakis
Dengxin Dai
Luc Van Gool
114
27
0
13 Jul 2020
AUSN: Approximately Uniform Quantization by Adaptively Superimposing Non-uniform Distribution for Deep Neural Networks
Fangxin Liu
Wenbo Zhao
Yanzhi Wang
Changzhi Dai
Li Jiang
MQ
58
3
0
08 Jul 2020
Progressive Tandem Learning for Pattern Recognition with Deep Spiking Neural Networks
Jibin Wu
Chenglin Xu
Daquan Zhou
Haizhou Li
Kay Chen Tan
67
118
0
02 Jul 2020
Private Speech Classification with Secure Multiparty Computation
Kyle Bittner
Martine De Cock
Rafael Dowsley
70
1
0
01 Jul 2020
EasyQuant: Post-training Quantization via Scale Optimization
Di Wu
Qingming Tang
Yongle Zhao
Ming Zhang
Ying Fu
Debing Zhang
MQ
84
78
0
30 Jun 2020
Efficient Integer-Arithmetic-Only Convolutional Neural Networks
Hengrui Zhao
Dong Liu
Houqiang Li
MQ
46
4
0
21 Jun 2020
Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Animesh Jain
Shoubhik Bhattacharya
Masahiro Masuda
Vin Sharma
Yida Wang
MQ
92
34
0
18 Jun 2020
FrostNet: Towards Quantization-Aware Network Architecture Search
Taehoon Kim
Y. Yoo
Jihoon Yang
MQ
53
2
0
17 Jun 2020
Quantization of Acoustic Model Parameters in Automatic Speech Recognition Framework
Amrutha Prasad
P. Motlícek
S. Madikeri
MQ
60
10
0
16 Jun 2020
CNN Acceleration by Low-rank Approximation with Quantized Factors
Nikolay Kozyrskiy
Anh-Huy Phan
MQ
43
3
0
16 Jun 2020
APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
Tianzhe Wang
Kuan-Chieh Wang
Han Cai
Ji Lin
Zhijian Liu
Song Han
MQ
85
176
0
15 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
122
129
0
14 Jun 2020
CoDeNet: Efficient Deployment of Input-Adaptive Object Detection on Embedded FPGAs
Zhen Dong
Dequan Wang
Qijing Huang
Yizhao Gao
Yaohui Cai
Tian Li
Bichen Wu
Kurt Keutzer
J. Wawrzynek
ObjD
57
1
0
12 Jun 2020
SECure: A Social and Environmental Certificate for AI Systems
Abhishek Gupta
Camylle Lanteigne
Sara Kingsley
61
13
0
11 Jun 2020
Neural Network Activation Quantization with Bitwise Information Bottlenecks
Xichuan Zhou
Kui Liu
Cong Shi
Haijun Liu
Ji Liu
MQ
54
1
0
09 Jun 2020
Automated Design Space Exploration for optimised Deployment of DNN on Arm Cortex-A CPUs
Miguel de Prado
Andrew Mundy
Rabia Saeed
Maurizo Denna
Nuria Pazos
Luca Benini
72
11
0
09 Jun 2020
Linformer: Self-Attention with Linear Complexity
Sinong Wang
Belinda Z. Li
Madian Khabsa
Han Fang
Hao Ma
247
1,720
0
08 Jun 2020
Conditional Neural Architecture Search
Sheng-Chun Kao
Arun Ramamurthy
Reed Williams
T. Krishna
28
0
0
06 Jun 2020
Generative Design of Hardware-aware DNNs
Sheng-Chun Kao
Arun Ramamurthy
T. Krishna
MQ
37
2
0
06 Jun 2020
An Overview of Neural Network Compression
James OÑeill
AI4CE
160
100
0
05 Jun 2020
Exploring the Potential of Low-bit Training of Convolutional Neural Networks
Kai Zhong
Xuefei Ning
Guohao Dai
Zhenhua Zhu
Tianchen Zhao
Shulin Zeng
Yu Wang
Huazhong Yang
MQ
79
9
0
04 Jun 2020
Weight Pruning via Adaptive Sparsity Loss
George Retsinas
Athena Elafrou
G. Goumas
Petros Maragos
64
10
0
04 Jun 2020
A Feature-map Discriminant Perspective for Pruning Deep Neural Networks
Zejiang Hou
S. Kung
35
5
0
28 May 2020
Accelerating Neural Network Inference by Overflow Aware Quantization
Hongwei Xie
Shuo Zhang
Huanghao Ding
Yafei Song
Baitao Shao
Conggang Hu
Lingyi Cai
Mingyang Li
MQ
18
0
0
27 May 2020
A Protection against the Extraction of Neural Network Models
H. Chabanne
Vincent Despiegel
Linda Guiga
FedML
83
5
0
26 May 2020
Conditionally Deep Hybrid Neural Networks Across Edge and Cloud
Yinghan Long
I. Chakraborty
Kaushik Roy
29
4
0
21 May 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
Igor Fedorov
Marko Stamenovic
Carl R. Jensen
Li-Chia Yang
Ari Mandell
Yiming Gan
Matthew Mattina
P. Whatmough
69
98
0
20 May 2020
Previous
1
2
3
...
21
22
23
24
25
26
Next