1712.05877
Cited By
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
Papers citing
"Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
48 / 1,298 papers shown
Cascaded Projection: End-to-End Network Compression and Acceleration
Breton L. Minnehan
Andreas E. Savakis
54
26
0
12 Mar 2019
Dynamic Multi-path Neural Network
Yingcheng Su
Shunfeng Zhou
Yichao Wu
Tian Su
Ding Liang
Xuebo Liu
Dixin Zheng
Yingxu Wang
Junjie Yan
Xiaolin Hu
18
3
0
28 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
86
366
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
86
81
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
105
98
0
15 Feb 2019
Understanding Chat Messages for Sticker Recommendation in Messaging Apps
Abhishek Laddha
Mohamed Hanoosh
Debdoot Mukherjee
Parth Patwa
Ankur Narang
45
17
0
07 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
81
87
0
05 Feb 2019
Towards Federated Learning at Scale: System Design
Keith Bonawitz
Hubert Eichner
W. Grieskamp
Dzmitry Huba
A. Ingerman
...
H. B. McMahan
Timon Van Overveldt
David Petrou
Daniel Ramage
Jason Roselander
FedML
132
2,685
0
04 Feb 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
152
312
0
28 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
53
32
0
24 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
George A. Constantinides
88
59
0
21 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
48
68
0
07 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
28
6
0
04 Jan 2019
Dynamic Runtime Feature Map Pruning
Tailin Liang
Lei Wang
Shaobo Shi
C. Glossner
3DPC
44
8
0
24 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
158
12
0
24 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
59
8
0
20 Dec 2018
Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)
A. Goncharenko
Andrey Denisov
S. Alyamkin
Evgeny Terentev
MQ
56
20
0
19 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
Rongrong Ji
82
130
0
11 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
31
1
0
05 Dec 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
41
5
0
27 Nov 2018
On Periodic Functions as Regularizers for Quantization of Neural Networks
Maxim Naumov
Utku Diril
Jongsoo Park
Benjamin Ray
Jedrzej Jablonski
Andrew Tulloch
MQ
50
25
0
24 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
90
154
0
22 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Wang
Zhijian Liu
Chengyue Wu
Ji Lin
Song Han
MQ
174
886
0
21 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
89
12
0
10 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
18
4
0
05 Nov 2018
Rethinking floating point for deep learning
Jeff Johnson
MQ
131
140
0
01 Nov 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH
OOD
90
54
0
28 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
103
132
0
03 Oct 2018
2018 Low-Power Image Recognition Challenge
S. Alyamkin
M. Ardi
Achille Brighton
Alexander C. Berg
Yiran Chen
...
George K. Thiruvathukal
Baiwu Zhang
Jingchi Zhang
Xiaopeng Zhang
Shaojie Zhuo
BDL
53
13
0
03 Oct 2018
Post-training 4-bit quantization of convolution networks for rapid-deployment
Ron Banner
Yury Nahshan
Elad Hoffer
Daniel Soudry
MQ
87
95
0
02 Oct 2018
AI Benchmark: Running Deep Neural Networks on Android Smartphones
Andrey D. Ignatov
Radu Timofte
William Chou
Ke Wang
Max Wu
Tim Hartley
Luc Van Gool
ELM
87
324
0
02 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
99
62
0
29 Sep 2018
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis
A. Wong
M. Shafiee
Brendan Chwyl
Francis Li
53
64
0
17 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
67
45
0
14 Sep 2018
Discretely Relaxing Continuous Variables for tractable Variational Inference
Trefor W. Evans
P. Nair
BDL
57
0
0
12 Sep 2018
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference
J. McKinstry
S. K. Esser
R. Appuswamy
Deepika Bablani
John V. Arthur
Izzet B. Yildiz
D. Modha
MQ
66
94
0
11 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
Lei Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Yue Liu
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
116
41
0
04 Sep 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
53
14
0
08 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
139
3,022
0
31 Jul 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
77
71
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
145
1,024
0
21 Jun 2018
Quantizing Convolutional Neural Networks for Low-Power High-Throughput Inference Engines
S. Settle
Manasa Bollavaram
P. D'Alberto
Elliott Delaye
Oscar Fernández
Nicholas J. Fraser
A. Ng
Ashish Sirasao
Michael Wu
MQ
46
21
0
21 May 2018
MobileFaceNets: Efficient CNNs for Accurate Real-Time Face Verification on Mobile Devices
Sheng Chen
Yang Liu
Xiang Gao
Zhen Han
CVBM
3DH
127
565
0
20 Apr 2018
DPRed: Making Typical Activation and Weight Values Matter In Deep Learning Computing
A. Delmas
Sayeh Sharify
Patrick Judd
Kevin Siu
Milos Nikolic
Andreas Moshovos
MQ
44
3
0
17 Apr 2018
NetAdapt: Platform-Aware Neural Network Adaptation for Mobile Applications
Tien-Ju Yang
Andrew G. Howard
Bo Chen
Xiao Zhang
Alec Go
Mark Sandler
Vivienne Sze
Hartwig Adam
150
522
0
09 Apr 2018
A Quantization-Friendly Separable Convolution for MobileNets
Tao Sheng
Chen Feng
Shaojie Zhuo
Xiaopeng Zhang
Liang Shen
M. Aleksic
MQ
79
115
0
22 Mar 2018
Universal Deep Neural Network Compression
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
151
88
0
07 Feb 2018
Learning K-way D-dimensional Discrete Code For Compact Embedding Representations
Ting Chen
Martin Renqiang Min
Yizhou Sun
71
10
0
08 Nov 2017