arXiv:1712.05877
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
15 December 2017
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
Papers citing
"Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference"
50 / 1,258 papers shown
Defensive Quantization: When Efficiency Meets Robustness
Ji Lin
Chuang Gan
Song Han
MQ
39
202
0
17 Apr 2019
Towards Real-Time Automatic Portrait Matting on Mobile Devices
Seokjun Seo
Seungwoo Choi
Martin Kersner
Beomjun Shin
Hyungsuk Yoon
Hyeongmin Byun
S. Ha
3DH
11
3
0
08 Apr 2019
Progressive Stochastic Binarization of Deep Networks
David Hartmann
Michael Wand
MQ
17
1
0
03 Apr 2019
Patchwork: A Patch-wise Attention Network for Efficient Object Detection and Segmentation in Video Streams
Yuning Chai
VOS
24
30
0
03 Apr 2019
Training Quantized Neural Networks with a Full-precision Auxiliary Module
Bohan Zhuang
Lingqiao Liu
Mingkui Tan
Chunhua Shen
Ian Reid
MQ
32
62
0
27 Mar 2019
Looking Fast and Slow: Memory-Guided Mobile Video Object Detection
Mason Liu
Menglong Zhu
Marie White
Yinxiao Li
Dmitry Kalenichenko
23
83
0
25 Mar 2019
Towards Optimal Structured CNN Pruning via Generative Adversarial Learning
Shaohui Lin
Rongrong Ji
Chenqian Yan
Baochang Zhang
Liujuan Cao
QiXiang Ye
Feiyue Huang
David Doermann
CVBM
11
504
0
22 Mar 2019
Trained Quantization Thresholds for Accurate and Efficient Fixed-Point Inference of Deep Neural Networks
Sambhav R. Jain
Albert Gural
Michael Wu
Chris Dick
MQ
16
147
0
19 Mar 2019
AttoNets: Compact and Efficient Deep Neural Networks for the Edge via Human-Machine Collaborative Design
A. Wong
Z. Q. Lin
Brendan Chwyl
HAI
14
14
0
18 Mar 2019
Cascaded Projection: End-to-End Network Compression and Acceleration
Breton L. Minnehan
Andreas E. Savakis
18
26
0
12 Mar 2019
Dynamic Multi-path Neural Network
Yingcheng Su
Shunfeng Zhou
Yichao Wu
Tian Su
Ding Liang
Xuebo Liu
Dixin Zheng
Yingxu Wang
Junjie Yan
Xiaolin Hu
11
2
0
28 Feb 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
33
355
0
18 Feb 2019
Mockingbird: Defending Against Deep-Learning-Based Website Fingerprinting Attacks with Adversarial Traces
Mohammad Saidur Rahman
Mohsen Imani
Nate Mathews
M. Wright
AAML
14
80
0
18 Feb 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
27
97
0
15 Feb 2019
Understanding Chat Messages for Sticker Recommendation in Messaging Apps
Abhishek Laddha
Mohamed Hanoosh
Debdoot Mukherjee
Parth Patwa
Ankur Narang
19
17
0
07 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
24
85
0
05 Feb 2019
Towards Federated Learning at Scale: System Design
Keith Bonawitz
Hubert Eichner
W. Grieskamp
Dzmitry Huba
A. Ingerman
...
H. B. McMahan
Timon Van Overveldt
David Petrou
Daniel Ramage
Jason Roselander
FedML
21
2,632
0
04 Feb 2019
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
50
305
0
28 Jan 2019
QGAN: Quantized Generative Adversarial Networks
Peiqi Wang
Dongsheng Wang
Yu Ji
Xinfeng Xie
Haoxuan Song
XuXin Liu
Yongqiang Lyu
Yuan Xie
GAN
MQ
13
32
0
24 Jan 2019
Deep Neural Network Approximation for Custom Hardware: Where We've Been, Where We're Going
Erwei Wang
James J. Davis
Ruizhe Zhao
Ho-Cheung Ng
Xinyu Niu
Wayne Luk
P. Cheung
George A. Constantinides
24
59
0
21 Jan 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
32
62
0
07 Jan 2019
Dataflow-based Joint Quantization of Weights and Activations for Deep Neural Networks
Xue Geng
Jie Fu
Bin Zhao
Jie Lin
M. Aly
C. Pal
V. Chandrasekhar
MQ
24
5
0
04 Jan 2019
Dynamic Runtime Feature Map Pruning
Tailin Liang
Lei Wang
Shaobo Shi
C. Glossner
3DPC
21
8
0
24 Dec 2018
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
21
12
0
24 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
27
8
0
20 Dec 2018
Fast Adjustable Threshold For Uniform Neural Network Quantization (Winning solution of LPIRC-II)
A. Goncharenko
Andrey Denisov
S. Alyamkin
Evgeny Terentev
MQ
17
20
0
19 Dec 2018
Exploiting Kernel Sparsity and Entropy for Interpretable CNN Compression
Yuchao Li
Shaohui Lin
Baochang Zhang
Jianzhuang Liu
David Doermann
Yongjian Wu
Feiyue Huang
Rongrong Ji
43
130
0
11 Dec 2018
Efficient and Robust Machine Learning for Real-World Systems
Franz Pernkopf
Wolfgang Roth
Matthias Zöhrer
Lukas Pfeifenberger
Günther Schindler
Holger Froening
Sebastian Tschiatschek
Robert Peharz
Matthew Mattina
Zoubin Ghahramani
OOD
27
1
0
05 Dec 2018
Efficient non-uniform quantizer for quantized neural network targeting reconfigurable hardware
Natan Liss
Chaim Baskin
A. Mendelson
A. Bronstein
Raja Giryes
MQ
24
5
0
27 Nov 2018
On Periodic Functions as Regularizers for Quantization of Neural Networks
Maxim Naumov
Utku Diril
Jongsoo Park
Benjamin Ray
Jedrzej Jablonski
Andrew Tulloch
MQ
11
25
0
24 Nov 2018
Structured Binary Neural Networks for Accurate Image Classification and Semantic Segmentation
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Lingqiao Liu
Ian Reid
MQ
27
152
0
22 Nov 2018
HAQ: Hardware-Aware Automated Quantization with Mixed Precision
Kuan-Chieh Jackson Wang
Zhijian Liu
Yujun Lin
Ji Lin
Song Han
MQ
71
872
0
21 Nov 2018
Fast On-the-fly Retraining-free Sparsification of Convolutional Neural Networks
Amir H. Ashouri
T. Abdelrahman
Alwyn Dos Remedios
MQ
16
12
0
10 Nov 2018
Dynamic Representations Toward Efficient Inference on Deep Neural Networks by Decision Gates
Mohammad Saeed Shafiee
M. Shafiee
A. Wong
AI4CE
12
4
0
05 Nov 2018
Rethinking floating point for deep learning
Jeff Johnson
MQ
11
135
0
01 Nov 2018
A Hitchhiker's Guide On Distributed Training of Deep Neural Networks
K. Chahal
Manraj Singh Grover
Kuntal Dey
3DH
OOD
6
53
0
28 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
36
131
0
03 Oct 2018
2018 Low-Power Image Recognition Challenge
S. Alyamkin
M. Ardi
Achille Brighton
Alexander C. Berg
Yiran Chen
...
George K. Thiruvathukal
Baiwu Zhang
Jingchi Zhang
Xiaopeng Zhang
Shaojie Zhuo
BDL
15
13
0
03 Oct 2018
Post-training 4-bit quantization of convolution networks for rapid-deployment
Ron Banner
Yury Nahshan
Elad Hoffer
Daniel Soudry
MQ
19
93
0
02 Oct 2018
AI Benchmark: Running Deep Neural Networks on Android Smartphones
Andrey D. Ignatov
Radu Timofte
William Chou
Ke Wang
Max Wu
Tim Hartley
Luc Van Gool
ELM
21
321
0
02 Oct 2018
NICE: Noise Injection and Clamping Estimation for Neural Network Quantization
Chaim Baskin
Natan Liss
Yoav Chai
Evgenii Zheltonozhskii
Eli Schwartz
Raja Giryes
A. Mendelson
A. Bronstein
MQ
11
60
0
29 Sep 2018
FermiNets: Learning generative machines to generate efficient neural networks via generative synthesis
A. Wong
M. Shafiee
Brendan Chwyl
Francis Li
8
64
0
17 Sep 2018
Hardware-Aware Machine Learning: Modeling and Optimization
Diana Marculescu
Dimitrios Stamoulis
E. Cai
19
45
0
14 Sep 2018
Discretely Relaxing Continuous Variables for tractable Variational Inference
Trefor W. Evans
P. Nair
BDL
52
0
0
12 Sep 2018
Discovering Low-Precision Networks Close to Full-Precision Networks for Efficient Embedded Inference
J. McKinstry
S. K. Esser
R. Appuswamy
Deepika Bablani
John V. Arthur
Izzet B. Yildiz
D. Modha
MQ
18
94
0
11 Sep 2018
DeepHunter: Hunting Deep Neural Network Defects via Coverage-Guided Fuzzing
Xiaofei Xie
Lei Ma
Felix Juefei Xu
Hongxu Chen
Minhui Xue
Bo-wen Li
Yang Liu
Jianjun Zhao
Jianxiong Yin
Simon See
43
40
0
04 Sep 2018
Training Compact Neural Networks with Binary Weights and Low Precision Activations
Bohan Zhuang
Chunhua Shen
Ian Reid
MQ
13
14
0
08 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
51
2,982
0
31 Jul 2018
Learning K-way D-dimensional Discrete Codes for Compact Embedding Representations
Ting-Li Chen
Martin Renqiang Min
Yizhou Sun
26
70
0
21 Jun 2018
Quantizing deep convolutional networks for efficient inference: A whitepaper
Raghuraman Krishnamoorthi
MQ
48
993
0
21 Jun 2018