Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1902.06822
Cited By
Low-bit Quantization of Neural Networks for Efficient Inference
18 February 2019
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Low-bit Quantization of Neural Networks for Efficient Inference"
32 / 182 papers shown
Title
A Tiny CNN Architecture for Medical Face Mask Detection for Resource-Constrained Endpoints
P. Mohan
A. Paul
Abhay Chirania
CVBM
24
48
0
30 Nov 2020
A Statistical Framework for Low-bitwidth Training of Deep Neural Networks
Jianfei Chen
Yujie Gai
Z. Yao
Michael W. Mahoney
Joseph E. Gonzalez
MQ
14
58
0
27 Oct 2020
ResNet-like Architecture with Low Hardware Requirements
E. Limonova
D. Alfonso
D. Nikolaev
V. Arlazarov
27
15
0
15 Sep 2020
Channel-wise Hessian Aware trace-Weighted Quantization of Neural Networks
Xu Qian
Victor Li
Darren Crews
MQ
24
9
0
19 Aug 2020
AUSN: Approximately Uniform Quantization by Adaptively Superimposing Non-uniform Distribution for Deep Neural Networks
Fangxin Liu
Wenbo Zhao
Yanzhi Wang
Changzhi Dai
Li Jiang
MQ
25
3
0
08 Jul 2020
Rethinking Bottleneck Structure for Efficient Mobile Network Design
Zhou Daquan
Qibin Hou
Yunpeng Chen
Jiashi Feng
Shuicheng Yan
26
197
0
05 Jul 2020
EasyQuant: Post-training Quantization via Scale Optimization
Di Wu
Qingming Tang
Yongle Zhao
Ming Zhang
Ying Fu
Debing Zhang
MQ
30
75
0
30 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
35
122
0
14 Jun 2020
Neural Network Activation Quantization with Bitwise Information Bottlenecks
Xichuan Zhou
Kui Liu
Cong Shi
Haijun Liu
Ji Liu
MQ
27
1
0
09 Jun 2020
Position-based Scaled Gradient for Model Quantization and Pruning
Jangho Kim
Kiyoon Yoo
Nojun Kwak
MQ
16
7
0
22 May 2020
Up or Down? Adaptive Rounding for Post-Training Quantization
Markus Nagel
Rana Ali Amjad
M. V. Baalen
Christos Louizos
Tijmen Blankevoort
MQ
10
553
0
22 Apr 2020
LSQ+: Improving low-bit quantization through learnable offsets and better initialization
Yash Bhalgat
Jinwon Lee
Markus Nagel
Tijmen Blankevoort
Nojun Kwak
MQ
20
212
0
20 Apr 2020
Non-Blocking Simultaneous Multithreading: Embracing the Resiliency of Deep Neural Networks
Gil Shomron
U. Weiser
8
14
0
17 Apr 2020
Post-training Quantization with Multiple Points: Mixed Precision without Mixed Precision
Xingchao Liu
Mao Ye
Dengyong Zhou
Qiang Liu
MQ
13
42
0
20 Feb 2020
Robust Quantization: One Model to Rule Them All
Moran Shkolnik
Brian Chmiel
Ron Banner
Gil Shomron
Yury Nahshan
A. Bronstein
U. Weiser
OOD
MQ
19
75
0
18 Feb 2020
Gradient
ℓ
1
\ell_1
ℓ
1
Regularization for Quantization Robustness
Milad Alizadeh
Arash Behboodi
M. V. Baalen
Christos Louizos
Tijmen Blankevoort
Max Welling
MQ
12
8
0
18 Feb 2020
A Framework for Semi-Automatic Precision and Accuracy Analysis for Fast and Rigorous Deep Learning
C. Lauter
Anastasia Volkova
9
10
0
10 Feb 2020
Photonic tensor cores for machine learning
M. Miscuglio
V. Sorger
19
147
0
01 Feb 2020
Post-Training Piecewise Linear Quantization for Deep Neural Networks
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
17
144
0
31 Jan 2020
ZeroQ: A Novel Zero Shot Quantization Framework
Yaohui Cai
Z. Yao
Zhen Dong
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
35
389
0
01 Jan 2020
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
31
163
0
17 Nov 2019
Post-Training 4-bit Quantization on Embedding Tables
Hui Guan
Andrey Malevich
Jiyan Yang
Jongsoo Park
Hector Yuen
MQ
13
32
0
05 Nov 2019
Bipolar Morphological Neural Networks: Convolution Without Multiplication
E. Limonova
D. Matveev
D. Nikolaev
V. Arlazarov
19
12
0
05 Nov 2019
Ternary MobileNets via Per-Layer Hybrid Filter Banks
Dibakar Gope
Jesse G. Beu
Urmish Thakker
Matthew Mattina
MQ
29
15
0
04 Nov 2019
Memory Requirement Reduction of Deep Neural Networks Using Low-bit Quantization of Parameters
Niccoló Nicodemo
Gaurav Naithani
K. Drossos
Tuomas Virtanen
R. Saletti
MQ
6
1
0
01 Nov 2019
Neural Epitome Search for Architecture-Agnostic Network Compression
Daquan Zhou
Xiaojie Jin
Qibin Hou
Kaixin Wang
Jianchao Yang
Jiashi Feng
26
13
0
12 Jul 2019
Fighting Quantization Bias With Bias
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
25
56
0
07 Jun 2019
Memory-Driven Mixed Low Precision Quantization For Enabling Deep Network Inference On Microcontrollers
Manuele Rusci
Alessandro Capotondi
Luca Benini
MQ
21
74
0
30 May 2019
Feature Map Transform Coding for Energy-Efficient CNN Inference
Brian Chmiel
Chaim Baskin
Ron Banner
Evgenii Zheltonozhskii
Yevgeny Yermolin
Alex Karbachevsky
A. Bronstein
A. Mendelson
25
24
0
26 May 2019
AutoQ: Automated Kernel-Wise Neural Network Quantization
Qian Lou
Feng Guo
Lantao Liu
Minje Kim
Lei Jiang
MQ
21
97
0
15 Feb 2019
DSConv: Efficient Convolution Operator
Marcelo Gennari
Roger Fawcett
V. Prisacariu
MQ
29
62
0
07 Jan 2019
Understanding Neural Networks and Individual Neuron Importance via Information-Ordered Cumulative Ablation
Rana Ali Amjad
Kairen Liu
Bernhard C. Geiger
FAtt
19
18
0
18 Apr 2018
Previous
1
2
3
4