Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2103.14949
Cited By
Automated Backend-Aware Post-Training Quantization
27 March 2021
Ziheng Jiang
Animesh Jain
An Liu
Josh Fromm
Chengqian Ma
Tianqi Chen
Luis Ceze
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Automated Backend-Aware Post-Training Quantization"
16 / 16 papers shown
Title
Efficient Execution of Quantized Deep Learning Models: A Compiler Approach
Animesh Jain
Shoubhik Bhattacharya
Masahiro Masuda
Vin Sharma
Yida Wang
MQ
56
33
0
18 Jun 2020
MLIR: A Compiler Infrastructure for the End of Moore's Law
Chris Lattner
M. Amini
Uday Bondhugula
Albert Cohen
Andy Davis
J. Pienaar
River Riddle
T. Shpeisman
Nicolas Vasilache
O. Zinenko
VLM
34
249
0
25 Feb 2020
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
81
17,950
0
28 May 2019
Low-bit Quantization of Neural Networks for Efficient Inference
Yoni Choukroun
Eli Kravchik
Fan Yang
P. Kisilev
MQ
44
358
0
18 Feb 2019
Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorization
Eldad Meller
Alexander Finkelstein
Uri Almog
Mark Grobman
MQ
34
85
0
05 Feb 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
882
93,936
0
11 Oct 2018
Learning to Optimize Tensor Programs
Tianqi Chen
Lianmin Zheng
Eddie Q. Yan
Ziheng Jiang
T. Moreau
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
53
396
0
21 May 2018
TVM: An Automated End-to-End Optimizing Compiler for Deep Learning
Tianqi Chen
T. Moreau
Ziheng Jiang
Lianmin Zheng
Eddie Q. Yan
...
Leyuan Wang
Yuwei Hu
Luis Ceze
Carlos Guestrin
Arvind Krishnamurthy
121
374
0
12 Feb 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
129
19,124
0
13 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
122
3,090
0
15 Dec 2017
Binarized Neural Networks
Itay Hubara
Daniel Soudry
Ran El-Yaniv
MQ
117
1,349
0
08 Feb 2016
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.3K
192,638
0
10 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
433
27,231
0
02 Dec 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
189
8,793
0
01 Oct 2015
Distilling the Knowledge in a Neural Network
Geoffrey E. Hinton
Oriol Vinyals
J. Dean
FedML
198
19,448
0
09 Mar 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
860
99,991
0
04 Sep 2014
1