Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.00281
Cited By
ZeroQ: A Novel Zero Shot Quantization Framework
1 January 2020
Yaohui Cai
Z. Yao
Zhen Dong
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Github (279★)
Papers citing
"ZeroQ: A Novel Zero Shot Quantization Framework"
50 / 233 papers shown
Title
DynaMIX: Resource Optimization for DNN-Based Real-Time Applications on a Multi-Tasking System
Minkyoung Cho
Kang G. Shin
31
2
0
03 Feb 2023
PowerQuant: Automorphism Search for Non-Uniform Quantization
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
63
16
0
24 Jan 2023
ACQ: Improving Generative Data-free Quantization Via Attention Correction
Jixing Li
Xiaozhou Guo
Benzhe Dai
Guoliang Gong
Min Jin
Gang Chen
Wenyu Mao
Huaxiang Lu
MQ
80
4
0
18 Jan 2023
Guided Hybrid Quantization for Object detection in Multimodal Remote Sensing Imagery via One-to-one Self-teaching
Jiaqing Zhang
Jie Lei
Weiying Xie
Yunsong Li
Wenxuan Wang
MQ
84
23
0
31 Dec 2022
Hyperspherical Quantization: Toward Smaller and More Accurate Models
Dan Liu
X. Chen
Chen Ma
Xue Liu
MQ
53
3
0
24 Dec 2022
CSMPQ:Class Separability Based Mixed-Precision Quantization
Ming-Yu Wang
Taisong Jin
Miaohui Zhang
Zhengtao Yu
MQ
55
0
0
20 Dec 2022
Towards Hardware-Specific Automatic Compression of Neural Networks
Torben Krieger
Bernhard Klein
Holger Fröning
MQ
51
2
0
15 Dec 2022
PD-Quant: Post-Training Quantization based on Prediction Difference Metric
Jiawei Liu
Lin Niu
Zhihang Yuan
Dawei Yang
Xinggang Wang
Wenyu Liu
MQ
185
71
0
14 Dec 2022
Error-aware Quantization through Noise Tempering
Zheng Wang
Juncheng Billy Li
Shuhui Qu
Florian Metze
Emma Strubell
MQ
38
2
0
11 Dec 2022
Genie: Show Me the Data for Quantization
Yongkweon Jeon
Chungman Lee
Ho-Young Kim
MQ
109
13
0
09 Dec 2022
CSQ: Growing Mixed-Precision Quantization Scheme with Bi-level Continuous Sparsification
Lirui Xiao
Huanrui Yang
Zhen Dong
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
73
10
0
06 Dec 2022
Make RepVGG Greater Again: A Quantization-aware Approach
Xiangxiang Chu
Liang Li
Bo Zhang
MQ
124
51
0
03 Dec 2022
NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
Yijiang Liu
Huanrui Yang
Zhen Dong
Kurt Keutzer
Li Du
Shanghang Zhang
MQ
104
53
0
29 Nov 2022
Post-training Quantization on Diffusion Models
Yuzhang Shang
Zhihang Yuan
Bin Xie
Bingzhe Wu
Yan Yan
DiffM
MQ
151
182
0
28 Nov 2022
Zero-Shot Dynamic Quantization for Transformer Inference
Yousef El-Kurdi
Jerry Quinn
Avirup Sil
MQ
64
1
0
17 Nov 2022
Long-Range Zero-Shot Generative Deep Network Quantization
Yan Luo
Yangcheng Gao
Zhao Zhang
Haijun Zhang
Mingliang Xu
Meng Wang
MQ
83
10
0
13 Nov 2022
Realistic Bokeh Effect Rendering on Mobile GPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Jin Zhang
Feng Zhang
G. Yu
...
Mingyang Qian
Huixin Ma
Yanan Li
Xiaotao Wang
Lei Lei
68
10
0
07 Nov 2022
Power Efficient Video Super-Resolution on Mobile NPUs with Deep Learning, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Cheng-Ming Chiang
Hsien-Kai Kuo
Yu-Syuan Xu
...
Kele Xu
Li Liu
Zehua Cheng
Wenyi Lian
W. Lian
SupR
87
10
0
07 Nov 2022
Efficient and Accurate Quantized Image Super-Resolution on Mobile NPUs, Mobile AI & AIM 2022 challenge: Report
Andrey D. Ignatov
Radu Timofte
Maurizio Denna
Abdelbadie Younes
Ganzorig Gankhuyag
...
Jing Liu
Garas Gendy
Nabil Sabor
J. Hou
Guanghui He
SupR
MQ
92
32
0
07 Nov 2022
Efficient Single-Image Depth Estimation on Mobile Devices, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Grigory Malivenko
Radu Timofte
Lukasz Treszczotko
Xin-ke Chang
...
Dongwon Park
Seongmin Hong
Joonhee Lee
Seunggyu Lee
Sengsub Chun
82
17
0
07 Nov 2022
Learned Smartphone ISP on Mobile GPUs with Deep Learning, Mobile AI & AIM 2022 Challenge: Report
Andrey D. Ignatov
Radu Timofte
Shuai Liu
Chaoyu Feng
Furui Bai
...
Xin Lou
Wei Zhou
Cong Pang
Haina Qin
Mingxuan Cai
96
24
0
07 Nov 2022
Zero-Shot Learning of a Conditional Generative Adversarial Network for Data-Free Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
GAN
49
1
0
26 Oct 2022
Structural Pruning via Latency-Saliency Knapsack
Maying Shen
Hongxu Yin
Pavlo Molchanov
Lei Mao
Jianna Liu
J. Álvarez
100
50
0
13 Oct 2022
Synthetic Dataset Generation for Privacy-Preserving Machine Learning
Efstathia Soufleri
Gobinda Saha
Kaushik Roy
DD
124
3
0
06 Oct 2022
Outlier Suppression: Pushing the Limit of Low-bit Transformer Language Models
Xiuying Wei
Yunchen Zhang
Xiangguo Zhang
Ruihao Gong
Shanghang Zhang
Qi Zhang
F. Yu
Xianglong Liu
MQ
133
153
0
27 Sep 2022
Analysis of Quantization on MLP-based Vision Models
Lingran Zhao
Zhen Dong
Kurt Keutzer
MQ
64
7
0
14 Sep 2022
PSAQ-ViT V2: Towards Accurate and General Data-Free Quantization for Vision Transformers
Zhikai Li
Mengjuan Chen
Junrui Xiao
Qingyi Gu
ViT
MQ
123
35
0
13 Sep 2022
ANT: Exploiting Adaptive Numerical Data Type for Low-bit Deep Neural Network Quantization
Cong Guo
Chen Zhang
Jingwen Leng
Zihan Liu
Fan Yang
Yun-Bo Liu
Minyi Guo
Yuhao Zhu
MQ
83
60
0
30 Aug 2022
Efficient Adaptive Activation Rounding for Post-Training Quantization
Zhengyi Li
Cong Guo
Zhanda Zhu
Yangjie Zhou
Yuxian Qiu
Xiaotian Gao
Jingwen Leng
Minyi Guo
MQ
92
4
0
25 Aug 2022
SVD-NAS: Coupling Low-Rank Approximation and Neural Architecture Search
Zhewen Yu
C. Bouganis
44
5
0
22 Aug 2022
FP8 Quantization: The Power of the Exponent
Andrey Kuzmin
M. V. Baalen
Yuwei Ren
Markus Nagel
Jorn W. T. Peters
Tijmen Blankevoort
MQ
85
87
0
19 Aug 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
96
12
0
11 Aug 2022
Symmetry Regularization and Saturating Nonlinearity for Robust Quantization
Sein Park
Yeongsang Jang
Eunhyeok Park
MQ
57
2
0
31 Jul 2022
Bitwidth-Adaptive Quantization-Aware Neural Network Training: A Meta-Learning Approach
Jiseok Youn
Jaehun Song
Hyung-Sin Kim
S. Bahk
MQ
56
8
0
20 Jul 2022
ZeroQuant: Efficient and Affordable Post-Training Quantization for Large-Scale Transformers
Z. Yao
Reza Yazdani Aminabadi
Minjia Zhang
Xiaoxia Wu
Conglong Li
Yuxiong He
VLM
MQ
165
484
0
04 Jun 2022
Wavelet Feature Maps Compression for Image-to-Image CNNs
Shahaf E. Finder
Yair Zohav
Maor Ashkenazi
Eran Treister
105
22
0
24 May 2022
OPQ: Compressing Deep Neural Networks with One-shot Pruning-Quantization
Peng Hu
Xi Peng
Erik Cambria
M. Aly
Jie Lin
MQ
103
61
0
23 May 2022
UnrealNAS: Can We Search Neural Architectures with Unreal Data?
Zhen Dong
Kaichen Zhou
Ge Li
Qiang Zhou
Mingfei Guo
Guohao Li
Kurt Keutzer
Shanghang Zhang
50
0
0
04 May 2022
Towards Feature Distribution Alignment and Diversity Enhancement for Data-Free Quantization
Yangcheng Gao
Zhao Zhang
Richang Hong
Haijun Zhang
Jicong Fan
Shuicheng Yan
MQ
46
10
0
30 Apr 2022
RAPQ: Rescuing Accuracy for Power-of-Two Low-bit Post-training Quantization
Hongyi Yao
Pu Li
Jian Cao
Xiangcheng Liu
Chenying Xie
Bin Wang
MQ
95
12
0
26 Apr 2022
Enable Deep Learning on Mobile Devices: Methods, Systems, and Applications
Han Cai
Ji Lin
Chengyue Wu
Zhijian Liu
Haotian Tang
Hanrui Wang
Ligeng Zhu
Song Han
107
115
0
25 Apr 2022
Data-Free Quantization with Accurate Activation Clipping and Adaptive Batch Normalization
Yefei He
Luoming Zhang
Weijia Wu
Hong Zhou
MQ
83
2
0
08 Apr 2022
Intelligence at the Extreme Edge: A Survey on Reformable TinyML
Visal Rajapakse
Ishan Karunanayake
Nadeem Ahmed
SyDa
87
57
0
02 Apr 2022
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
Kanghyun Choi
Hye Yoon Lee
Deokki Hong
Joonsang Yu
Noseong Park
Youngsok Kim
Jinho Lee
MQ
101
33
0
31 Mar 2022
REx: Data-Free Residual Quantization Error Expansion
Edouard Yvinec
Arnaud Dapgony
Matthieu Cord
Kévin Bailly
MQ
88
8
0
28 Mar 2022
SPIQ: Data-Free Per-Channel Static Input Quantization
Edouard Yvinec
Arnaud Dapogny
Matthieu Cord
Kévin Bailly
MQ
52
21
0
28 Mar 2022
GradViT: Gradient Inversion of Vision Transformers
Ali Hatamizadeh
Hongxu Yin
H. Roth
Wenqi Li
Jan Kautz
Daguang Xu
Pavlo Molchanov
ViT
177
65
0
22 Mar 2022
QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization
Xiuying Wei
Ruihao Gong
Yuhang Li
Xianglong Liu
F. Yu
MQ
VLM
98
178
0
11 Mar 2022
Patch Similarity Aware Data-Free Quantization for Vision Transformers
Zhikai Li
Liping Ma
Mengjuan Chen
Junrui Xiao
Qingyi Gu
MQ
ViT
113
46
0
04 Mar 2022
Comprehensive Analysis of the Object Detection Pipeline on UAVs
Leon Amadeus Varga
Sebastian Koch
A. Zell
35
5
0
01 Mar 2022
Previous
1
2
3
4
5
Next