1807.00301
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
1 July 2018
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
Papers citing "SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks"
50 / 66 papers shown
Title
Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners
Yifei Gao
Jie Ou
Lei Wang
Fanhua Shang
Jaji Wu
MQ
49
0
0
22 Jul 2024
LLMEasyQuant: Scalable Quantization for Parallel and Distributed LLM Inference
Dong Liu
Meng Jiang
MQ
38
12
0
28 Jun 2024
Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other
Yifei Gao
Jie Ou
Lei Wang
Yuting Xiao
Zhiyuan Xiang
Ruiting Dai
Jun Cheng
MQ
36
3
0
24 Jun 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
33
1
0
09 May 2024
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
Mengfei Ji
Yuchun Chang
Baolin Zhang
Zaid Al-Ars
19
0
0
04 Mar 2024
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Wei Huang
Yangdong Liu
Haotong Qin
Ying Li
Shiming Zhang
Xianglong Liu
Michele Magno
Xiaojuan Qi
MQ
82
69
0
06 Feb 2024
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
22
0
0
04 Jul 2023
MobileNMT: Enabling Translation in 15MB and 30ms
Ye Lin
Xiaohui Wang
Zhexi Zhang
Mingxuan Wang
Tong Xiao
Jingbo Zhu
MQ
30
1
0
07 Jun 2023
Ternary Quantization: A Survey
Danyang Liu
Xue Liu
MQ
26
3
0
02 Mar 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
21
1
0
15 Jan 2023
A Comprehensive Survey of Dataset Distillation
Shiye Lei
Dacheng Tao
DD
31
88
0
13 Jan 2023
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
25
11
0
11 Aug 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
36
11
0
06 Apr 2022
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
Kanghyun Choi
Hye Yoon Lee
Deokki Hong
Joonsang Yu
Noseong Park
Youngsok Kim
Jinho Lee
MQ
38
31
0
31 Mar 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
20
6
0
22 Mar 2022
LDP: Learnable Dynamic Precision for Efficient Deep Neural Network Training and Inference
Zhongzhi Yu
Y. Fu
Shang Wu
Mengquan Li
Haoran You
Yingyan Lin
28
1
0
15 Mar 2022
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks
Cheng Gong
Ye Lu
Kunpeng Xie
Zongming Jin
Tao Li
Yanzhi Wang
MQ
27
7
0
08 Sep 2021
Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration
Qigong Sun
Xiufang Li
Fanhua Shang
Hongying Liu
Kan Yang
L. Jiao
Zhouchen Lin
MQ
31
0
0
18 Jun 2021
ARC: A Vision-based Automatic Retail Checkout System
Syed Talha Bukhari
Abdul Wahab Amin
Muhammad Naveed
Muhammad Rzi Abbas
20
6
0
07 Apr 2021
FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation
Chaofan Tao
Rui Lin
Quan Chen
Zhaoyang Zhang
Ping Luo
Ngai Wong
MQ
28
7
0
15 Feb 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
Improving Accuracy of Binary Neural Networks using Unbalanced Activation Distribution
Hyungjun Kim
Jihoon Park
Chang-Ho Lee
Jae-Joon Kim
MQ
20
30
0
02 Dec 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale
Huixia Li
Chenqian Yan
Shaohui Lin
Xiawu Zheng
Yuchao Li
Baochang Zhang
Fan Yang
Rongrong Ji
MQ
25
84
0
09 Nov 2020
FTBNN: Rethinking Non-linearity for 1-bit CNNs and Going Beyond
Z. Su
Linpu Fang
Deke Guo
Duwen Hu
M. Pietikäinen
Li Liu
MQ
16
3
0
19 Oct 2020
High-Capacity Expert Binary Networks
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
27
57
0
07 Oct 2020
Searching for Low-Bit Weights in Quantized Neural Networks
Zhaohui Yang
Yunhe Wang
Kai Han
Chunjing Xu
Chao Xu
Dacheng Tao
Chang Xu
MQ
25
82
0
18 Sep 2020
QuantNet: Learning to Quantize by Learning within Fully Differentiable Framework
Junjie Liu
Dongchao Wen
Deyu Wang
Wei Tao
Tse-Wei Chen
Kinya Osa
Masami Kato
MQ
20
3
0
10 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
29
69
0
02 Sep 2020
Towards Lossless Binary Convolutional Neural Networks Using Piecewise Approximation
Baozhou Zhu
Zaid Al-Ars
Wei Pan
MQ
22
8
0
08 Aug 2020
NASB: Neural Architecture Search for Binary Convolutional Neural Networks
Baozhou Zhu
Zaid Al-Ars
P. Hofstee
MQ
24
23
0
08 Aug 2020
T-Basis: a Compact Representation for Neural Networks
Anton Obukhov
M. Rakhuba
Stamatios Georgoulis
Menelaos Kanakis
Dengxin Dai
Luc Van Gool
39
27
0
13 Jul 2020
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
50
458
0
31 Mar 2020
Training Binary Neural Networks with Real-to-Binary Convolutions
Brais Martínez
Jing Yang
Adrian Bulat
Georgios Tzimiropoulos
MQ
17
226
0
25 Mar 2020
Kernel Quantization for Efficient Network Compression
Zhongzhi Yu
Yemin Shi
Tiejun Huang
Yizhou Yu
MQ
31
3
0
11 Mar 2020
ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions
Zechun Liu
Zhiqiang Shen
Marios Savvides
Kwang-Ting Cheng
MQ
33
348
0
07 Mar 2020
Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks
Jun Chen
Yong Liu
Hao Zhang
Shengnan Hou
Jian Yang
MQ
25
7
0
04 Mar 2020
BATS: Binary ArchitecTure Search
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
25
67
0
03 Mar 2020
Learned Threshold Pruning
K. Azarian
Yash Bhalgat
Jinwon Lee
Tijmen Blankevoort
MQ
28
38
0
28 Feb 2020
SYMOG: learning symmetric mixture of Gaussian modes for improved fixed-point quantization
Lukas Enderich
Fabian Timm
Wolfram Burgard
MQ
22
6
0
19 Feb 2020
Post-Training Piecewise Linear Quantization for Deep Neural Networks
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
17
144
0
31 Jan 2020
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
Joseph Bethge
Christian Bartz
Haojin Yang
Ying-Cong Chen
Christoph Meinel
MQ
25
91
0
16 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
17
32
0
09 Jan 2020
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
36
53
0
18 Dec 2019
Structured Multi-Hashing for Model Compression
Elad Eban
Yair Movshovitz-Attias
Hao Wu
Mark Sandler
Andrew Poon
Yerlan Idelbayev
M. A. Carreira-Perpiñán
17
18
0
25 Nov 2019
AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
Julian Faraone
M. Kumm
M. Hardieck
P. Zipf
Xueyuan Liu
David Boland
Philip H. W. Leong
MQ
6
45
0
19 Nov 2019
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
31
163
0
17 Nov 2019
XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat
Georgios Tzimiropoulos
MQ
39
200
0
30 Sep 2019
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
22
17
0
22 Sep 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019