ResearchTrend.AI
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks


1 July 2018
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
    MQ
arXiv: 1807.00301

Papers citing "SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks"

50 / 66 papers shown
Compensate Quantization Errors+: Quantized Models Are Inquisitive Learners
Yifei Gao
Jie Ou
Lei Wang
Fanhua Shang
Jaji Wu
MQ
49
0
0
22 Jul 2024
LLMEasyQuant: Scalable Quantization for Parallel and Distributed LLM Inference
Dong Liu
Meng Jiang
MQ
38
12
0
28 Jun 2024
Compensate Quantization Errors: Make Weights Hierarchical to Compensate Each Other
Yifei Gao
Jie Ou
Lei Wang
Yuting Xiao
Zhiyuan Xiang
Ruiting Dai
Jun Cheng
MQ
36
3
0
24 Jun 2024
From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks
Xue Geng
Zhe Wang
Chunyun Chen
Qing Xu
Kaixin Xu
...
Zhenghua Chen
M. Aly
Jie Lin
Min-man Wu
Xiaoli Li
33
1
0
09 May 2024
NASH: Neural Architecture Search for Hardware-Optimized Machine Learning Models
Mengfei Ji
Yuchun Chang
Baolin Zhang
Zaid Al-Ars
19
0
0
04 Mar 2024
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs
Wei Huang
Yangdong Liu
Haotong Qin
Ying Li
Shiming Zhang
Xianglong Liu
Michele Magno
Xiaojuan Qi
MQ
82
69
0
06 Feb 2024
Learning Discrete Weights and Activations Using the Local Reparameterization Trick
G. Berger
Aviv Navon
Ethan Fetaya
MQ
22
0
0
04 Jul 2023
MobileNMT: Enabling Translation in 15MB and 30ms
Ye Lin
Xiaohui Wang
Zhexi Zhang
Mingxuan Wang
Tong Xiao
Jingbo Zhu
MQ
30
1
0
07 Jun 2023
Ternary Quantization: A Survey
Danyang Liu
Xue Liu
MQ
26
3
0
02 Mar 2023
RedBit: An End-to-End Flexible Framework for Evaluating the Accuracy of Quantized CNNs
A. M. Ribeiro-dos-Santos
João Dinis Ferreira
O. Mutlu
G. Falcão
MQ
21
1
0
15 Jan 2023
A Comprehensive Survey of Dataset Distillation
Shiye Lei
Dacheng Tao
DD
31
88
0
13 Jan 2023
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
25
11
0
11 Aug 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
36
11
0
06 Apr 2022
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher
Kanghyun Choi
Hye Yoon Lee
Deokki Hong
Joonsang Yu
Noseong Park
Youngsok Kim
Jinho Lee
MQ
38
31
0
31 Mar 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
20
6
0
22 Mar 2022
LDP: Learnable Dynamic Precision for Efficient Deep Neural Network Training and Inference
Zhongzhi Yu
Y. Fu
Shang Wu
Mengquan Li
Haoran You
Yingyan Lin
28
1
0
15 Mar 2022
Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks
Cheng Gong
Ye Lu
Kunpeng Xie
Zongming Jin
Tao Li
Yanzhi Wang
MQ
27
7
0
08 Sep 2021
Quantized Neural Networks via {-1, +1} Encoding Decomposition and Acceleration
Qigong Sun
Xiufang Li
Fanhua Shang
Hongying Liu
Kan Yang
L. Jiao
Zhouchen Lin
MQ
31
0
0
18 Jun 2021
ARC: A Vision-based Automatic Retail Checkout System
Syed Talha Bukhari
Abdul Wahab Amin
Muhammad Naveed
Muhammad Rzi Abbas
20
6
0
07 Apr 2021
FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation
Chaofan Tao
Rui Lin
Quan Chen
Zhaoyang Zhang
Ping Luo
Ngai Wong
MQ
28
7
0
15 Feb 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
Improving Accuracy of Binary Neural Networks using Unbalanced Activation Distribution
Hyungjun Kim
Jihoon Park
Chang-Ho Lee
Jae-Joon Kim
MQ
20
30
0
02 Dec 2020
PAMS: Quantized Super-Resolution via Parameterized Max Scale
Huixia Li
Chenqian Yan
Shaohui Lin
Xiawu Zheng
Yuchao Li
Baochang Zhang
Fan Yang
Rongrong Ji
MQ
25
84
0
09 Nov 2020
FTBNN: Rethinking Non-linearity for 1-bit CNNs and Going Beyond
Z. Su
Linpu Fang
Deke Guo
Duwen Hu
M. Pietikäinen
Li Liu
MQ
16
3
0
19 Oct 2020
High-Capacity Expert Binary Networks
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
27
57
0
07 Oct 2020
Searching for Low-Bit Weights in Quantized Neural Networks
Zhaohui Yang
Yunhe Wang
Kai Han
Chunjing Xu
Chao Xu
Dacheng Tao
Chang Xu
MQ
25
82
0
18 Sep 2020
QuantNet: Learning to Quantize by Learning within Fully Differentiable Framework
Junjie Liu
Dongchao Wen
Deyu Wang
Wei Tao
Tse-Wei Chen
Kinya Osa
Masami Kato
MQ
20
3
0
10 Sep 2020
Transform Quantization for CNN (Convolutional Neural Network) Compression
Sean I. Young
Wang Zhe
David S. Taubman
B. Girod
MQ
29
69
0
02 Sep 2020
Towards Lossless Binary Convolutional Neural Networks Using Piecewise Approximation
Baozhou Zhu
Zaid Al-Ars
Wei Pan
MQ
22
8
0
08 Aug 2020
NASB: Neural Architecture Search for Binary Convolutional Neural Networks
Baozhou Zhu
Zaid Al-Ars
P. Hofstee
MQ
24
23
0
08 Aug 2020
T-Basis: a Compact Representation for Neural Networks
Anton Obukhov
M. Rakhuba
Stamatios Georgoulis
Menelaos Kanakis
Dengxin Dai
Luc Van Gool
39
27
0
13 Jul 2020
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
50
458
0
31 Mar 2020
Training Binary Neural Networks with Real-to-Binary Convolutions
Brais Martínez
Jing Yang
Adrian Bulat
Georgios Tzimiropoulos
MQ
17
226
0
25 Mar 2020
Kernel Quantization for Efficient Network Compression
Zhongzhi Yu
Yemin Shi
Tiejun Huang
Yizhou Yu
MQ
31
3
0
11 Mar 2020
ReActNet: Towards Precise Binary Neural Network with Generalized Activation Functions
Zechun Liu
Zhiqiang Shen
Marios Savvides
Kwang-Ting Cheng
MQ
33
348
0
07 Mar 2020
Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks
Jun Chen
Yong Liu
Hao Zhang
Shengnan Hou
Jian Yang
MQ
25
7
0
04 Mar 2020
BATS: Binary ArchitecTure Search
Adrian Bulat
Brais Martínez
Georgios Tzimiropoulos
MQ
25
67
0
03 Mar 2020
Learned Threshold Pruning
K. Azarian
Yash Bhalgat
Jinwon Lee
Tijmen Blankevoort
MQ
28
38
0
28 Feb 2020
SYMOG: learning symmetric mixture of Gaussian modes for improved fixed-point quantization
Lukas Enderich
Fabian Timm
Wolfram Burgard
MQ
22
6
0
19 Feb 2020
Post-Training Piecewise Linear Quantization for Deep Neural Networks
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
17
144
0
31 Jan 2020
MeliusNet: Can Binary Neural Networks Achieve MobileNet-level Accuracy?
Joseph Bethge
Christian Bartz
Haojin Yang
Ying-Cong Chen
Christoph Meinel
MQ
25
91
0
16 Jan 2020
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
17
32
0
09 Jan 2020
Adaptive Loss-aware Quantization for Multi-bit Networks
Zhongnan Qu
Zimu Zhou
Yun Cheng
Lothar Thiele
MQ
36
53
0
18 Dec 2019
Structured Multi-Hashing for Model Compression
Elad Eban
Yair Movshovitz-Attias
Hao Wu
Mark Sandler
Andrew Poon
Yerlan Idelbayev
M. A. Carreira-Perpiñán
17
18
0
25 Nov 2019
AddNet: Deep Neural Networks Using FPGA-Optimized Multipliers
Julian Faraone
M. Kumm
M. Hardieck
P. Zipf
Xueyuan Liu
David Boland
Philip H. W. Leong
MQ
6
45
0
19 Nov 2019
Loss Aware Post-training Quantization
Yury Nahshan
Brian Chmiel
Chaim Baskin
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
31
163
0
17 Nov 2019
XNOR-Net++: Improved Binary Neural Networks
Adrian Bulat
Georgios Tzimiropoulos
MQ
39
200
0
30 Sep 2019
Structured Binary Neural Networks for Image Recognition
Bohan Zhuang
Chunhua Shen
Mingkui Tan
Peng Chen
Lingqiao Liu
Ian Reid
MQ
22
17
0
22 Sep 2019
GDRQ: Group-based Distribution Reshaping for Quantization
Haibao Yu
Tuopu Wen
Guangliang Cheng
Jiankai Sun
Qi Han
Jianping Shi
MQ
33
3
0
05 Aug 2019