ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1807.06964
  4. Cited By
Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)

Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)

17 July 2018
Jungwook Choi
P. Chuang
Zhuo Wang
Swagath Venkataramani
Vijayalakshmi Srinivasan
K. Gopalakrishnan
    MQ
ArXivPDFHTML

Papers citing "Bridging the Accuracy Gap for 2-bit Quantized Neural Networks (QNN)"

40 / 40 papers shown
Title
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile
  Devices
Realizing Unaligned Block-wise Pruning for DNN Acceleration on Mobile Devices
Hayun Lee
Dongkun Shin
MQ
28
0
0
29 Jul 2024
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution
2DQuant: Low-bit Post-Training Quantization for Image Super-Resolution
Kai Liu
Haotong Qin
Yong Guo
Xin Yuan
Linghe Kong
Guihai Chen
Yulun Zhang
MQ
35
5
0
10 Jun 2024
BOLD: Boolean Logic Deep Learning
BOLD: Boolean Logic Deep Learning
Van Minh Nguyen
Cristian Ocampo
Aymen Askri
Louis Leconte
Ba-Hien Tran
AI4CE
40
0
0
25 May 2024
CBQ: Cross-Block Quantization for Large Language Models
CBQ: Cross-Block Quantization for Large Language Models
Xin Ding
Xiaoyu Liu
Zhijun Tu
Yun-feng Zhang
Wei Li
...
Hanting Chen
Yehui Tang
Zhiwei Xiong
Baoqun Yin
Yunhe Wang
MQ
38
13
0
13 Dec 2023
Finding Interpretable Class-Specific Patterns through Efficient Neural
  Search
Finding Interpretable Class-Specific Patterns through Efficient Neural Search
Nils Philipp Walter
Jonas Fischer
Jilles Vreeken
20
4
0
07 Dec 2023
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures
  using Lookup Tables
DeepGEMM: Accelerated Ultra Low-Precision Inference on CPU Architectures using Lookup Tables
Darshan C. Ganji
Saad Ashfaq
Ehsan Saboori
Sudhakar Sah
Saptarshi Mitra
Mohammadhossein Askarihemmat
Alexander Hoffman
Ahmed Hassanien
Mathieu Léonardon
MQ
13
3
0
18 Apr 2023
AskewSGD : An Annealed interval-constrained Optimisation method to train
  Quantized Neural Networks
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
Emergent Quantized Communication
Emergent Quantized Communication
Boaz Carmeli
Ron Meir
Yonatan Belinkov
MQ
AI4CE
28
8
0
04 Nov 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed
  Low-Precision DNNs with Dynamic Fixed-Point Representation
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
20
7
0
22 Mar 2022
Minimum Variance Unbiased N:M Sparsity for the Neural Gradients
Minimum Variance Unbiased N:M Sparsity for the Neural Gradients
Brian Chmiel
Itay Hubara
Ron Banner
Daniel Soudry
23
10
0
21 Mar 2022
Accurate Neural Training with 4-bit Matrix Multiplications at Standard
  Formats
Accurate Neural Training with 4-bit Matrix Multiplications at Standard Formats
Brian Chmiel
Ron Banner
Elad Hoffer
Hilla Ben Yaacov
Daniel Soudry
MQ
33
23
0
19 Dec 2021
4-bit Quantization of LSTM-based Speech Recognition Models
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
26
21
0
27 Aug 2021
Quantization and Deployment of Deep Neural Networks on Microcontrollers
Quantization and Deployment of Deep Neural Networks on Microcontrollers
Pierre-Emmanuel Novac
G. B. Hacene
Alain Pegatoquet
Benoit Miramond
Vincent Gripon
MQ
22
116
0
27 May 2021
DNN Quantization with Attention
DNN Quantization with Attention
G. B. Hacene
Lukas Mauch
Stefan Uhlich
Fabien Cardinaux
MQ
11
2
0
24 Mar 2021
Exploring Neural Networks Quantization via Layer-Wise Quantization
  Analysis
Exploring Neural Networks Quantization via Layer-Wise Quantization Analysis
Shachar Gluska
Mark Grobman
MQ
19
5
0
15 Dec 2020
One Shot 3D Photography
One Shot 3D Photography
Johannes Kopf
Kevin Blackburn-Matzen
Suhib Alsisan
Ocean Quigley
Francis Ge
...
Peizhao Zhang
Zijian He
Peter Vajda
Ayush Saraf
Michael F. Cohen
28
79
0
27 Aug 2020
One Weight Bitwidth to Rule Them All
One Weight Bitwidth to Rule Them All
Ting-Wu Chin
P. Chuang
Vikas Chandra
Diana Marculescu
MQ
28
25
0
22 Aug 2020
Channel-wise Hessian Aware trace-Weighted Quantization of Neural
  Networks
Channel-wise Hessian Aware trace-Weighted Quantization of Neural Networks
Xu Qian
Victor Li
Darren Crews
MQ
24
9
0
19 Aug 2020
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
Eunhyeok Park
S. Yoo
MQ
10
84
0
11 Aug 2020
WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic
WrapNet: Neural Net Inference with Ultra-Low-Resolution Arithmetic
Renkun Ni
Hong-Min Chu
Oscar Castañeda
Ping Yeh-Chiang
Christoph Studer
Tom Goldstein
MQ
34
14
0
26 Jul 2020
Neural gradients are near-lognormal: improved quantized and sparse
  training
Neural gradients are near-lognormal: improved quantized and sparse training
Brian Chmiel
Liad Ben-Uri
Moran Shkolnik
Elad Hoffer
Ron Banner
Daniel Soudry
MQ
8
5
0
15 Jun 2020
A Learning Framework for n-bit Quantized Neural Networks toward FPGAs
A Learning Framework for n-bit Quantized Neural Networks toward FPGAs
Jun Chen
L. Liu
Yong Liu
Xianfang Zeng
MQ
41
26
0
06 Apr 2020
Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized
  Neural Networks
Propagating Asymptotic-Estimated Gradients for Low Bitwidth Quantized Neural Networks
Jun Chen
Yong Liu
Hao Zhang
Shengnan Hou
Jian Yang
MQ
25
7
0
04 Mar 2020
Least squares binary quantization of neural networks
Least squares binary quantization of neural networks
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
17
32
0
09 Jan 2020
Sparse Weight Activation Training
Sparse Weight Activation Training
Md Aamir Raihan
Tor M. Aamodt
34
73
0
07 Jan 2020
Additive Powers-of-Two Quantization: An Efficient Non-uniform
  Discretization for Neural Networks
Additive Powers-of-Two Quantization: An Efficient Non-uniform Discretization for Neural Networks
Yuhang Li
Xin Dong
Wei Wang
MQ
31
254
0
28 Sep 2019
CAT: Compression-Aware Training for bandwidth reduction
CAT: Compression-Aware Training for bandwidth reduction
Chaim Baskin
Brian Chmiel
Evgenii Zheltonozhskii
Ron Banner
A. Bronstein
A. Mendelson
MQ
22
10
0
25 Sep 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit
  Neural Networks
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
32
446
0
14 Aug 2019
Tuning Algorithms and Generators for Efficient Edge Inference
Tuning Algorithms and Generators for Efficient Edge Inference
R. Naous
Lazar Supic
Yoonhwan Kang
Ranko Seradejovic
Anish Singhani
Vladimir M. Stojanović
11
2
0
31 Jul 2019
Training large-scale ANNs on simulated resistive crossbar arrays
Training large-scale ANNs on simulated resistive crossbar arrays
Malte J. Rasch
Tayfun Gokmen
W. Haensch
11
12
0
06 Jun 2019
DeepShift: Towards Multiplication-Less Neural Networks
DeepShift: Towards Multiplication-Less Neural Networks
Mostafa Elhoushi
Zihao Chen
F. Shafiq
Ye Tian
Joey Yiwei Li
MQ
38
97
0
30 May 2019
Feature Map Transform Coding for Energy-Efficient CNN Inference
Feature Map Transform Coding for Energy-Efficient CNN Inference
Brian Chmiel
Chaim Baskin
Ron Banner
Evgenii Zheltonozhskii
Yevgeny Yermolin
Alex Karbachevsky
A. Bronstein
A. Mendelson
28
24
0
26 May 2019
Towards Efficient Model Compression via Learned Global Ranking
Towards Efficient Model Compression via Learned Global Ranking
Ting-Wu Chin
Ruizhou Ding
Cha Zhang
Diana Marculescu
16
170
0
28 Apr 2019
Towards Learning of Filter-Level Heterogeneous Compression of
  Convolutional Neural Networks
Towards Learning of Filter-Level Heterogeneous Compression of Convolutional Neural Networks
Y. Zur
Chaim Baskin
Evgenii Zheltonozhskii
Brian Chmiel
Itay Evron
A. Bronstein
A. Mendelson
MQ
37
7
0
22 Apr 2019
Learned Step Size Quantization
Learned Step Size Quantization
S. K. Esser
J. McKinstry
Deepika Bablani
R. Appuswamy
D. Modha
MQ
20
782
0
21 Feb 2019
Improving Neural Network Quantization without Retraining using Outlier
  Channel Splitting
Improving Neural Network Quantization without Retraining using Outlier Channel Splitting
Ritchie Zhao
Yuwei Hu
Jordan Dotzel
Christopher De Sa
Zhiru Zhang
OODD
MQ
50
305
0
28 Jan 2019
Precision Highway for Ultra Low-Precision Quantization
Precision Highway for Ultra Low-Precision Quantization
Eunhyeok Park
Dongyoung Kim
S. Yoo
Peter Vajda
MQ
AI4TS
21
12
0
24 Dec 2018
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision
  Neural Networks
SQuantizer: Simultaneous Learning for Both Sparse and Low-precision Neural Networks
M. Park
Xiaofang Xu
C. Brick
MQ
27
8
0
20 Dec 2018
On Periodic Functions as Regularizers for Quantization of Neural
  Networks
On Periodic Functions as Regularizers for Quantization of Neural Networks
Maxim Naumov
Utku Diril
Jongsoo Park
Benjamin Ray
Jedrzej Jablonski
Andrew Tulloch
MQ
13
25
0
24 Nov 2018
Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on
  Embedded FPGAs
Synetgy: Algorithm-hardware Co-design for ConvNet Accelerators on Embedded FPGAs
Yifan Yang
Qijing Huang
Bichen Wu
Tianjun Zhang
Liang Ma
...
Michaela Blott
Sebastiano Fabio Schifano
K. Vissers
J. Wawrzynek
Kurt Keutzer
24
113
0
21 Nov 2018
1