ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1808.04752
  4. Cited By
A Survey on Methods and Theories of Quantized Neural Networks
v1v2 (latest)

A Survey on Methods and Theories of Quantized Neural Networks

13 August 2018
Yunhui Guo
    MQ
ArXiv (abs)PDFHTML

Papers citing "A Survey on Methods and Theories of Quantized Neural Networks"

50 / 59 papers shown
Title
Automatic mixed precision for optimizing gained time with constrained loss mean-squared-error based on model partition to sequential sub-graphs
Automatic mixed precision for optimizing gained time with constrained loss mean-squared-error based on model partition to sequential sub-graphs
Shmulik Markovich-Golan
Daniel Ohayon
Itay Niv
Yair Hanani
MQ
102
0
0
19 May 2025
Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs
Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs
Lucas Maisonnave
Cyril Moineau
Olivier Bichler
Fabrice Rastello
MQ
79
0
0
18 Apr 2025
Unsupervised detection of semantic correlations in big data
Unsupervised detection of semantic correlations in big data
Santiago Acevedo
Alex Rodriguez
Alessandro Laio
103
2
0
04 Nov 2024
Loss-aware Weight Quantization of Deep Networks
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
82
127
0
23 Feb 2018
Model compression via distillation and quantization
Model compression via distillation and quantization
A. Polino
Razvan Pascanu
Dan Alistarh
MQ
83
731
0
15 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
65
391
0
13 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space
  Encoding for Resource-Constrained Internet-of-Things Platforms
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms
J. Ko
Taesik Na
M. Amir
Saibal Mukhopadhyay
54
150
0
11 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks
Alternating Multi-bit Quantization for Recurrent Neural Networks
Chen Xu
Jianqiang Yao
Zhouchen Lin
Wenwu Ou
Yuanbin Cao
Zhirong Wang
H. Zha
MQ
76
116
0
01 Feb 2018
Adaptive Quantization for Deep Neural Network
Adaptive Quantization for Deep Neural Network
Yiren Zhou
Seyed-Mohsen Moosavi-Dezfooli
Ngai-Man Cheung
P. Frossard
MQ
67
183
0
04 Dec 2017
Towards Accurate Binary Convolutional Neural Network
Towards Accurate Binary Convolutional Neural Network
Xiaofan Lin
Cong Zhao
Wei Pan
MQ
84
644
0
30 Nov 2017
Minimum Energy Quantized Neural Networks
Minimum Energy Quantized Neural Networks
Bert Moons
Koen Goetschalckx
Nick Van Berckelaer
Marian Verhelst
MQ
56
123
0
01 Nov 2017
Learning Discrete Weights Using the Local Reparameterization Trick
Learning Discrete Weights Using the Local Reparameterization Trick
Oran Shayer
Dan Levi
Ethan Fetaya
43
88
0
21 Oct 2017
WRPN: Wide Reduced-Precision Networks
WRPN: Wide Reduced-Precision Networks
Asit K. Mishra
Eriko Nurvitadhi
Jeffrey J. Cook
Debbie Marr
MQ
78
267
0
04 Sep 2017
Recent Trends in Deep Learning Based Natural Language Processing
Recent Trends in Deep Learning Based Natural Language Processing
Tom Young
Devamanyu Hazarika
Soujanya Poria
Min Zhang
75
2,835
0
09 Aug 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM
Cong Leng
Hao Li
Shenghuo Zhu
Rong Jin
MQ
63
287
0
24 Jul 2017
Model compression as constrained optimization, with application to
  neural nets. Part II: quantization
Model compression as constrained optimization, with application to neural nets. Part II: quantization
M. A. Carreira-Perpiñán
Yerlan Idelbayev
MQ
53
37
0
13 Jul 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of
  Convolutional Neural Networks
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks
Denis A. Gudovskiy
Luca Rigazio
MQ
96
53
0
07 Jun 2017
Training Quantized Nets: A Deeper Understanding
Training Quantized Nets: A Deeper Understanding
Hao Li
Soham De
Zheng Xu
Christoph Studer
H. Samet
Tom Goldstein
MQ
53
211
0
07 Jun 2017
Network Sketching: Exploiting Binary Structure in Deep CNNs
Network Sketching: Exploiting Binary Structure in Deep CNNs
Yiwen Guo
Anbang Yao
Hao Zhao
Yurong Chen
MQ
69
95
0
07 Jun 2017
GXNOR-Net: Training deep neural networks with ternary weights and
  activations without full-precision memory under a unified discretization
  framework
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework
Lei Deng
Peng Jiao
Jing Pei
Zhenzhi Wu
Guoqi Li
MQ
55
20
0
25 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep
  Learning
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning
W. Wen
Cong Xu
Feng Yan
Chunpeng Wu
Yandan Wang
Yiran Chen
Hai Helen Li
140
989
0
22 May 2017
The High-Dimensional Geometry of Binary Neural Networks
The High-Dimensional Geometry of Binary Neural Networks
Alexander G. Anderson
C. P. Berg
MQ
62
76
0
19 May 2017
Ternary Neural Networks with Fine-Grained Quantization
Ternary Neural Networks with Fine-Grained Quantization
Naveen Mellempudi
Abhisek Kundu
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
MQ
94
111
0
02 May 2017
Incremental Network Quantization: Towards Lossless CNNs with
  Low-Precision Weights
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
395
1,051
0
10 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
136
505
0
03 Feb 2017
Variational Dropout Sparsifies Deep Neural Networks
Variational Dropout Sparsifies Deep Neural Networks
Dmitry Molchanov
Arsenii Ashukha
Dmitry Vetrov
BDL
144
830
0
19 Jan 2017
Towards the Limit of Network Quantization
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
50
194
0
05 Dec 2016
Trained Ternary Quantization
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
137
1,035
0
04 Dec 2016
Training Bit Fully Convolutional Network for Fast Semantic Segmentation
Training Bit Fully Convolutional Network for Fast Semantic Segmentation
He Wen
Shuchang Zhou
Zhe Liang
Yuxiang Zhang
Dieqiao Feng
Xinyu Zhou
Cong Yao
MQSSeg
71
10
0
01 Dec 2016
Effective Quantization Methods for Recurrent Neural Networks
Effective Quantization Methods for Recurrent Neural Networks
Qinyao He
He Wen
Shuchang Zhou
Yuxin Wu
Cong Yao
Xinyu Zhou
Yuheng Zou
MQ
63
75
0
30 Nov 2016
Sigma Delta Quantized Networks
Sigma Delta Quantized Networks
Peter O'Connor
Max Welling
57
48
0
07 Nov 2016
Fixed-point Factorized Networks
Fixed-point Factorized Networks
Peisong Wang
Jian Cheng
MQ
40
43
0
07 Nov 2016
Loss-aware Binarization of Deep Networks
Loss-aware Binarization of Deep Networks
Lu Hou
Quanming Yao
James T. Kwok
MQ
70
220
0
05 Nov 2016
Federated Learning: Strategies for Improving Communication Efficiency
Federated Learning: Strategies for Improving Communication Efficiency
Jakub Konecný
H. B. McMahan
Felix X. Yu
Peter Richtárik
A. Suresh
Dave Bacon
FedML
306
4,646
0
18 Oct 2016
Recurrent Neural Networks With Limited Numerical Precision
Recurrent Neural Networks With Limited Numerical Precision
Joachim Ott
Zhouhan Lin
Yanzhe Zhang
Shih-Chii Liu
Yoshua Bengio
MQ
74
77
0
24 Aug 2016
Overcoming Challenges in Fixed Point Training of Deep Convolutional
  Networks
Overcoming Challenges in Fixed Point Training of Deep Convolutional Networks
D. Lin
S. Talathi
67
45
0
08 Jul 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low
  Bitwidth Gradients
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
MQ
122
2,088
0
20 Jun 2016
Deep neural networks are robust to weight binarization and other
  non-linear distortions
Deep neural networks are robust to weight binarization and other non-linear distortions
P. Merolla
R. Appuswamy
John V. Arthur
S. K. Esser
D. Modha
OODMQ
88
96
0
07 Jun 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural
  Networks
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
170
4,357
0
16 Mar 2016
Bitwise Neural Networks
Bitwise Neural Networks
Minje Kim
Paris Smaragdis
MQ
83
217
0
22 Jan 2016
Quantized Convolutional Neural Networks for Mobile Devices
Quantized Convolutional Neural Networks for Mobile Devices
Jiaxiang Wu
Cong Leng
Yuhang Wang
Qinghao Hu
Jian Cheng
MQ
88
1,166
0
21 Dec 2015
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
2.2K
194,020
0
10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin
Dario Amodei
Rishita Anubhai
Eric Battenberg
Carl Case
Jared Casper
...
Chong-Jun Wang
Bo Xiao
Dani Yogatama
J. Zhan
Zhenyao Zhu
137
2,973
0
08 Dec 2015
Resiliency of Deep Neural Networks under Quantization
Resiliency of Deep Neural Networks under Quantization
Wonyong Sung
Sungho Shin
Kyuyeon Hwang
MQ
58
157
0
20 Nov 2015
Fixed Point Quantization of Deep Convolutional Networks
Fixed Point Quantization of Deep Convolutional Networks
D. Lin
S. Talathi
V. Annapureddy
MQ
95
815
0
19 Nov 2015
BinaryConnect: Training Deep Neural Networks with binary weights during
  propagations
BinaryConnect: Training Deep Neural Networks with binary weights during propagations
Matthieu Courbariaux
Yoshua Bengio
J. David
MQ
209
2,985
0
02 Nov 2015
Neural Networks with Few Multiplications
Neural Networks with Few Multiplications
Zhouhan Lin
Matthieu Courbariaux
Roland Memisevic
Yoshua Bengio
87
331
0
11 Oct 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
259
8,842
0
01 Oct 2015
Variational Dropout and the Local Reparameterization Trick
Variational Dropout and the Local Reparameterization Trick
Diederik P. Kingma
Tim Salimans
Max Welling
BDL
226
1,514
0
08 Jun 2015
Rounding Methods for Neural Networks with Low Resolution Synaptic
  Weights
Rounding Methods for Neural Networks with Low Resolution Synaptic Weights
Lorenz K. Muller
Giacomo Indiveri
69
53
0
22 Apr 2015
12
Next