v1v2 (latest)

A Survey on Methods and Theories of Quantized Neural Networks

13 August 2018

Papers citing "A Survey on Methods and Theories of Quantized Neural Networks"

50 / 59 papers shown

Title
Automatic mixed precision for optimizing gained time with constrained loss mean-squared-error based on model partition to sequential sub-graphs Shmulik Markovich-Golan Daniel Ohayon Itay Niv Yair Hanani MQ 102 0 0 19 May 2025
Gradual Binary Search and Dimension Expansion : A general method for activation quantization in LLMs Lucas Maisonnave Cyril Moineau Olivier Bichler Fabrice Rastello MQ 79 0 0 18 Apr 2025
Unsupervised detection of semantic correlations in big data Santiago Acevedo Alex Rodriguez Alessandro Laio 103 2 0 04 Nov 2024
Loss-aware Weight Quantization of Deep Networks Lu Hou James T. Kwok MQ 82 127 0 23 Feb 2018
Model compression via distillation and quantization A. Polino Razvan Pascanu Dan Alistarh MQ 83 731 0 15 Feb 2018
Training and Inference with Integers in Deep Neural Networks Shuang Wu Guoqi Li F. Chen Luping Shi MQ 65 391 0 13 Feb 2018
Edge-Host Partitioning of Deep Neural Networks with Feature Space Encoding for Resource-Constrained Internet-of-Things Platforms J. Ko Taesik Na M. Amir Saibal Mukhopadhyay 54 150 0 11 Feb 2018
Alternating Multi-bit Quantization for Recurrent Neural Networks Chen Xu Jianqiang Yao Zhouchen Lin Wenwu Ou Yuanbin Cao Zhirong Wang H. Zha MQ 76 116 0 01 Feb 2018
Adaptive Quantization for Deep Neural Network Yiren Zhou Seyed-Mohsen Moosavi-Dezfooli Ngai-Man Cheung P. Frossard MQ 67 183 0 04 Dec 2017
Towards Accurate Binary Convolutional Neural Network Xiaofan Lin Cong Zhao Wei Pan MQ 84 644 0 30 Nov 2017
Minimum Energy Quantized Neural Networks Bert Moons Koen Goetschalckx Nick Van Berckelaer Marian Verhelst MQ 56 123 0 01 Nov 2017
Learning Discrete Weights Using the Local Reparameterization Trick Oran Shayer Dan Levi Ethan Fetaya 43 88 0 21 Oct 2017
WRPN: Wide Reduced-Precision Networks Asit K. Mishra Eriko Nurvitadhi Jeffrey J. Cook Debbie Marr MQ 78 267 0 04 Sep 2017
Recent Trends in Deep Learning Based Natural Language Processing Tom Young Devamanyu Hazarika Soujanya Poria Min Zhang 75 2,835 0 09 Aug 2017
Extremely Low Bit Neural Network: Squeeze the Last Bit Out with ADMM Cong Leng Hao Li Shenghuo Zhu Rong Jin MQ 63 287 0 24 Jul 2017
Model compression as constrained optimization, with application to neural nets. Part II: quantization M. A. Carreira-Perpiñán Yerlan Idelbayev MQ 53 37 0 13 Jul 2017
ShiftCNN: Generalized Low-Precision Architecture for Inference of Convolutional Neural Networks Denis A. Gudovskiy Luca Rigazio MQ 96 53 0 07 Jun 2017
Training Quantized Nets: A Deeper Understanding Hao Li Soham De Zheng Xu Christoph Studer H. Samet Tom Goldstein MQ 53 211 0 07 Jun 2017
Network Sketching: Exploiting Binary Structure in Deep CNNs Yiwen Guo Anbang Yao Hao Zhao Yurong Chen MQ 69 95 0 07 Jun 2017
GXNOR-Net: Training deep neural networks with ternary weights and activations without full-precision memory under a unified discretization framework Lei Deng Peng Jiao Jing Pei Zhenzhi Wu Guoqi Li MQ 55 20 0 25 May 2017
TernGrad: Ternary Gradients to Reduce Communication in Distributed Deep Learning W. Wen Cong Xu Feng Yan Chunpeng Wu Yandan Wang Yiran Chen Hai Helen Li 140 989 0 22 May 2017
The High-Dimensional Geometry of Binary Neural Networks Alexander G. Anderson C. P. Berg MQ 62 76 0 19 May 2017
Ternary Neural Networks with Fine-Grained Quantization Naveen Mellempudi Abhisek Kundu Dheevatsa Mudigere Dipankar Das Bharat Kaul Pradeep Dubey MQ 94 111 0 02 May 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights Aojun Zhou Anbang Yao Yiwen Guo Lin Xu Yurong Chen MQ 395 1,051 0 10 Feb 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization Zhaowei Cai Xiaodong He Jian Sun Nuno Vasconcelos MQ 136 505 0 03 Feb 2017
Variational Dropout Sparsifies Deep Neural Networks Dmitry Molchanov Arsenii Ashukha Dmitry Vetrov BDL 144 830 0 19 Jan 2017
Towards the Limit of Network Quantization Yoojin Choi Mostafa El-Khamy Jungwon Lee MQ 50 194 0 05 Dec 2016
Trained Ternary Quantization Chenzhuo Zhu Song Han Huizi Mao W. Dally MQ 137 1,035 0 04 Dec 2016
Training Bit Fully Convolutional Network for Fast Semantic Segmentation He Wen Shuchang Zhou Zhe Liang Yuxiang Zhang Dieqiao Feng Xinyu Zhou Cong Yao MQ SSeg 71 10 0 01 Dec 2016
Effective Quantization Methods for Recurrent Neural Networks Qinyao He He Wen Shuchang Zhou Yuxin Wu Cong Yao Xinyu Zhou Yuheng Zou MQ 63 75 0 30 Nov 2016
Sigma Delta Quantized Networks Peter O'Connor Max Welling 57 48 0 07 Nov 2016
Fixed-point Factorized Networks Peisong Wang Jian Cheng MQ 40 43 0 07 Nov 2016
Loss-aware Binarization of Deep Networks Lu Hou Quanming Yao James T. Kwok MQ 70 220 0 05 Nov 2016
Federated Learning: Strategies for Improving Communication Efficiency Jakub Konecný H. B. McMahan Felix X. Yu Peter Richtárik A. Suresh Dave Bacon FedML 306 4,646 0 18 Oct 2016
Recurrent Neural Networks With Limited Numerical Precision Joachim Ott Zhouhan Lin Yanzhe Zhang Shih-Chii Liu Yoshua Bengio MQ 74 77 0 24 Aug 2016
Overcoming Challenges in Fixed Point Training of Deep Convolutional Networks D. Lin S. Talathi 67 45 0 08 Jul 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients Shuchang Zhou Yuxin Wu Zekun Ni Xinyu Zhou He Wen Yuheng Zou MQ 122 2,088 0 20 Jun 2016
Deep neural networks are robust to weight binarization and other non-linear distortions P. Merolla R. Appuswamy John V. Arthur S. K. Esser D. Modha OOD MQ 88 96 0 07 Jun 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks Mohammad Rastegari Vicente Ordonez Joseph Redmon Ali Farhadi MQ 170 4,357 0 16 Mar 2016
Bitwise Neural Networks Minje Kim Paris Smaragdis MQ 83 217 0 22 Jan 2016
Quantized Convolutional Neural Networks for Mobile Devices Jiaxiang Wu Cong Leng Yuhang Wang Qinghao Hu Jian Cheng MQ 88 1,166 0 21 Dec 2015
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 2.2K 194,020 0 10 Dec 2015
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin Dario Amodei Rishita Anubhai Eric Battenberg Carl Case Jared Casper ... Chong-Jun Wang Bo Xiao Dani Yogatama J. Zhan Zhenyao Zhu 137 2,973 0 08 Dec 2015
Resiliency of Deep Neural Networks under Quantization Wonyong Sung Sungho Shin Kyuyeon Hwang MQ 58 157 0 20 Nov 2015
Fixed Point Quantization of Deep Convolutional Networks D. Lin S. Talathi V. Annapureddy MQ 95 815 0 19 Nov 2015
BinaryConnect: Training Deep Neural Networks with binary weights during propagations Matthieu Courbariaux Yoshua Bengio J. David MQ 209 2,985 0 02 Nov 2015
Neural Networks with Few Multiplications Zhouhan Lin Matthieu Courbariaux Roland Memisevic Yoshua Bengio 87 331 0 11 Oct 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding Song Han Huizi Mao W. Dally 3DGS 259 8,842 0 01 Oct 2015
Variational Dropout and the Local Reparameterization Trick Diederik P. Kingma Tim Salimans Max Welling BDL 226 1,514 0 08 Jun 2015
Rounding Methods for Neural Networks with Low Resolution Synaptic Weights Lorenz K. Muller Giacomo Indiveri 69 53 0 22 Apr 2015