Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.07061
Cited By
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
22 September 2016
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations"
50 / 301 papers shown
Title
Vertical Layering of Quantized Neural Networks for Heterogeneous Inference
Hai Wu
Ruifei He
Hao Hao Tan
Xiaojuan Qi
Kaibin Huang
MQ
27
2
0
10 Dec 2022
QFT: Post-training quantization via fast joint finetuning of all degrees of freedom
Alexander Finkelstein
Ella Fuchs
Idan Tal
Mark Grobman
Niv Vosco
Eldad Meller
MQ
32
6
0
05 Dec 2022
Fast Inference from Transformers via Speculative Decoding
Yaniv Leviathan
Matan Kalman
Yossi Matias
LRM
46
636
0
30 Nov 2022
Verifying And Interpreting Neural Networks using Finite Automata
Marco Sälzer
Eric Alsmann
Florian Bruse
M. Lange
AAML
30
3
0
02 Nov 2022
Towards Global Neural Network Abstractions with Locally-Exact Reconstruction
Edoardo Manino
I. Bessa
Lucas C. Cordeiro
21
1
0
21 Oct 2022
MotionDeltaCNN: Sparse CNN Inference of Frame Differences in Moving Camera Videos
Mathias Parger
Chengcheng Tang
Thomas Neff
Christopher D. Twigg
Cem Keskin
Robert Y. Wang
M. Steinberger
27
6
0
18 Oct 2022
AttTrack: Online Deep Attention Transfer for Multi-object Tracking
Keivan Nalaie
Rong Zheng
VOT
21
5
0
16 Oct 2022
AlphaTuning: Quantization-Aware Parameter-Efficient Adaptation of Large-Scale Pre-Trained Language Models
S. Kwon
Jeonghoon Kim
Jeongin Bae
Kang Min Yoo
Jin-Hwa Kim
Baeseong Park
Byeongwook Kim
Jung-Woo Ha
Nako Sung
Dongsoo Lee
MQ
29
30
0
08 Oct 2022
Limitations of neural network training due to numerical instability of backpropagation
Clemens Karner
V. Kazeev
P. Petersen
40
3
0
03 Oct 2022
Survey: Exploiting Data Redundancy for Optimization of Deep Learning
Jou-An Chen
Wei Niu
Bin Ren
Yanzhi Wang
Xipeng Shen
23
24
0
29 Aug 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
27
14
0
20 Jul 2022
Green, Quantized Federated Learning over Wireless Networks: An Energy-Efficient Design
Minsu Kim
Walid Saad
Mohammad Mozaffari
Merouane Debbah
FedML
MQ
25
28
0
19 Jul 2022
MCTensor: A High-Precision Deep Learning Library with Multi-Component Floating-Point
Tao Yu
Wen-Ping Guo
Jianan Canal Li
Tiancheng Yuan
Chris De Sa
30
4
0
18 Jul 2022
CEG4N: Counter-Example Guided Neural Network Quantization Refinement
J. Matos
I. Bessa
Edoardo Manino
Xidan Song
Lucas C. Cordeiro
MQ
48
2
0
09 Jul 2022
QReg: On Regularization Effects of Quantization
Mohammadhossein Askarihemmat
Reyhane Askari Hemmat
Alexander Hoffman
Ivan Lazarevich
Ehsan Saboori
Olivier Mastropietro
Sudhakar Sah
Yvon Savaria
J. David
MQ
37
5
0
24 Jun 2022
Why Quantization Improves Generalization: NTK of Binary Weight Neural Networks
Kaiqi Zhang
Ming Yin
Yu-Xiang Wang
MQ
24
4
0
13 Jun 2022
8-bit Numerical Formats for Deep Neural Networks
Badreddine Noune
Philip Jones
Daniel Justus
Dominic Masters
Carlo Luschi
MQ
23
34
0
06 Jun 2022
Fine-tuning Language Models over Slow Networks using Activation Compression with Guarantees
Jue Wang
Binhang Yuan
Luka Rimanic
Yongjun He
Tri Dao
Beidi Chen
Christopher Ré
Ce Zhang
AI4CE
24
11
0
02 Jun 2022
Ultra-compact Binary Neural Networks for Human Activity Recognition on RISC-V Processors
Francesco Daghero
Chenhao Xie
Daniele Jahier Pagliari
Alessio Burrello
Marco Castellano
Luca Gandolfi
A. Calimera
Enrico Macii
M. Poncino
BDL
MQ
35
13
0
25 May 2022
Machine Learning Operations (MLOps): Overview, Definition, and Architecture
Dominik Kreuzberger
Niklas Kühl
Sebastian Hirschl
VLM
AI4CE
19
334
0
04 May 2022
LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
Sharath Girish
Kamal Gupta
Saurabh Singh
Abhinav Shrivastava
38
11
0
06 Apr 2022
FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
Ahmad Shawahna
S. M. Sait
A. El-Maleh
Irfan Ahmad
MQ
20
7
0
22 Mar 2022
FlexBlock: A Flexible DNN Training Accelerator with Multi-Mode Block Floating Point Support
Seock-Hwan Noh
Jahyun Koo
Seunghyun Lee
Jongse Park
Jaeha Kung
AI4CE
32
17
0
13 Mar 2022
Highly-Efficient Binary Neural Networks for Visual Place Recognition
Bruno Ferrarini
Michael Milford
Klaus D. McDonald-Maier
Shoaib Ehsan
14
7
0
24 Feb 2022
Rare Gems: Finding Lottery Tickets at Initialization
Kartik K. Sreenivasan
Jy-yong Sohn
Liu Yang
Matthew Grinde
Alliot Nagle
Hongyi Wang
Eric P. Xing
Kangwook Lee
Dimitris Papailiopoulos
32
42
0
24 Feb 2022
Distilled Neural Networks for Efficient Learning to Rank
F. M. Nardini
Cosimo Rulli
Salvatore Trani
Rossano Venturini
FedML
29
16
0
22 Feb 2022
Inverse design of photonic devices with strict foundry fabrication constraints
M. Schubert
A. C. H. Cheung
Ian A. D. Williamson
Aleksandra Spyra
David H. Alexander
25
51
0
31 Jan 2022
Automatic Mixed-Precision Quantization Search of BERT
Changsheng Zhao
Ting Hua
Yilin Shen
Qian Lou
Hongxia Jin
MQ
22
19
0
30 Dec 2021
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation
Zechun Liu
Kwang-Ting Cheng
Dong Huang
Eric P. Xing
Zhiqiang Shen
MQ
25
103
0
29 Nov 2021
Mixed Precision Low-bit Quantization of Neural Network Language Models for Speech Recognition
Junhao Xu
Jianwei Yu
Shoukang Hu
Xunying Liu
Helen Meng
MQ
30
13
0
29 Nov 2021
Mixed Precision of Quantization of Transformer Language Models for Speech Recognition
Junhao Xu
Shoukang Hu
Jianwei Yu
Xunying Liu
Helen M. Meng
MQ
40
15
0
29 Nov 2021
On the Tradeoff between Energy, Precision, and Accuracy in Federated Quantized Neural Networks
Minsu Kim
Walid Saad
Mohammad Mozaffari
Merouane Debbah
FedML
MQ
19
23
0
15 Nov 2021
Representation Edit Distance as a Measure of Novelty
J. Alspector
30
6
0
04 Nov 2021
Automatic Sleep Staging of EEG Signals: Recent Development, Challenges, and Future Directions
Huy P Phan
Kaare B. Mikkelsen
19
94
0
03 Nov 2021
Whole Brain Segmentation with Full Volume Neural Network
Yeshu Li
Jianwei Cui
Yilun Sheng
Xiao Liang
Jingdong Wang
E. Chang
Yan Xu
32
11
0
29 Oct 2021
PAC-Bayesian Learning of Aggregated Binary Activated Neural Networks with Probabilities over Representations
Louis Fortier-Dubois
Gaël Letarte
Benjamin Leblanc
Franccois Laviolette
Pascal Germain
UQCV
19
0
0
28 Oct 2021
Demystifying and Generalizing BinaryConnect
Abhishek Sharma
Yaoliang Yu
Eyyub Sari
Mahdi Zolnouri
V. Nia
MQ
22
8
0
25 Oct 2021
Instance-Conditional Knowledge Distillation for Object Detection
Zijian Kang
Peizhen Zhang
Xinming Zhang
Jian Sun
N. Zheng
27
76
0
25 Oct 2021
ConformalLayers: A non-linear sequential neural network with associative layers
Zhen Wan
Zhuoyuan Mao
C. N. Vasconcelos
22
3
0
23 Oct 2021
Sub-bit Neural Networks: Learning to Compress and Accelerate Binary Neural Networks
Yikai Wang
Yi Yang
Gang Hua
Anbang Yao
MQ
29
15
0
18 Oct 2021
Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations
Xinyu Zhang
Ian Colbert
Ken Kreutz-Delgado
Srinjoy Das
MQ
32
11
0
15 Oct 2021
Haar Wavelet Feature Compression for Quantized Graph Convolutional Networks
Moshe Eliasof
Ben Bodner
Eran Treister
GNN
35
7
0
10 Oct 2021
Graphs as Tools to Improve Deep Learning Methods
Carlos Lassance
Myriam Bontonou
Mounia Hamidouche
Bastien Pasdeloup
Lucas Drumetz
Vincent Gripon
GNN
AI4CE
AAML
47
0
0
08 Oct 2021
VC dimension of partially quantized neural networks in the overparametrized regime
Yutong Wang
Clayton D. Scott
25
1
0
06 Oct 2021
Convolutional Neural Network Compression through Generalized Kronecker Product Decomposition
Marawan Gamal Abdel Hameed
Marzieh S. Tahaei
A. Mosleh
V. Nia
47
25
0
29 Sep 2021
Understanding and Overcoming the Challenges of Efficient Transformer Quantization
Yelysei Bondarenko
Markus Nagel
Tijmen Blankevoort
MQ
25
133
0
27 Sep 2021
iRNN: Integer-only Recurrent Neural Network
Eyyub Sari
Vanessa Courville
V. Nia
MQ
56
4
0
20 Sep 2021
Quantized Convolutional Neural Networks Through the Lens of Partial Differential Equations
Ido Ben-Yair
Gil Ben Shalom
Moshe Eliasof
Eran Treister
MQ
36
5
0
31 Aug 2021
Compact representations of convolutional neural networks via weight pruning and quantization
Giosuè Cataldo Marinò
A. Petrini
D. Malchiodi
Marco Frasca
MQ
21
4
0
28 Aug 2021
4-bit Quantization of LSTM-based Speech Recognition Models
A. Fasoli
Chia-Yu Chen
Mauricio Serrano
Xiao Sun
Naigang Wang
...
Xiaodong Cui
Brian Kingsbury
Wei Zhang
Zoltán Tüske
K. Gopalakrishnan
MQ
26
21
0
27 Aug 2021
Previous
1
2
3
4
5
6
7
Next