Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1609.07061
Cited By
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
22 September 2016
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations"
50 / 55 papers shown
Title
L-MTP: Leap Multi-Token Prediction Beyond Adjacent Context for Large Language Models
Xiaohao Liu
Xiaobo Xia
Weixiang Zhao
Manyi Zhang
Xianzhi Yu
Xiu Su
Shuo Yang
See-Kiong Ng
Tat-Seng Chua
KELM
LRM
87
0
0
23 May 2025
Cauchy-Schwarz Regularizers
Sueda Taner
Ziyi Wang
Christoph Studer
91
0
0
03 Mar 2025
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
211
18
0
03 Mar 2025
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
265
0
0
28 Jan 2025
BILLNET: A Binarized Conv3D-LSTM Network with Logic-gated residual architecture for hardware-efficient video inference
Van Thien Nguyen
William Guicquero
Gilles Sicard
3DV
MQ
143
2
0
24 Jan 2025
MOGNET: A Mux-residual quantized Network leveraging Online-Generated weights
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
136
1
0
17 Jan 2025
Histogram-Equalized Quantization for logic-gated Residual Neural Networks
Van Thien Nguyen
William Guicquero
Gilles Sicard
MQ
118
2
0
10 Jan 2025
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
Zhen Li
Yupeng Su
Runming Yang
C. Xie
Zehua Wang
Zhongwei Xie
Ngai Wong
Hongxia Yang
MQ
LRM
148
4
0
06 Jan 2025
Unsupervised detection of semantic correlations in big data
Santiago Acevedo
Alex Rodriguez
Alessandro Laio
132
3
0
04 Nov 2024
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
464
0
0
29 Oct 2024
MLPerf Power: Benchmarking the Energy Efficiency of Machine Learning Systems from Microwatts to Megawatts for Sustainable AI
Arya Tschand
Arun Tejusve Raghunath Rajan
S. Idgunji
Anirban Ghosh
J. Holleman
...
Rowan Taubitz
Sean Zhan
Scott Wasson
David Kanter
Vijay Janapa Reddi
118
3
0
15 Oct 2024
Selective Attention Improves Transformer
Yaniv Leviathan
Matan Kalman
Yossi Matias
105
12
0
03 Oct 2024
MimiQ: Low-Bit Data-Free Quantization of Vision Transformers with Encouraging Inter-Head Attention Similarity
Kanghyun Choi
Hyeyoon Lee
Dain Kwon
Sunjong Park
Kyuyeun Kim
Noseong Park
Jinho Lee
Jinho Lee
MQ
113
2
0
29 Jul 2024
EAGLE: Speculative Sampling Requires Rethinking Feature Uncertainty
Yuhui Li
Fangyun Wei
Chao Zhang
Hongyang R. Zhang
142
165
0
26 Jan 2024
Data-Free Quantization via Mixed-Precision Compensation without Fine-Tuning
Jun Chen
Shipeng Bai
Tianxin Huang
Mengmeng Wang
Guanzhong Tian
Y. Liu
MQ
98
19
0
02 Jul 2023
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference
Chaojian Li
Wuyang Chen
Yuchen Gu
Tianlong Chen
Yonggan Fu
Zhangyang Wang
Yingyan Lin
109
0
0
16 Jul 2021
Optimal Lottery Tickets via SubsetSum: Logarithmic Over-Parameterization is Sufficient
Ankit Pensia
Shashank Rajput
Alliot Nagle
Harit Vishwakarma
Dimitris Papailiopoulos
60
104
0
14 Jun 2020
Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming
Itay Hubara
Yury Nahshan
Y. Hanani
Ron Banner
Daniel Soudry
MQ
107
128
0
14 Jun 2020
Ensemble Distillation for Robust Model Fusion in Federated Learning
Tao R. Lin
Lingjing Kong
Sebastian U. Stich
Martin Jaggi
FedML
106
1,051
0
12 Jun 2020
Ternary Neural Networks with Fine-Grained Quantization
Naveen Mellempudi
Abhisek Kundu
Dheevatsa Mudigere
Dipankar Das
Bharat Kaul
Pradeep Dubey
MQ
97
111
0
02 May 2017
Quantization and Training of Low Bit-Width Convolutional Neural Networks for Object Detection
Penghang Yin
Shuai Zhang
Y. Qi
Jack Xin
MQ
161
42
0
19 Dec 2016
Recurrent Neural Networks With Limited Numerical Precision
Joachim Ott
Zhouhan Lin
Yanzhe Zhang
Shih-Chii Liu
Yoshua Bengio
MQ
77
77
0
24 Aug 2016
DoReFa-Net: Training Low Bitwidth Convolutional Neural Networks with Low Bitwidth Gradients
Shuchang Zhou
Yuxin Wu
Zekun Ni
Xinyu Zhou
He Wen
Yuheng Zou
MQ
132
2,090
0
20 Jun 2016
YodaNN: An Architecture for Ultra-Low Power Binary-Weight CNN Acceleration
Renzo Andri
Lukas Cavigelli
D. Rossi
Luca Benini
67
196
0
17 Jun 2016
Deep neural networks are robust to weight binarization and other non-linear distortions
P. Merolla
R. Appuswamy
John V. Arthur
S. K. Esser
D. Modha
OOD
MQ
91
96
0
07 Jun 2016
Hardware-oriented Approximation of Convolutional Neural Networks
Philipp Gysel
Mohammad Motamedi
S. Ghiasi
92
310
0
11 Apr 2016
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks
Mohammad Rastegari
Vicente Ordonez
Joseph Redmon
Ali Farhadi
MQ
175
4,369
0
16 Mar 2016
Convolutional Neural Networks using Logarithmic Data Representation
Daisuke Miyashita
Edward H. Lee
B. Murmann
MQ
81
429
0
03 Mar 2016
Bitwise Neural Networks
Minje Kim
Paris Smaragdis
MQ
83
217
0
22 Jan 2016
BinaryConnect: Training Deep Neural Networks with binary weights during propagations
Matthieu Courbariaux
Yoshua Bengio
J. David
MQ
212
2,993
0
02 Nov 2015
Neural Networks with Few Multiplications
Zhouhan Lin
Matthieu Courbariaux
Roland Memisevic
Yoshua Bengio
102
331
0
11 Oct 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
263
8,862
0
01 Oct 2015
Generalizing Pooling Functions in Convolutional Neural Networks: Mixed, Gated, and Tree
Chen-Yu Lee
Patrick W. Gallagher
Zhuowen Tu
AI4CE
75
484
0
30 Sep 2015
Subdominant Dense Clusters Allow for Simple Learning and High Computational Performance in Neural Networks with Discrete Synapses
Carlo Baldassi
Alessandro Ingrosso
Carlo Lucibello
Luca Saglietti
R. Zecchina
64
129
0
18 Sep 2015
Learning both Weights and Connections for Efficient Neural Networks
Song Han
Jeff Pool
J. Tran
W. Dally
CVBM
316
6,709
0
08 Jun 2015
Compressing Neural Networks with the Hashing Trick
Wenlin Chen
James T. Wilson
Stephen Tyree
Kilian Q. Weinberger
Yixin Chen
168
1,191
0
19 Apr 2015
Training Binary Multilayer Neural Networks for Image Classification using Expectation Backpropagation
Zhiyong Cheng
Daniel Soudry
Zexi Mao
Zhenzhong Lan
MQ
69
52
0
12 Mar 2015
Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
Sergey Ioffe
Christian Szegedy
OOD
467
43,347
0
11 Feb 2015
Deep Learning with Limited Numerical Precision
Suyog Gupta
A. Agrawal
K. Gopalakrishnan
P. Narayanan
HAI
207
2,049
0
09 Feb 2015
Adam: A Method for Stochastic Optimization
Diederik P. Kingma
Jimmy Ba
ODL
2.1K
150,364
0
22 Dec 2014
FitNets: Hints for Thin Deep Nets
Adriana Romero
Nicolas Ballas
Samira Ebrahimi Kahou
Antoine Chassang
C. Gatta
Yoshua Bengio
FedML
322
3,899
0
19 Dec 2014
Compressing Deep Convolutional Networks using Vector Quantization
Yunchao Gong
Liu Liu
Ming Yang
Lubomir D. Bourdev
MQ
179
1,171
0
18 Dec 2014
Efficient and Accurate Approximations of Nonlinear Convolutional Networks
Xinming Zhang
Jianhua Zou
Xiang Ming
Kaiming He
Jian Sun
3DV
87
263
0
16 Nov 2014
Spatially-sparse convolutional neural networks
Benjamin Graham
98
231
0
22 Sep 2014
Deeply-Supervised Nets
Chen-Yu Lee
Saining Xie
Patrick W. Gallagher
Zhengyou Zhang
Zhuowen Tu
352
2,243
0
18 Sep 2014
Going Deeper with Convolutions
Christian Szegedy
Wei Liu
Yangqing Jia
P. Sermanet
Scott E. Reed
Dragomir Anguelov
D. Erhan
Vincent Vanhoucke
Andrew Rabinovich
494
43,698
0
17 Sep 2014
Sequence to Sequence Learning with Neural Networks
Ilya Sutskever
Oriol Vinyals
Quoc V. Le
AIMat
448
20,606
0
10 Sep 2014
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
1.7K
100,529
0
04 Sep 2014
Neural Machine Translation by Jointly Learning to Align and Translate
Dzmitry Bahdanau
Kyunghyun Cho
Yoshua Bengio
AIMat
578
27,338
0
01 Sep 2014
One weird trick for parallelizing convolutional neural networks
A. Krizhevsky
GNN
93
1,303
0
23 Apr 2014
1
2
Next