Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1706.02379
Cited By
Training Quantized Nets: A Deeper Understanding
7 June 2017
Hao Li
Soham De
Zheng Xu
Christoph Studer
H. Samet
Tom Goldstein
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Training Quantized Nets: A Deeper Understanding"
37 / 37 papers shown
Title
Stochastic Rounding for LLM Training: Theory and Practice
Kaan Ozkara
Tao Yu
Youngsuk Park
43
0
0
27 Feb 2025
AdaRankGrad: Adaptive Gradient-Rank and Moments for Memory-Efficient LLMs Training and Fine-Tuning
Yehonathan Refael
Jonathan Svirsky
Boris Shustin
Wasim Huleihel
Ofir Lindenbaum
47
3
0
31 Dec 2024
BOLD: Boolean Logic Deep Learning
Van Minh Nguyen
Cristian Ocampo
Aymen Askri
Louis Leconte
Ba-Hien Tran
AI4CE
40
0
0
25 May 2024
OAC: Output-adaptive Calibration for Accurate Post-training Quantization
Ali Edalati
Alireza Ghaffari
M. Asgharian
Lu Hou
Boxing Chen
Vahid Partovi Nia
V. Nia
MQ
86
0
0
23 May 2024
Fixed-point quantization aware training for on-device keyword-spotting
Sashank Macha
Om Oza
Alex Escott
Francesco Calivá
Robert M. Armitano
S. Cheekatmalla
S. Parthasarathi
Yuzong Liu
MQ
16
4
0
04 Mar 2023
Gradient Shaping: Enhancing Backdoor Attack Against Reverse Engineering
Rui Zhu
Di Tang
Siyuan Tang
Guanhong Tao
Shiqing Ma
Xiaofeng Wang
Haixu Tang
DD
20
3
0
29 Jan 2023
Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction
Shiwei Li
Huifeng Guo
Luyao Hou
Wei Zhang
Xing Tang
Ruiming Tang
Rui Zhang
Rui Li
MQ
109
9
0
12 Dec 2022
AskewSGD : An Annealed interval-constrained Optimisation method to train Quantized Neural Networks
Louis Leconte
S. Schechtman
Eric Moulines
29
4
0
07 Nov 2022
MinUn: Accurate ML Inference on Microcontrollers
Shikhar Jaiswal
R. Goli
Aayan Kumar
Vivek Seshadri
Rahul Sharma
26
2
0
29 Oct 2022
Update Compression for Deep Neural Networks on the Edge
Bo Chen
A. Bakhshi
Gustavo E. A. P. A. Batista
Brian Ng
Tat-Jun Chin
24
17
0
09 Mar 2022
PokeBNN: A Binary Pursuit of Lightweight Accuracy
Yichi Zhang
Zhiru Zhang
Lukasz Lew
MQ
35
57
0
30 Nov 2021
DAdaQuant: Doubly-adaptive quantization for communication-efficient Federated Learning
Robert Hönig
Yiren Zhao
Robert D. Mullins
FedML
109
54
0
31 Oct 2021
VC dimension of partially quantized neural networks in the overparametrized regime
Yutong Wang
Clayton D. Scott
20
1
0
06 Oct 2021
QuPeD: Quantized Personalization via Distillation with Applications to Federated Learning
Kaan Ozkara
Navjot Singh
Deepesh Data
Suhas Diggavi
FedML
MQ
24
56
0
29 Jul 2021
Random and Adversarial Bit Error Robustness: Energy-Efficient and Secure DNN Accelerators
David Stutz
Nandhini Chandramoorthy
Matthias Hein
Bernt Schiele
AAML
MQ
24
18
0
16 Apr 2021
Advances in Electron Microscopy with Deep Learning
Jeffrey M. Ede
32
2
0
04 Jan 2021
Review: Deep Learning in Electron Microscopy
Jeffrey M. Ede
34
79
0
17 Sep 2020
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
50
457
0
31 Mar 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
31
47
0
07 Jan 2020
Towards Unified INT8 Training for Convolutional Neural Network
Feng Zhu
Ruihao Gong
F. Yu
Xianglong Liu
Yanfei Wang
Zhelong Li
Xiuqi Yang
Junjie Yan
MQ
29
150
0
29 Dec 2019
Post-Training 4-bit Quantization on Embedding Tables
Hui Guan
Andrey Malevich
Jiyan Yang
Jongsoo Park
Hector Yuen
MQ
11
32
0
05 Nov 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
19
141
0
02 Nov 2019
Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks
Ruihao Gong
Xianglong Liu
Shenghu Jiang
Tian-Hao Li
Peng Hu
Jiazhen Lin
F. Yu
Junjie Yan
MQ
23
446
0
14 Aug 2019
QUOTIENT: Two-Party Secure Neural Network Training and Prediction
Nitin Agrawal
Ali Shahin Shamsabadi
Matt J. Kusner
Adria Gascon
24
212
0
08 Jul 2019
ViP: Virtual Pooling for Accelerating CNN-based Image Classification and Object Detection
Zhuo Chen
Jiyuan Zhang
Ruizhou Ding
Diana Marculescu
13
12
0
19 Jun 2019
Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization
K. Helwegen
James Widdicombe
Lukas Geiger
Zechun Liu
K. Cheng
Roeland Nusselder
MQ
27
110
0
05 Jun 2019
SWALP : Stochastic Weight Averaging in Low-Precision Training
Guandao Yang
Tianyi Zhang
Polina Kirichenko
Junwen Bai
A. Wilson
Christopher De Sa
18
94
0
26 Apr 2019
Blended Coarse Gradient Descent for Full Quantization of Deep Neural Networks
Penghang Yin
Shuai Zhang
J. Lyu
Stanley Osher
Y. Qi
Jack Xin
MQ
36
61
0
15 Aug 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
29
230
0
13 Aug 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
22
135
0
20 Jun 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
24
127
0
23 Feb 2018
Training wide residual networks for deployment using a single bit for each weight
Mark D Mcdonnell
MQ
27
71
0
23 Feb 2018
Training and Inference with Integers in Deep Neural Networks
Shuang Wu
Guoqi Li
F. Chen
Luping Shi
MQ
24
389
0
13 Feb 2018
BinaryRelax: A Relaxation Approach For Training Deep Neural Networks With Quantized Weights
Penghang Yin
Shuai Zhang
J. Lyu
Stanley Osher
Y. Qi
Jack Xin
MQ
22
78
0
19 Jan 2018
Deep Learning for Real-Time Crime Forecasting and its Ternarization
Bao Wang
Penghang Yin
Andrea L. Bertozzi
P. Brantingham
Stanley J. Osher
Jack Xin
AI4TS
38
82
0
23 Nov 2017
Deep Learning as a Mixed Convex-Combinatorial Optimization Problem
A. Friesen
Pedro M. Domingos
26
20
0
31 Oct 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
334
1,049
0
10 Feb 2017
1