Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2001.02786
Cited By
Least squares binary quantization of neural networks
9 January 2020
Hadi Pouransari
Zhucheng Tu
Oncel Tuzel
MQ
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Least squares binary quantization of neural networks"
18 / 18 papers shown
Title
PARQ: Piecewise-Affine Regularized Quantization
Lisa Jin
Jianhao Ma
Zechun Liu
Andrey Gromov
Aaron Defazio
Lin Xiao
MQ
38
0
0
19 Mar 2025
Optimal Brain Apoptosis
Mingyuan Sun
Zheng Fang
Jiaxu Wang
Junjie Jiang
Delei Kong
Chenming Hu
Yuetong Fang
Renjing Xu
AAML
66
0
0
25 Feb 2025
PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off
Sachit Kuhar
Yash Jain
Alexey Tumanov
MQ
54
0
0
04 Dec 2023
Subgraph Stationary Hardware-Software Inference Co-Design
Payman Behnam
Jianming Tong
Alind Khare
Yang Chen
Yue Pan
Pranav Gadikar
A. Bambhaniya
T. Krishna
Alexey Tumanov
17
3
0
21 Jun 2023
Quick Dense Retrievers Consume KALE: Post Training Kullback Leibler Alignment of Embeddings for Asymmetrical dual encoders
Daniel Fernando Campos
Alessandro Magnani
Chengxiang Zhai
14
2
0
31 Mar 2023
oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes
Daniel Fernando Campos
Alexandre Marques
Mark Kurtz
Chengxiang Zhai
VLM
AAML
11
2
0
30 Mar 2023
Ultra-low Precision Multiplication-free Training for Deep Neural Networks
Chang-Shu Liu
Rui Zhang
Xishan Zhang
Yifan Hao
Zidong Du
Xingui Hu
Ling Li
Qi Guo
MQ
32
1
0
28 Feb 2023
Signed Binary Weight Networks
Sachit Kuhar
Alexey Tumanov
Judy Hoffman
MQ
13
1
0
25 Nov 2022
IR2Net: Information Restriction and Information Recovery for Accurate Binary Neural Networks
Ping Xue
Yang Lu
Jingfei Chang
Xing Wei
Zhen Wei
MQ
70
0
0
06 Oct 2022
Gaussian Pre-Activations in Neural Networks: Myth or Reality?
Pierre Wolinski
Julyan Arbel
AI4CE
68
8
0
24 May 2022
FAT: An In-Memory Accelerator with Fast Addition for Ternary Weight Neural Networks
Shien Zhu
Luan H. K. Duong
Hui Chen
Di Liu
Weichen Liu
MQ
14
5
0
19 Jan 2022
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method
Guhyun Kim
D. Jeong
MQ
34
2
0
06 Oct 2021
On the Acceleration of Deep Neural Network Inference using Quantized Compressed Sensing
Meshia Cédric Oveneke
MQ
14
0
0
23 Aug 2021
TENT: Efficient Quantization of Neural Networks on the tiny Edge with Tapered FixEd PoiNT
H. F. Langroudi
Vedant Karia
Tej Pandit
Dhireesha Kudithipudi
MQ
13
10
0
06 Apr 2021
Self-Distribution Binary Neural Networks
Ping Xue
Yang Lu
Jingfei Chang
Xing Wei
Zhen Wei
MQ
19
10
0
03 Mar 2021
Improving Accuracy of Binary Neural Networks using Unbalanced Activation Distribution
Hyungjun Kim
Jihoon Park
Chang-Ho Lee
Jae-Joon Kim
MQ
4
30
0
02 Dec 2020
FATNN: Fast and Accurate Ternary Neural Networks
Peng Chen
Bohan Zhuang
Chunhua Shen
MQ
4
15
0
12 Aug 2020
Extracurricular Learning: Knowledge Transfer Beyond Empirical Distribution
Hadi Pouransari
Mojan Javaheripi
Vinay Sharma
Oncel Tuzel
11
5
0
30 Jun 2020
1