Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1510.03009
Cited By
Neural Networks with Few Multiplications
11 October 2015
Zhouhan Lin
Matthieu Courbariaux
Roland Memisevic
Yoshua Bengio
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Neural Networks with Few Multiplications"
50 / 66 papers shown
Title
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs
Armand Foucault
Franck Mamalet
François Malgouyres
MQ
85
0
0
28 Jan 2025
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
Zhen Li
Yupeng Su
Runming Yang
C. Xie
Zehua Wang
Zhongwei Xie
Ngai Wong
Hongxia Yang
MQ
LRM
56
3
0
06 Jan 2025
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks
Cheng Gong
Ye Lu
Surong Dai
Deng Qian
Chenkun Du
Tao Li
MQ
29
0
0
07 Apr 2023
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics
Yahao Ding
Zhaohui Yang
Viet Quoc Pham
Zhaoyang Zhang
M. Shikh-Bahaei
36
31
0
03 Jan 2023
Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction
Shiwei Li
Huifeng Guo
Luyao Hou
Wei Zhang
Xing Tang
Ruiming Tang
Rui Zhang
Rui Li
MQ
135
9
0
12 Dec 2022
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
34
14
0
01 Dec 2022
MinUn: Accurate ML Inference on Microcontrollers
Shikhar Jaiswal
R. Goli
Aayan Kumar
Vivek Seshadri
Rahul Sharma
29
2
0
29 Oct 2022
Mixed-Precision Neural Networks: A Survey
M. Rakka
M. Fouda
Pramod P. Khargonekar
Fadi J. Kurdahi
MQ
25
11
0
11 Aug 2022
Combinatorial optimization for low bit-width neural networks
Hanxu Zhou
Aida Ashrafi
Matthew B. Blaschko
MQ
24
0
0
04 Jun 2022
Energy awareness in low precision neural networks
Nurit Spingarn-Eliezer
Ron Banner
Elad Hoffer
Hilla Ben-Yaacov
T. Michaeli
38
0
0
06 Feb 2022
The Ecological Footprint of Neural Machine Translation Systems
D. Shterionov
Eva Vanmassenhove
40
3
0
04 Feb 2022
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method
Guhyun Kim
D. Jeong
MQ
50
2
0
06 Oct 2021
Learning Gradual Argumentation Frameworks using Genetic Algorithms
J. Spieler
Nico Potyka
Steffen Staab
AI4CE
36
4
0
25 Jun 2021
Distributed Learning in Wireless Networks: Recent Progress and Future Challenges
Mingzhe Chen
Deniz Gündüz
Kaibin Huang
Walid Saad
M. Bennis
Aneta Vulgarakis Feljan
H. Vincent Poor
42
402
0
05 Apr 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey
Tailin Liang
C. Glossner
Lei Wang
Shaobo Shi
Xiaotong Zhang
MQ
150
675
0
24 Jan 2021
ShiftAddNet: A Hardware-Inspired Deep Network
Haoran You
Xiaohan Chen
Yongan Zhang
Chaojian Li
Sicheng Li
Zihao Liu
Zhangyang Wang
Yingyan Lin
OOD
MQ
76
76
0
24 Oct 2020
TernaryBERT: Distillation-aware Ultra-low Bit BERT
Wei Zhang
Lu Hou
Yichun Yin
Lifeng Shang
Xiao Chen
Xin Jiang
Qun Liu
MQ
33
208
0
27 Sep 2020
Resource-Efficient Neural Networks for Embedded Systems
Wolfgang Roth
Günther Schindler
Lukas Pfeifenberger
Robert Peharz
Sebastian Tschiatschek
Holger Fröning
Franz Pernkopf
Zoubin Ghahramani
34
47
0
07 Jan 2020
An Efficient Hardware-Oriented Dropout Algorithm
Y. J. Yeoh
Takashi Morie
H. Tamukoh
13
2
0
14 Nov 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective
Sauptik Dhar
Junyao Guo
Jiayi Liu
S. Tripathi
Unmesh Kurup
Mohak Shah
28
141
0
02 Nov 2019
LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network Inference
Erwei Wang
James J. Davis
P. Cheung
George A. Constantinides
MQ
9
41
0
24 Oct 2019
Fully Quantized Transformer for Machine Translation
Gabriele Prato
Ella Charlaix
Mehdi Rezagholizadeh
MQ
13
68
0
17 Oct 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs
Caiwen Ding
Shuo Wang
Ning Liu
Kaidi Xu
Yanzhi Wang
Yun Liang
MQ
24
89
0
29 Sep 2019
Accurate and Compact Convolutional Neural Networks with Trained Binarization
Zhe Xu
R. Cheung
MQ
27
54
0
25 Sep 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks
Shubham Jain
S. Gupta
A. Raghunathan
MQ
30
37
0
15 Sep 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products
Urmish Thakker
Jesse G. Beu
Dibakar Gope
Chu Zhou
Igor Fedorov
Ganesh S. Dasika
Matthew Mattina
27
36
0
07 Jun 2019
Attention Based Pruning for Shift Networks
G. B. Hacene
Carlos Lassance
Vincent Gripon
Matthieu Courbariaux
Yoshua Bengio
41
25
0
29 May 2019
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization
Anish Acharya
Rahul Goel
A. Metallinou
Inderjit Dhillon
22
58
0
01 Nov 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks
Shivang Agarwal
Jean Ogier du Terrail
F. Jurie
ObjD
24
123
0
10 Sep 2018
A Survey on Methods and Theories of Quantized Neural Networks
Yunhui Guo
MQ
29
232
0
13 Aug 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks
Julian Faraone
Nicholas J. Fraser
Michaela Blott
Philip H. W. Leong
MQ
33
133
0
01 Jul 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit?
Shilin Zhu
Xin Dong
Hao Su
MQ
30
135
0
20 Jun 2018
Adding New Tasks to a Single Network with Weight Transformations using Binary Masks
Massimiliano Mancini
Elisa Ricci
Barbara Caputo
Samuel Rota Buló
25
51
0
28 May 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches
Md. Zahangir Alom
T. Taha
C. Yakopcic
Stefan Westberg
P. Sidike
Mst Shamima Nasrin
B. Van Essen
A. Awwal
V. Asari
VLM
29
874
0
03 Mar 2018
Loss-aware Weight Quantization of Deep Networks
Lu Hou
James T. Kwok
MQ
35
127
0
23 Feb 2018
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation
Moritz B. Milde
Daniel Neil
Alessandro Aimar
T. Delbruck
Giacomo Indiveri
MQ
29
9
0
13 Nov 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks
Yu Cheng
Duo Wang
Pan Zhou
Zhang Tao
40
1,087
0
23 Oct 2017
Tracking Persons-of-Interest via Unsupervised Representation Adaptation
Shun Zhang
Jia-Bin Huang
Jongwoo Lim
Yihong Gong
Jinjun Wang
Narendra Ahuja
Ming-Hsuan Yang
CVBM
31
30
0
05 Oct 2017
Machine Learning Models that Remember Too Much
Congzheng Song
Thomas Ristenpart
Vitaly Shmatikov
VLM
30
505
0
22 Sep 2017
WRPN: Wide Reduced-Precision Networks
Asit K. Mishra
Eriko Nurvitadhi
Jeffrey J. Cook
Debbie Marr
MQ
39
266
0
04 Sep 2017
BitNet: Bit-Regularized Deep Neural Networks
Aswin Raghavan
Mohamed R. Amer
S. Chai
Graham Taylor
MQ
38
10
0
16 Aug 2017
SEP-Nets: Small and Effective Pattern Networks
Zhe Li
Xiaoyu Wang
Xutao Lv
Tianbao Yang
30
12
0
13 Jun 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization
Zhaowei Cai
Xiaodong He
Jian Sun
Nuno Vasconcelos
MQ
41
502
0
03 Feb 2017
Towards the Limit of Network Quantization
Yoojin Choi
Mostafa El-Khamy
Jungwon Lee
MQ
22
191
0
05 Dec 2016
Trained Ternary Quantization
Chenzhuo Zhu
Song Han
Huizi Mao
W. Dally
MQ
57
1,035
0
04 Dec 2016
LCNN: Lookup-based Convolutional Neural Network
Hessam Bagherinezhad
Mohammad Rastegari
Ali Farhadi
13
89
0
20 Nov 2016
Sparsely-Connected Neural Networks: Towards Efficient VLSI Implementation of Deep Neural Networks
A. Ardakani
C. Condo
W. Gross
33
40
0
04 Nov 2016
Accelerating Deep Convolutional Networks using low-precision and sparsity
Ganesh Venkatesh
Eriko Nurvitadhi
Debbie Marr
26
135
0
02 Oct 2016
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
42
1,842
0
22 Sep 2016
Ternary Neural Networks for Resource-Efficient AI Applications
Hande Alemdar
V. Leroy
Adrien Prost-Boucle
F. Pétrot
24
204
0
01 Sep 2016
1
2
Next