Neural Networks with Few Multiplications

11 October 2015

Papers citing "Neural Networks with Few Multiplications"

50 / 66 papers shown

Title
HadamRNN: Binary and Sparse Ternary Orthogonal RNNs Armand Foucault Franck Mamalet François Malgouyres MQ 85 0 0 28 Jan 2025
Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning Zhen Li Yupeng Su Runming Yang C. Xie Zehua Wang Zhongwei Xie Ngai Wong Hongxia Yang MQ LRM 56 3 0 06 Jan 2025
AutoQNN: An End-to-End Framework for Automatically Quantizing Neural Networks Cheng Gong Ye Lu Surong Dai Deng Qian Chenkun Du Tao Li MQ 29 0 0 07 Apr 2023
Distributed Machine Learning for UAV Swarms: Computing, Sensing, and Semantics Yahao Ding Zhaohui Yang Viet Quoc Pham Zhaoyang Zhang M. Shikh-Bahaei 36 31 0 03 Jan 2023
Adaptive Low-Precision Training for Embeddings in Click-Through Rate Prediction Shiwei Li Huifeng Guo Luyao Hou Wei Zhang Xing Tang Ruiming Tang Rui Zhang Rui Li MQ 135 9 0 12 Dec 2022
On-device Training: A First Overview on Existing Systems Shuai Zhu Thiemo Voigt Jeonggil Ko Fatemeh Rahimian 34 14 0 01 Dec 2022
MinUn: Accurate ML Inference on Microcontrollers Shikhar Jaiswal R. Goli Aayan Kumar Vivek Seshadri Rahul Sharma 29 2 0 29 Oct 2022
Mixed-Precision Neural Networks: A Survey M. Rakka M. Fouda Pramod P. Khargonekar Fadi J. Kurdahi MQ 25 11 0 11 Aug 2022
Combinatorial optimization for low bit-width neural networks Hanxu Zhou Aida Ashrafi Matthew B. Blaschko MQ 24 0 0 04 Jun 2022
Energy awareness in low precision neural networks Nurit Spingarn-Eliezer Ron Banner Elad Hoffer Hilla Ben-Yaacov T. Michaeli 38 0 0 06 Feb 2022
The Ecological Footprint of Neural Machine Translation Systems D. Shterionov Eva Vanmassenhove 40 3 0 04 Feb 2022
CBP: Backpropagation with constraint on weight precision using a pseudo-Lagrange multiplier method Guhyun Kim D. Jeong MQ 50 2 0 06 Oct 2021
Learning Gradual Argumentation Frameworks using Genetic Algorithms J. Spieler Nico Potyka Steffen Staab AI4CE 36 4 0 25 Jun 2021
Distributed Learning in Wireless Networks: Recent Progress and Future Challenges Mingzhe Chen Deniz Gündüz Kaibin Huang Walid Saad M. Bennis Aneta Vulgarakis Feljan H. Vincent Poor 42 402 0 05 Apr 2021
Pruning and Quantization for Deep Neural Network Acceleration: A Survey Tailin Liang C. Glossner Lei Wang Shaobo Shi Xiaotong Zhang MQ 150 675 0 24 Jan 2021
ShiftAddNet: A Hardware-Inspired Deep Network Haoran You Xiaohan Chen Yongan Zhang Chaojian Li Sicheng Li Zihao Liu Zhangyang Wang Yingyan Lin OOD MQ 76 76 0 24 Oct 2020
TernaryBERT: Distillation-aware Ultra-low Bit BERT Wei Zhang Lu Hou Yichun Yin Lifeng Shang Xiao Chen Xin Jiang Qun Liu MQ 33 208 0 27 Sep 2020
Resource-Efficient Neural Networks for Embedded Systems Wolfgang Roth Günther Schindler Lukas Pfeifenberger Robert Peharz Sebastian Tschiatschek Holger Fröning Franz Pernkopf Zoubin Ghahramani 34 47 0 07 Jan 2020
An Efficient Hardware-Oriented Dropout Algorithm Y. J. Yeoh Takashi Morie H. Tamukoh 13 2 0 14 Nov 2019
On-Device Machine Learning: An Algorithms and Learning Theory Perspective Sauptik Dhar Junyao Guo Jiayi Liu S. Tripathi Unmesh Kurup Mohak Shah 28 141 0 02 Nov 2019
LUTNet: Learning FPGA Configurations for Highly Efficient Neural Network Inference Erwei Wang James J. Davis P. Cheung George A. Constantinides MQ 9 41 0 24 Oct 2019
Fully Quantized Transformer for Machine Translation Gabriele Prato Ella Charlaix Mehdi Rezagholizadeh MQ 13 68 0 17 Oct 2019
REQ-YOLO: A Resource-Aware, Efficient Quantization Framework for Object Detection on FPGAs Caiwen Ding Shuo Wang Ning Liu Kaidi Xu Yanzhi Wang Yun Liang MQ 24 89 0 29 Sep 2019
Accurate and Compact Convolutional Neural Networks with Trained Binarization Zhe Xu R. Cheung MQ 27 54 0 25 Sep 2019
TiM-DNN: Ternary in-Memory accelerator for Deep Neural Networks Shubham Jain S. Gupta A. Raghunathan MQ 30 37 0 15 Sep 2019
Compressing RNNs for IoT devices by 15-38x using Kronecker Products Urmish Thakker Jesse G. Beu Dibakar Gope Chu Zhou Igor Fedorov Ganesh S. Dasika Matthew Mattina 27 36 0 07 Jun 2019
Attention Based Pruning for Shift Networks G. B. Hacene Carlos Lassance Vincent Gripon Matthieu Courbariaux Yoshua Bengio 41 25 0 29 May 2019
Online Embedding Compression for Text Classification using Low Rank Matrix Factorization Anish Acharya Rahul Goel A. Metallinou Inderjit Dhillon 22 58 0 01 Nov 2018
Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks Shivang Agarwal Jean Ogier du Terrail F. Jurie ObjD 24 123 0 10 Sep 2018
A Survey on Methods and Theories of Quantized Neural Networks Yunhui Guo MQ 29 232 0 13 Aug 2018
SYQ: Learning Symmetric Quantization For Efficient Deep Neural Networks Julian Faraone Nicholas J. Fraser Michaela Blott Philip H. W. Leong MQ 33 133 0 01 Jul 2018
Binary Ensemble Neural Network: More Bits per Network or More Networks per Bit? Shilin Zhu Xin Dong Hao Su MQ 30 135 0 20 Jun 2018
Adding New Tasks to a Single Network with Weight Transformations using Binary Masks Massimiliano Mancini Elisa Ricci Barbara Caputo Samuel Rota Buló 25 51 0 28 May 2018
The History Began from AlexNet: A Comprehensive Survey on Deep Learning Approaches Md. Zahangir Alom T. Taha C. Yakopcic Stefan Westberg P. Sidike Mst Shamima Nasrin B. Van Essen A. Awwal V. Asari VLM 29 874 0 03 Mar 2018
Loss-aware Weight Quantization of Deep Networks Lu Hou James T. Kwok MQ 35 127 0 23 Feb 2018
ADaPTION: Toolbox and Benchmark for Training Convolutional Neural Networks with Reduced Numerical Precision Weights and Activation Moritz B. Milde Daniel Neil Alessandro Aimar T. Delbruck Giacomo Indiveri MQ 29 9 0 13 Nov 2017
A Survey of Model Compression and Acceleration for Deep Neural Networks Yu Cheng Duo Wang Pan Zhou Zhang Tao 40 1,087 0 23 Oct 2017
Tracking Persons-of-Interest via Unsupervised Representation Adaptation Shun Zhang Jia-Bin Huang Jongwoo Lim Yihong Gong Jinjun Wang Narendra Ahuja Ming-Hsuan Yang CVBM 31 30 0 05 Oct 2017
Machine Learning Models that Remember Too Much Congzheng Song Thomas Ristenpart Vitaly Shmatikov VLM 30 505 0 22 Sep 2017
WRPN: Wide Reduced-Precision Networks Asit K. Mishra Eriko Nurvitadhi Jeffrey J. Cook Debbie Marr MQ 39 266 0 04 Sep 2017
BitNet: Bit-Regularized Deep Neural Networks Aswin Raghavan Mohamed R. Amer S. Chai Graham Taylor MQ 38 10 0 16 Aug 2017
SEP-Nets: Small and Effective Pattern Networks Zhe Li Xiaoyu Wang Xutao Lv Tianbao Yang 30 12 0 13 Jun 2017
Deep Learning with Low Precision by Half-wave Gaussian Quantization Zhaowei Cai Xiaodong He Jian Sun Nuno Vasconcelos MQ 41 502 0 03 Feb 2017
Towards the Limit of Network Quantization Yoojin Choi Mostafa El-Khamy Jungwon Lee MQ 22 191 0 05 Dec 2016
Trained Ternary Quantization Chenzhuo Zhu Song Han Huizi Mao W. Dally MQ 57 1,035 0 04 Dec 2016
LCNN: Lookup-based Convolutional Neural Network Hessam Bagherinezhad Mohammad Rastegari Ali Farhadi 13 89 0 20 Nov 2016
Sparsely-Connected Neural Networks: Towards Efficient VLSI Implementation of Deep Neural Networks A. Ardakani C. Condo W. Gross 33 40 0 04 Nov 2016
Accelerating Deep Convolutional Networks using low-precision and sparsity Ganesh Venkatesh Eriko Nurvitadhi Debbie Marr 26 135 0 02 Oct 2016
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations Itay Hubara Matthieu Courbariaux Daniel Soudry Ran El-Yaniv Yoshua Bengio MQ 42 1,842 0 22 Sep 2016
Ternary Neural Networks for Resource-Efficient AI Applications Hande Alemdar V. Leroy Adrien Prost-Boucle F. Pétrot 24 204 0 01 Sep 2016