Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review

20 November 2023

Papers citing "Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review"

50 / 70 papers shown

Title
Edge Impulse: An MLOps Platform for Tiny Machine Learning Shawn Hymel Colby R. Banbury Daniel Situnayake A. Elium Carl Ward ... Louis Moreau Dmitry Maslov A. Beavis Jan Jongboom Vijay Janapa Reddi VLM LRM 91 99 0 02 Nov 2022
QReg: On Regularization Effects of Quantization Mohammadhossein Askarihemmat Reyhane Askari Hemmat Alexander Hoffman Ivan Lazarevich Ehsan Saboori Olivier Mastropietro Sudhakar Sah Yvon Savaria J. David MQ 79 5 0 24 Jun 2022
Machine Learning Operations (MLOps): Overview, Definition, and Architecture Dominik Kreuzberger Niklas Kühl Sebastian Hirschl VLM AI4CE 65 354 0 04 May 2022
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption Sam Leroux Pieter Simoens Meelis Lootus Kartik Thakore Akshay Sharma 50 16 0 21 Mar 2022
On the Existence of Universal Lottery Tickets R. Burkholz Nilanjana Laha Rajarshi Mukherjee Alkis Gotovos UQCV 74 33 0 22 Nov 2021
Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations Xinyu Zhang Ian Colbert Ken Kreutz-Delgado Srinjoy Das MQ 86 12 0 15 Oct 2021
The Bayesian Learning Rule Mohammad Emtiyaz Khan Håvard Rue BDL 130 81 0 09 Jul 2021
A comparison of LSTM and GRU networks for learning symbolic sequences Roberto Cahuantzi Xinye Chen S. Güttel 86 138 0 05 Jul 2021
MLPerf Tiny Benchmark Colby R. Banbury Vijay Janapa Reddi P. Torelli J. Holleman Nat Jeffries ... Videet Parekh Honson Tran Nhan Tran Niu Wenxu Xu Xuesong VLM 109 190 0 14 Jun 2021
A Survey of Transformers Tianyang Lin Yuxin Wang Xiangyang Liu Xipeng Qiu ViT 172 1,131 0 08 Jun 2021
Quantization and Deployment of Deep Neural Networks on Microcontrollers Pierre-Emmanuel Novac G. B. Hacene Alain Pegatoquet Benoit Miramond Vincent Gripon MQ 59 122 0 27 May 2021
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks Torsten Hoefler Dan Alistarh Tal Ben-Nun Nikoli Dryden Alexandra Peste MQ 314 725 0 31 Jan 2021
Are wider nets better given the same number of parameters? A. Golubeva Behnam Neyshabur Guy Gur-Ari 82 44 0 27 Oct 2020
$μ$ NAS: Constrained Neural Architecture Search for Microcontrollers Edgar Liberis Łukasz Dudziak Nicholas D. Lane BDL 63 105 0 27 Oct 2020
TensorFlow Lite Micro: Embedded Machine Learning on TinyML Systems R. David Jared Duke Advait Jain Vijay Janapa Reddi Nat Jeffries ... Meghna Natraj Shlomi Regev Rocky Rhodes Tiezhen Wang Pete Warden 245 481 0 17 Oct 2020
Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware Peter Blouw G. Malik Benjamin Morcos Aaron R. Voelker C. Eliasmith 57 21 0 09 Sep 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices A. Wong M. Famouri Maya Pavlova Siddharth Surana 112 33 0 10 Aug 2020
MCUNet: Tiny Deep Learning on IoT Devices Ji Lin Wei-Ming Chen Chengyue Wu J. Cohn Chuang Gan Song Han 157 493 0 20 Jul 2020
Efficient Neural Network Deployment for Microcontroller Hasan Unlu 24 15 0 02 Jul 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids Igor Fedorov Marko Stamenovic Carl R. Jensen Li-Chia Yang Ari Mandell Yiming Gan Matthew Mattina P. Whatmough 56 98 0 20 May 2020
Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation Hao Wu Patrick Judd Xiaojie Zhang Mikhail Isaev Paulius Micikevicius MQ 97 362 0 20 Apr 2020
LSQ+: Improving low-bit quantization through learnable offsets and better initialization Yash Bhalgat Jinwon Lee Markus Nagel Tijmen Blankevoort Nojun Kwak MQ 65 222 0 20 Apr 2020
Binary Neural Networks: A Survey Haotong Qin Ruihao Gong Xianglong Liu Xiao Bai Jingkuan Song N. Sebe MQ 129 471 0 31 Mar 2020
Training Binary Neural Networks using the Bayesian Learning Rule Xiangming Meng Roman Bachmann Mohammad Emtiyaz Khan BDL MQ 67 42 0 25 Feb 2020
Post-Training Piecewise Linear Quantization for Deep Neural Networks Jun Fang Ali Shafiee Hamzah Abdel-Aziz D. Thorsley Georgios Georgiadis Joseph Hassoun MQ 78 147 0 31 Jan 2020
ZeroQ: A Novel Zero Shot Quantization Framework Yaohui Cai Z. Yao Zhen Dong A. Gholami Michael W. Mahoney Kurt Keutzer MQ 101 399 0 01 Jan 2020
What's Hidden in a Randomly Weighted Neural Network? Vivek Ramanujan Mitchell Wortsman Aniruddha Kembhavi Ali Farhadi Mohammad Rastegari 66 361 0 29 Nov 2019
Neural networks on microcontrollers: saving memory at inference via operator reordering Edgar Liberis Nicholas D. Lane 58 46 0 02 Oct 2019
Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization T. Poggio Andrzej Banburski Q. Liao ODL 118 165 0 25 Aug 2019
Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence Aditya Golatkar Alessandro Achille Stefano Soatto 80 97 0 30 May 2019
SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers Igor Fedorov Ryan P. Adams Matthew Mattina P. Whatmough 84 168 0 28 May 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks Mingxing Tan Quoc V. Le 3DV MedIm 167 18,193 0 28 May 2019
Searching for MobileNetV3 Andrew G. Howard Mark Sandler Grace Chu Liang-Chieh Chen Bo Chen ... Yukun Zhu Ruoming Pang Vijay Vasudevan Quoc V. Le Hartwig Adam 376 6,811 0 06 May 2019
The State of Sparsity in Deep Neural Networks Trevor Gale Erich Elsen Sara Hooker 163 763 0 25 Feb 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks Asifullah Khan A. Sohail Umme Zahoora Aqsa Saeed Qureshi OOD 121 2,310 0 17 Jan 2019
Quantization for Rapid Deployment of Deep Neural Networks J. Lee Sangwon Ha Saerom Choi Won-Jo Lee Seungwon Lee MQ 57 49 0 12 Oct 2018
Rethinking the Value of Network Pruning Zhuang Liu Mingjie Sun Tinghui Zhou Gao Huang Trevor Darrell 38 1,475 0 11 Oct 2018
Relaxed Quantization for Discretized Neural Networks Christos Louizos M. Reisser Tijmen Blankevoort E. Gavves Max Welling MQ 101 132 0 03 Oct 2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data Yuanzhi Li Yingyu Liang MLT 222 653 0 03 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile Mingxing Tan Bo Chen Ruoming Pang Vijay Vasudevan Mark Sandler Andrew G. Howard Quoc V. Le MQ 128 3,018 0 31 Jul 2018
ResNet with one-neuron hidden layers is a Universal Approximator Hongzhou Lin Stefanie Jegelka 113 229 0 28 Jun 2018
Deep $k$ -Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions Junru Wu Yue Wang Zhenyu Wu Zhangyang Wang Ashok Veeraraghavan Yingyan Lin 59 115 0 24 Jun 2018
Understanding Batch Normalization Johan Bjorck Carla P. Gomes B. Selman Kilian Q. Weinberger 159 615 0 01 Jun 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks Jungwook Choi Zhuo Wang Swagath Venkataramani P. Chuang Vijayalakshmi Srinivasan K. Gopalakrishnan MQ 75 955 0 16 May 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks Jonathan Frankle Michael Carbin 272 3,488 0 09 Mar 2018
CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs Liangzhen Lai Naveen Suda Vikas Chandra 77 381 0 19 Jan 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks Mark Sandler Andrew G. Howard Menglong Zhu A. Zhmoginov Liang-Chieh Chen 213 19,335 0 13 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference Benoit Jacob S. Kligys Bo Chen Menglong Zhu Matthew Tang Andrew G. Howard Hartwig Adam Dmitry Kalenichenko MQ 164 3,143 0 15 Dec 2017
Hello Edge: Keyword Spotting on Microcontrollers Yundong Zhang Naveen Suda Liangzhen Lai Vikas Chandra 87 436 0 20 Nov 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression Michael Zhu Suyog Gupta 197 1,282 0 05 Oct 2017