2311.11883
Cited By
Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review
20 November 2023
M. Lê
Pierre Wolinski
Julyan Arbel
Papers citing
"Efficient Neural Networks for Tiny Machine Learning: A Comprehensive Review"
50 / 70 papers shown
Title
Edge Impulse: An MLOps Platform for Tiny Machine Learning
Shawn Hymel
Colby R. Banbury
Daniel Situnayake
A. Elium
Carl Ward
...
Louis Moreau
Dmitry Maslov
A. Beavis
Jan Jongboom
Vijay Janapa Reddi
VLM
LRM
91
99
0
02 Nov 2022
QReg: On Regularization Effects of Quantization
Mohammadhossein Askarihemmat
Reyhane Askari Hemmat
Alexander Hoffman
Ivan Lazarevich
Ehsan Saboori
Olivier Mastropietro
Sudhakar Sah
Yvon Savaria
J. David
MQ
79
5
0
24 Jun 2022
Machine Learning Operations (MLOps): Overview, Definition, and Architecture
Dominik Kreuzberger
Niklas Kühl
Sebastian Hirschl
VLM
AI4CE
65
354
0
04 May 2022
TinyMLOps: Operational Challenges for Widespread Edge AI Adoption
Sam Leroux
Pieter Simoens
Meelis Lootus
Kartik Thakore
Akshay Sharma
50
16
0
21 Mar 2022
On the Existence of Universal Lottery Tickets
R. Burkholz
Nilanjana Laha
Rajarshi Mukherjee
Alkis Gotovos
UQCV
74
33
0
22 Nov 2021
Training Deep Neural Networks with Joint Quantization and Pruning of Weights and Activations
Xinyu Zhang
Ian Colbert
Ken Kreutz-Delgado
Srinjoy Das
MQ
86
12
0
15 Oct 2021
The Bayesian Learning Rule
Mohammad Emtiyaz Khan
Håvard Rue
BDL
130
81
0
09 Jul 2021
A comparison of LSTM and GRU networks for learning symbolic sequences
Roberto Cahuantzi
Xinye Chen
S. Güttel
86
138
0
05 Jul 2021
MLPerf Tiny Benchmark
Colby R. Banbury
Vijay Janapa Reddi
P. Torelli
J. Holleman
Nat Jeffries
...
Videet Parekh
Honson Tran
Nhan Tran
Niu Wenxu
Xu Xuesong
VLM
109
190
0
14 Jun 2021
A Survey of Transformers
Tianyang Lin
Yuxin Wang
Xiangyang Liu
Xipeng Qiu
ViT
172
1,131
0
08 Jun 2021
Quantization and Deployment of Deep Neural Networks on Microcontrollers
Pierre-Emmanuel Novac
G. B. Hacene
Alain Pegatoquet
Benoit Miramond
Vincent Gripon
MQ
59
122
0
27 May 2021
Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks
Torsten Hoefler
Dan Alistarh
Tal Ben-Nun
Nikoli Dryden
Alexandra Peste
MQ
314
725
0
31 Jan 2021
Are wider nets better given the same number of parameters?
A. Golubeva
Behnam Neyshabur
Guy Gur-Ari
82
44
0
27 Oct 2020
μNAS: Constrained Neural Architecture Search for Microcontrollers
Edgar Liberis
Łukasz Dudziak
Nicholas D. Lane
BDL
63
105
0
27 Oct 2020
TensorFlow Lite Micro: Embedded Machine Learning on TinyML Systems
R. David
Jared Duke
Advait Jain
Vijay Janapa Reddi
Nat Jeffries
...
Meghna Natraj
Shlomi Regev
Rocky Rhodes
Tiezhen Wang
Pete Warden
245
481
0
17 Oct 2020
Hardware Aware Training for Efficient Keyword Spotting on General Purpose and Specialized Hardware
Peter Blouw
G. Malik
Benjamin Morcos
Aaron R. Voelker
C. Eliasmith
57
21
0
09 Sep 2020
TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices
A. Wong
M. Famouri
Maya Pavlova
Siddharth Surana
112
33
0
10 Aug 2020
MCUNet: Tiny Deep Learning on IoT Devices
Ji Lin
Wei-Ming Chen
Chengyue Wu
J. Cohn
Chuang Gan
Song Han
157
493
0
20 Jul 2020
Efficient Neural Network Deployment for Microcontroller
Hasan Unlu
24
15
0
02 Jul 2020
TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids
Igor Fedorov
Marko Stamenovic
Carl R. Jensen
Li-Chia Yang
Ari Mandell
Yiming Gan
Matthew Mattina
P. Whatmough
56
98
0
20 May 2020
Integer Quantization for Deep Learning Inference: Principles and Empirical Evaluation
Hao Wu
Patrick Judd
Xiaojie Zhang
Mikhail Isaev
Paulius Micikevicius
MQ
97
362
0
20 Apr 2020
LSQ+: Improving low-bit quantization through learnable offsets and better initialization
Yash Bhalgat
Jinwon Lee
Markus Nagel
Tijmen Blankevoort
Nojun Kwak
MQ
65
222
0
20 Apr 2020
Binary Neural Networks: A Survey
Haotong Qin
Ruihao Gong
Xianglong Liu
Xiao Bai
Jingkuan Song
N. Sebe
MQ
129
471
0
31 Mar 2020
Training Binary Neural Networks using the Bayesian Learning Rule
Xiangming Meng
Roman Bachmann
Mohammad Emtiyaz Khan
BDL
MQ
67
42
0
25 Feb 2020
Post-Training Piecewise Linear Quantization for Deep Neural Networks
Jun Fang
Ali Shafiee
Hamzah Abdel-Aziz
D. Thorsley
Georgios Georgiadis
Joseph Hassoun
MQ
78
147
0
31 Jan 2020
ZeroQ: A Novel Zero Shot Quantization Framework
Yaohui Cai
Z. Yao
Zhen Dong
A. Gholami
Michael W. Mahoney
Kurt Keutzer
MQ
101
399
0
01 Jan 2020
What's Hidden in a Randomly Weighted Neural Network?
Vivek Ramanujan
Mitchell Wortsman
Aniruddha Kembhavi
Ali Farhadi
Mohammad Rastegari
66
361
0
29 Nov 2019
Neural networks on microcontrollers: saving memory at inference via operator reordering
Edgar Liberis
Nicholas D. Lane
58
46
0
02 Oct 2019
Theoretical Issues in Deep Networks: Approximation, Optimization and Generalization
T. Poggio
Andrzej Banburski
Q. Liao
ODL
118
165
0
25 Aug 2019
Time Matters in Regularizing Deep Networks: Weight Decay and Data Augmentation Affect Early Learning Dynamics, Matter Little Near Convergence
Aditya Golatkar
Alessandro Achille
Stefano Soatto
80
97
0
30 May 2019
SpArSe: Sparse Architecture Search for CNNs on Resource-Constrained Microcontrollers
Igor Fedorov
Ryan P. Adams
Matthew Mattina
P. Whatmough
84
168
0
28 May 2019
EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks
Mingxing Tan
Quoc V. Le
3DV
MedIm
167
18,193
0
28 May 2019
Searching for MobileNetV3
Andrew G. Howard
Mark Sandler
Grace Chu
Liang-Chieh Chen
Bo Chen
...
Yukun Zhu
Ruoming Pang
Vijay Vasudevan
Quoc V. Le
Hartwig Adam
376
6,811
0
06 May 2019
The State of Sparsity in Deep Neural Networks
Trevor Gale
Erich Elsen
Sara Hooker
163
763
0
25 Feb 2019
A Survey of the Recent Architectures of Deep Convolutional Neural Networks
Asifullah Khan
A. Sohail
Umme Zahoora
Aqsa Saeed Qureshi
OOD
121
2,310
0
17 Jan 2019
Quantization for Rapid Deployment of Deep Neural Networks
J. Lee
Sangwon Ha
Saerom Choi
Won-Jo Lee
Seungwon Lee
MQ
57
49
0
12 Oct 2018
Rethinking the Value of Network Pruning
Zhuang Liu
Mingjie Sun
Tinghui Zhou
Gao Huang
Trevor Darrell
38
1,475
0
11 Oct 2018
Relaxed Quantization for Discretized Neural Networks
Christos Louizos
M. Reisser
Tijmen Blankevoort
E. Gavves
Max Welling
MQ
101
132
0
03 Oct 2018
Learning Overparameterized Neural Networks via Stochastic Gradient Descent on Structured Data
Yuanzhi Li
Yingyu Liang
MLT
222
653
0
03 Aug 2018
MnasNet: Platform-Aware Neural Architecture Search for Mobile
Mingxing Tan
Bo Chen
Ruoming Pang
Vijay Vasudevan
Mark Sandler
Andrew G. Howard
Quoc V. Le
MQ
128
3,018
0
31 Jul 2018
ResNet with one-neuron hidden layers is a Universal Approximator
Hongzhou Lin
Stefanie Jegelka
113
229
0
28 Jun 2018
Deep k-Means: Re-Training and Parameter Sharing with Harder Cluster Assignments for Compressing Deep Convolutions
Junru Wu
Yue Wang
Zhenyu Wu
Zhangyang Wang
Ashok Veeraraghavan
Yingyan Lin
59
115
0
24 Jun 2018
Understanding Batch Normalization
Johan Bjorck
Carla P. Gomes
B. Selman
Kilian Q. Weinberger
159
615
0
01 Jun 2018
PACT: Parameterized Clipping Activation for Quantized Neural Networks
Jungwook Choi
Zhuo Wang
Swagath Venkataramani
P. Chuang
Vijayalakshmi Srinivasan
K. Gopalakrishnan
MQ
75
955
0
16 May 2018
The Lottery Ticket Hypothesis: Finding Sparse, Trainable Neural Networks
Jonathan Frankle
Michael Carbin
272
3,488
0
09 Mar 2018
CMSIS-NN: Efficient Neural Network Kernels for Arm Cortex-M CPUs
Liangzhen Lai
Naveen Suda
Vikas Chandra
77
381
0
19 Jan 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
213
19,335
0
13 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
164
3,143
0
15 Dec 2017
Hello Edge: Keyword Spotting on Microcontrollers
Yundong Zhang
Naveen Suda
Liangzhen Lai
Vikas Chandra
87
436
0
20 Nov 2017
To prune, or not to prune: exploring the efficacy of pruning for model compression
Michael Zhu
Suyog Gupta
197
1,282
0
05 Oct 2017