ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2008.06402
  4. Cited By
SPINN: Synergistic Progressive Inference of Neural Networks over Device
  and Cloud

SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

14 August 2020
Stefanos Laskaridis
Stylianos I. Venieris
Mario Almeida
Ilias Leontiadis
Nicholas D. Lane
ArXivPDFHTML

Papers citing "SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud"

45 / 45 papers shown
Title
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks
A. Hojjat
Janek Haberer
Tayyaba Zainab
Olaf Landsiedel
67
3
0
18 Apr 2025
AI-Powered Urban Transportation Digital Twin: Methods and Applications
AI-Powered Urban Transportation Digital Twin: Methods and Applications
Xuan Di
Yongjie Fu
Mehmet K.Turkcan
Mahshid Ghasemi
Zhaobin Mo
Chengbo Zang
Abhishek Adhikari
Z. Kostić
Gil Zussman
AI4CE
90
0
0
30 Dec 2024
Tiny Models are the Computational Saver for Large Models
Tiny Models are the Computational Saver for Large Models
Qingyuan Wang
B. Cardiff
Antoine Frappé
Benoît Larras
Deepu John
73
2
0
26 Mar 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Emad Fallahzadeh
Bram Adams
Ahmed E. Hassan
MQ
64
3
0
25 Mar 2024
Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables
Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables
Taesik Gong
S. Jang
Utku Günay Acer
F. Kawsar
Chulhong Min
70
2
0
11 Dec 2023
HAPI: Hardware-Aware Progressive Inference
HAPI: Hardware-Aware Progressive Inference
Stefanos Laskaridis
Stylianos I. Venieris
Hyeji Kim
Nicholas D. Lane
45
46
0
10 Aug 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference
Ji Xin
Raphael Tang
Jaejun Lee
Yaoliang Yu
Jimmy J. Lin
42
370
0
27 Apr 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke
Sam Gross
Francisco Massa
Adam Lerer
James Bradbury
...
Sasank Chilamkurthy
Benoit Steiner
Lu Fang
Junjie Bai
Soumith Chintala
ODL
256
42,038
0
03 Dec 2019
AI Benchmark: All About Deep Learning on Smartphones in 2019
AI Benchmark: All About Deep Learning on Smartphones in 2019
Andrey D. Ignatov
Radu Timofte
Andrei Kulik
Seungsoo Yang
Ke Wang
Felix Baum
Max Wu
Lirong Xu
Luc Van Gool
ELM
39
220
0
15 Oct 2019
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge
  Computing
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
36
620
0
04 Oct 2019
Neural Network Inference on Mobile SoCs
Neural Network Inference on Mobile SoCs
Siqi Wang
A. Pathania
T. Mitra
39
86
0
24 Aug 2019
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous
  Mobile Processors
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors
Royson Lee
Stylianos I. Venieris
Łukasz Dudziak
S. Bhattacharya
Nicholas D. Lane
SupR
36
95
0
21 Aug 2019
Improved Techniques for Training Adaptive Deep Networks
Improved Techniques for Training Adaptive Deep Networks
Hao Li
Hong Zhang
Xiaojuan Qi
Ruigang Yang
Gao Huang
48
132
0
17 Aug 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
And the Bit Goes Down: Revisiting the Quantization of Neural Networks
Pierre Stock
Armand Joulin
Rémi Gribonval
Benjamin Graham
Hervé Jégou
MQ
54
149
0
12 Jul 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient
  Models
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models
Linfeng Zhang
Zhanhong Tan
Jiebo Song
Jingwei Chen
Chenglong Bao
Kaisheng Ma
28
71
0
27 May 2019
EmBench: Quantifying Performance Variations of Deep Neural Networks
  across Modern Commodity Devices
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices
Mario Almeida
Stefanos Laskaridis
Ilias Leontiadis
Stylianos I. Venieris
Nicholas D. Lane
33
74
0
17 May 2019
Be Your Own Teacher: Improve the Performance of Convolutional Neural
  Networks via Self Distillation
Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation
Linfeng Zhang
Jiebo Song
Anni Gao
Jingwei Chen
Chenglong Bao
Kaisheng Ma
FedML
48
852
0
17 May 2019
JALAD: Joint Accuracy- and Latency-Aware Deep Structure Decoupling for
  Edge-Cloud Execution
JALAD: Joint Accuracy- and Latency-Aware Deep Structure Decoupling for Edge-Cloud Execution
Hongshan Li
Chenghao Hu
Jingyan Jiang
Zhi Wang
Yonggang Wen
Wenwu Zhu
65
133
0
25 Dec 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for
  Continuous Mobile Vision
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
53
266
0
23 Oct 2018
Optimizing CNN Model Inference on CPUs
Optimizing CNN Model Inference on CPUs
Yizhi Liu
Yao Wang
Ruofei Yu
Mu Li
Vin Sharma
Yida Wang
28
152
0
07 Sep 2018
Anatomy Of High-Performance Deep Learning Convolutions On SIMD
  Architectures
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures
E. Georganas
Sasikanth Avancha
K. Banerjee
Dhiraj D. Kalamkar
G. Henry
Hans Pabst
A. Heinecke
BDL
31
105
0
16 Aug 2018
CascadeCNN: Pushing the performance limits of quantisation
CascadeCNN: Pushing the performance limits of quantisation
Alexandros Kouris
Stylianos I. Venieris
C. Bouganis
MQ
39
24
0
22 May 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent
  Mobile Cloud Computing Services
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services
Amir Erfan Eshratifar
M. Abrishami
Massoud Pedram
FedML
48
251
0
25 Jan 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks
MobileNetV2: Inverted Residuals and Linear Bottlenecks
Mark Sandler
Andrew G. Howard
Menglong Zhu
A. Zhmoginov
Liang-Chieh Chen
143
19,124
0
13 Jan 2018
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Focus: Querying Large Video Datasets with Low Latency and Low Cost
Kevin Hsieh
Ganesh Ananthanarayanan
P. Bodík
P. Bahl
Matthai Philipose
Phillip B. Gibbons
O. Mutlu
48
275
0
10 Jan 2018
Quantization and Training of Neural Networks for Efficient
  Integer-Arithmetic-Only Inference
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference
Benoit Jacob
S. Kligys
Bo Chen
Menglong Zhu
Matthew Tang
Andrew G. Howard
Hartwig Adam
Dmitry Kalenichenko
MQ
124
3,090
0
15 Dec 2017
MIT Advanced Vehicle Technology Study: Large-Scale Naturalistic Driving
  Study of Driver Behavior and Interaction with Automation
MIT Advanced Vehicle Technology Study: Large-Scale Naturalistic Driving Study of Driver Behavior and Interaction with Automation
Lex Fridman
Daniel E. Brown
Michael Glazer
William Angell
Spencer Dodd
...
Anthony Pettinato
B. Seppelt
Linda S. Angell
Bruce Mehler
B. Reimer
39
193
0
19 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with
  A Neural Network Compiler
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler
Yu Ji
Youhui Zhang
Wenguang Chen
Yuan Xie
44
56
0
15 Nov 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
UQCV
52
1,131
0
06 Sep 2017
Distributed Deep Neural Networks over the Cloud, the Edge and End
  Devices
Distributed Deep Neural Networks over the Cloud, the Edge and End Devices
Surat Teerapittayanon
Bradley McDanel
H. T. Kung
FedML
67
714
0
06 Sep 2017
On Calibration of Modern Neural Networks
On Calibration of Modern Neural Networks
Chuan Guo
Geoff Pleiss
Yu Sun
Kilian Q. Weinberger
UQCV
197
5,774
0
14 Jun 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural
  Networks for Environmental Awareness
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness
Nikolai Smolyanskiy
A. Kamenev
Jeffrey Smith
Stan Birchfield
86
223
0
07 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep
  Neural Networks
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks
Minsoo Rhu
Mike O'Connor
Niladrish Chatterjee
Jeff Pool
S. Keckler
42
176
0
03 May 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit
In-Datacenter Performance Analysis of a Tensor Processing Unit
N. Jouppi
C. Young
Nishant Patil
David Patterson
Gaurav Agrawal
...
Vijay Vasudevan
Richard Walter
Walter Wang
Eric Wilcox
Doe Hyun Yoon
178
4,619
0
16 Apr 2017
NoScope: Optimizing Neural Network Queries over Video at Scale
NoScope: Optimizing Neural Network Queries over Video at Scale
Daniel Kang
John Emmons
Firas Abuzaid
Peter Bailis
Matei A. Zaharia
55
205
0
07 Mar 2017
Incremental Network Quantization: Towards Lossless CNNs with
  Low-Precision Weights
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights
Aojun Zhou
Anbang Yao
Yiwen Guo
Lin Xu
Yurong Chen
MQ
380
1,050
0
10 Feb 2017
Quantized Neural Networks: Training Neural Networks with Low Precision
  Weights and Activations
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations
Itay Hubara
Matthieu Courbariaux
Daniel Soudry
Ran El-Yaniv
Yoshua Bengio
MQ
96
1,859
0
22 Sep 2016
On the Expressive Power of Deep Neural Networks
On the Expressive Power of Deep Neural Networks
M. Raghu
Ben Poole
Jon M. Kleinberg
Surya Ganguli
Jascha Narain Sohl-Dickstein
49
780
0
16 Jun 2016
TensorFlow: A system for large-scale machine learning
TensorFlow: A system for large-scale machine learning
Martín Abadi
P. Barham
Jianmin Chen
Zhiwen Chen
Andy Davis
...
Vijay Vasudevan
Pete Warden
Martin Wicke
Yuan Yu
Xiaoqiang Zhang
GNN
AI4CE
336
18,300
0
27 May 2016
Inception-v4, Inception-ResNet and the Impact of Residual Connections on
  Learning
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning
Christian Szegedy
Sergey Ioffe
Vincent Vanhoucke
Alexander A. Alemi
291
14,196
0
23 Feb 2016
Deep Residual Learning for Image Recognition
Deep Residual Learning for Image Recognition
Kaiming He
Xinming Zhang
Shaoqing Ren
Jian Sun
MedIm
1.4K
192,638
0
10 Dec 2015
Rethinking the Inception Architecture for Computer Vision
Rethinking the Inception Architecture for Computer Vision
Christian Szegedy
Vincent Vanhoucke
Sergey Ioffe
Jonathon Shlens
Z. Wojna
3DV
BDL
478
27,231
0
02 Dec 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained
  Quantization and Huffman Coding
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding
Song Han
Huizi Mao
W. Dally
3DGS
203
8,793
0
01 Oct 2015
Empirical Evaluation of Rectified Activations in Convolutional Network
Empirical Evaluation of Rectified Activations in Convolutional Network
Bing Xu
Naiyan Wang
Tianqi Chen
Mu Li
112
2,901
0
05 May 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition
Very Deep Convolutional Networks for Large-Scale Image Recognition
Karen Simonyan
Andrew Zisserman
FAtt
MDE
928
99,991
0
04 Sep 2014
1