SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud

14 August 2020

Stefanos Laskaridis

Stylianos I. Venieris

Papers citing "SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud"

45 / 45 papers shown

Title
LimitNet: Progressive, Content-Aware Image Offloading for Extremely Weak Devices & Networks A. Hojjat Janek Haberer Tayyaba Zainab Olaf Landsiedel 67 3 0 18 Apr 2025
AI-Powered Urban Transportation Digital Twin: Methods and Applications Xuan Di Yongjie Fu Mehmet K.Turkcan Mahshid Ghasemi Zhaobin Mo Chengbo Zang Abhishek Adhikari Z. Kostić Gil Zussman AI4CE 90 0 0 30 Dec 2024
Tiny Models are the Computational Saver for Large Models Qingyuan Wang B. Cardiff Antoine Frappé Benoît Larras Deepu John 73 2 0 26 Mar 2024
On the Impact of Black-box Deployment Strategies for Edge AI on Latency and Model Performance Jaskirat Singh Emad Fallahzadeh Bram Adams Ahmed E. Hassan MQ 64 3 0 25 Mar 2024
Synergy: Towards On-Body AI via Tiny AI Accelerator Collaboration on Wearables Taesik Gong S. Jang Utku Günay Acer F. Kawsar Chulhong Min 70 2 0 11 Dec 2023
HAPI: Hardware-Aware Progressive Inference Stefanos Laskaridis Stylianos I. Venieris Hyeji Kim Nicholas D. Lane 45 46 0 10 Aug 2020
DeeBERT: Dynamic Early Exiting for Accelerating BERT Inference Ji Xin Raphael Tang Jaejun Lee Yaoliang Yu Jimmy J. Lin 42 370 0 27 Apr 2020
PyTorch: An Imperative Style, High-Performance Deep Learning Library Adam Paszke Sam Gross Francisco Massa Adam Lerer James Bradbury ... Sasank Chilamkurthy Benoit Steiner Lu Fang Junjie Bai Soumith Chintala ODL 256 42,038 0 03 Dec 2019
AI Benchmark: All About Deep Learning on Smartphones in 2019 Andrey D. Ignatov Radu Timofte Andrei Kulik Seungsoo Yang Ke Wang Felix Baum Max Wu Lirong Xu Luc Van Gool ELM 39 220 0 15 Oct 2019
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing En Li Liekang Zeng Zhi Zhou Xu Chen 36 620 0 04 Oct 2019
Neural Network Inference on Mobile SoCs Siqi Wang A. Pathania T. Mitra 39 86 0 24 Aug 2019
MobiSR: Efficient On-Device Super-Resolution through Heterogeneous Mobile Processors Royson Lee Stylianos I. Venieris Łukasz Dudziak S. Bhattacharya Nicholas D. Lane SupR 36 95 0 21 Aug 2019
Improved Techniques for Training Adaptive Deep Networks Hao Li Hong Zhang Xiaojuan Qi Ruigang Yang Gao Huang 48 132 0 17 Aug 2019
And the Bit Goes Down: Revisiting the Quantization of Neural Networks Pierre Stock Armand Joulin Rémi Gribonval Benjamin Graham Hervé Jégou MQ 54 149 0 12 Jul 2019
SCAN: A Scalable Neural Networks Framework Towards Compact and Efficient Models Linfeng Zhang Zhanhong Tan Jiebo Song Jingwei Chen Chenglong Bao Kaisheng Ma 28 71 0 27 May 2019
EmBench: Quantifying Performance Variations of Deep Neural Networks across Modern Commodity Devices Mario Almeida Stefanos Laskaridis Ilias Leontiadis Stylianos I. Venieris Nicholas D. Lane 33 74 0 17 May 2019
Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation Linfeng Zhang Jiebo Song Anni Gao Jingwei Chen Chenglong Bao Kaisheng Ma FedML 48 852 0 17 May 2019
JALAD: Joint Accuracy- and Latency-Aware Deep Structure Decoupling for Edge-Cloud Execution Hongshan Li Chenghao Hu Jingyan Jiang Zhi Wang Yonggang Wen Wenwu Zhu 65 133 0 25 Dec 2018
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision Biyi Fang Xiao Zeng Mi Zhang 3DH 53 266 0 23 Oct 2018
Optimizing CNN Model Inference on CPUs Yizhi Liu Yao Wang Ruofei Yu Mu Li Vin Sharma Yida Wang 28 152 0 07 Sep 2018
Anatomy Of High-Performance Deep Learning Convolutions On SIMD Architectures E. Georganas Sasikanth Avancha K. Banerjee Dhiraj D. Kalamkar G. Henry Hans Pabst A. Heinecke BDL 31 105 0 16 Aug 2018
CascadeCNN: Pushing the performance limits of quantisation Alexandros Kouris Stylianos I. Venieris C. Bouganis MQ 39 24 0 22 May 2018
JointDNN: An Efficient Training and Inference Engine for Intelligent Mobile Cloud Computing Services Amir Erfan Eshratifar M. Abrishami Massoud Pedram FedML 48 251 0 25 Jan 2018
MobileNetV2: Inverted Residuals and Linear Bottlenecks Mark Sandler Andrew G. Howard Menglong Zhu A. Zhmoginov Liang-Chieh Chen 143 19,124 0 13 Jan 2018
Focus: Querying Large Video Datasets with Low Latency and Low Cost Kevin Hsieh Ganesh Ananthanarayanan P. Bodík P. Bahl Matthai Philipose Phillip B. Gibbons O. Mutlu 48 275 0 10 Jan 2018
Quantization and Training of Neural Networks for Efficient Integer-Arithmetic-Only Inference Benoit Jacob S. Kligys Bo Chen Menglong Zhu Matthew Tang Andrew G. Howard Hartwig Adam Dmitry Kalenichenko MQ 124 3,090 0 15 Dec 2017
MIT Advanced Vehicle Technology Study: Large-Scale Naturalistic Driving Study of Driver Behavior and Interaction with Automation Lex Fridman Daniel E. Brown Michael Glazer William Angell Spencer Dodd ... Anthony Pettinato B. Seppelt Linda S. Angell Bruce Mehler B. Reimer 39 193 0 19 Nov 2017
Bridging the Gap Between Neural Networks and Neuromorphic Hardware with A Neural Network Compiler Yu Ji Youhui Zhang Wenguang Chen Yuan Xie 44 56 0 15 Nov 2017
BranchyNet: Fast Inference via Early Exiting from Deep Neural Networks Surat Teerapittayanon Bradley McDanel H. T. Kung UQCV 52 1,131 0 06 Sep 2017
Distributed Deep Neural Networks over the Cloud, the Edge and End Devices Surat Teerapittayanon Bradley McDanel H. T. Kung FedML 67 714 0 06 Sep 2017
On Calibration of Modern Neural Networks Chuan Guo Geoff Pleiss Yu Sun Kilian Q. Weinberger UQCV 197 5,774 0 14 Jun 2017
Toward Low-Flying Autonomous MAV Trail Navigation using Deep Neural Networks for Environmental Awareness Nikolai Smolyanskiy A. Kamenev Jeffrey Smith Stan Birchfield 86 223 0 07 May 2017
Compressing DMA Engine: Leveraging Activation Sparsity for Training Deep Neural Networks Minsoo Rhu Mike O'Connor Niladrish Chatterjee Jeff Pool S. Keckler 42 176 0 03 May 2017
In-Datacenter Performance Analysis of a Tensor Processing Unit N. Jouppi C. Young Nishant Patil David Patterson Gaurav Agrawal ... Vijay Vasudevan Richard Walter Walter Wang Eric Wilcox Doe Hyun Yoon 178 4,619 0 16 Apr 2017
NoScope: Optimizing Neural Network Queries over Video at Scale Daniel Kang John Emmons Firas Abuzaid Peter Bailis Matei A. Zaharia 55 205 0 07 Mar 2017
Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights Aojun Zhou Anbang Yao Yiwen Guo Lin Xu Yurong Chen MQ 380 1,050 0 10 Feb 2017
Quantized Neural Networks: Training Neural Networks with Low Precision Weights and Activations Itay Hubara Matthieu Courbariaux Daniel Soudry Ran El-Yaniv Yoshua Bengio MQ 96 1,859 0 22 Sep 2016
On the Expressive Power of Deep Neural Networks M. Raghu Ben Poole Jon M. Kleinberg Surya Ganguli Jascha Narain Sohl-Dickstein 49 780 0 16 Jun 2016
TensorFlow: A system for large-scale machine learning Martín Abadi P. Barham Jianmin Chen Zhiwen Chen Andy Davis ... Vijay Vasudevan Pete Warden Martin Wicke Yuan Yu Xiaoqiang Zhang GNN AI4CE 336 18,300 0 27 May 2016
Inception-v4, Inception-ResNet and the Impact of Residual Connections on Learning Christian Szegedy Sergey Ioffe Vincent Vanhoucke Alexander A. Alemi 291 14,196 0 23 Feb 2016
Deep Residual Learning for Image Recognition Kaiming He Xinming Zhang Shaoqing Ren Jian Sun MedIm 1.4K 192,638 0 10 Dec 2015
Rethinking the Inception Architecture for Computer Vision Christian Szegedy Vincent Vanhoucke Sergey Ioffe Jonathon Shlens Z. Wojna 3DV BDL 478 27,231 0 02 Dec 2015
Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding Song Han Huizi Mao W. Dally 3DGS 203 8,793 0 01 Oct 2015
Empirical Evaluation of Rectified Activations in Convolutional Network Bing Xu Naiyan Wang Tianqi Chen Mu Li 112 2,901 0 05 May 2015
Very Deep Convolutional Networks for Large-Scale Image Recognition Karen Simonyan Andrew Zisserman FAtt MDE 928 99,991 0 04 Sep 2014