Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1810.10090
Cited By
NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision
23 October 2018
Biyi Fang
Xiao Zeng
Mi Zhang
3DH
Re-assign community
ArXiv
PDF
HTML
Papers citing
"NestDNN: Resource-Aware Multi-Tenant On-Device Deep Learning for Continuous Mobile Vision"
50 / 82 papers shown
Title
On the Impact of White-box Deployment Strategies for Edge AI on Latency and Model Performance
Jaskirat Singh
Bram Adams
Ahmed E. Hassan
VLM
45
0
0
01 Nov 2024
Panopticus: Omnidirectional 3D Object Detection on Resource-constrained Edge Devices
Jeho Lee
Chanyoung Jung
Jiwon Kim
Hojung Cha
3DPC
37
1
0
02 Oct 2024
HydraViT: Stacking Heads for a Scalable ViT
Janek Haberer
A. Hojjat
Olaf Landsiedel
31
0
0
26 Sep 2024
ELMS: Elasticized Large Language Models On Mobile Devices
Wangsong Yin
Rongjie Yi
Daliang Xu
Gang Huang
Mengwei Xu
Xuanzhe Liu
37
5
0
08 Sep 2024
Loki: A System for Serving ML Inference Pipelines with Hardware and Accuracy Scaling
Sohaib Ahmad
Hui Guan
Ramesh K. Sitaraman
42
4
0
04 Jul 2024
Soar: Design and Deployment of A Smart Roadside Infrastructure System for Autonomous Driving
Shuyao Shi
Neiwen Ling
Zhehao Jiang
Xuan Huang
Yuze He
...
Chen Bian
Jingfei Xia
Zhenyu Yan
Raymond W. Yeung
Guoliang Xing
18
6
0
21 Apr 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
35
6
0
20 Apr 2024
BRIEDGE: EEG-Adaptive Edge AI for Multi-Brain to Multi-Robot Interaction
Jinhui Ouyang
Mingzhu Wu
Xinglin Li
Hanhui Deng
Di Wu
21
2
0
14 Mar 2024
Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems
Justin Davis
M. E. Belviranli
16
1
0
12 Feb 2024
IoT in the Era of Generative AI: Vision and Challenges
Xin Wang
Zhongwei Wan
Arvin Hekmati
M. Zong
Samiul Alam
Mi Zhang
Bhaskar Krishnamachari
32
15
0
03 Jan 2024
Real-time Neural Network Inference on Extremely Weak Devices: Agile Offloading with Explainable AI
Kai Huang
Wei Gao
22
35
0
21 Dec 2023
ECLM: Efficient Edge-Cloud Collaborative Learning with Continuous Environment Adaptation
Zhuang Yan
Zhenzhe Zheng
Yunfeng Shao
Bingshuai Li
Fan Wu
Guihai Chen
25
3
0
18 Nov 2023
Collaborative Inference in DNN-based Satellite Systems with Dynamic Task Streams
Jinglong Guan
Qiyang Zhang
Ilir Murturi
Praveen Kumar Donta
Schahram Dustdar
Shangguang Wang
33
3
0
10 Nov 2023
MOSEL: Inference Serving Using Dynamic Modality Selection
Bodun Hu
Le Xu
Jeongyoon Moon
N. Yadwadkar
Aditya Akella
13
4
0
27 Oct 2023
AdaEvo: Edge-Assisted Continuous and Timely DNN Model Evolution for Mobile Devices
Lehao Wang
Zhiwen Yu
Haoyi Yu
Sicong Liu
Yaxiong Xie
Bin Guo
Yunxin Liu
24
5
0
27 Sep 2023
LLMCad: Fast and Scalable On-device Large Language Model Inference
Daliang Xu
Wangsong Yin
Xin Jin
Wenjie Qu
Shiyun Wei
Mengwei Xu
Xuanzhe Liu
25
44
0
08 Sep 2023
RED: A Systematic Real-Time Scheduling Approach for Robotic Environmental Dynamics
Zexin Li
Tao Ren
Xiaoxi He
Cong Liu
29
7
0
29 Aug 2023
SwapMoE: Serving Off-the-shelf MoE-based Large Language Models with Tunable Memory Budget
Rui Kong
Yuanchun Li
Qingtian Feng
Weijun Wang
Xiaozhou Ye
Ye Ouyang
L. Kong
Yunxin Liu
MoE
37
8
0
29 Aug 2023
Generative Model for Models: Rapid DNN Customization for Diverse Tasks and Resource Constraints
Wenxing Xu
Yuanchun Li
Jiacheng Liu
Yiyou Sun
Zhengyang Cao
Yixuan Li
Hao Wen
Yunxin Liu
30
0
0
29 Aug 2023
Federated Learning for Computationally-Constrained Heterogeneous Devices: A Survey
Kilian Pfeiffer
Martin Rapp
R. Khalili
J. Henkel
FedML
22
66
0
18 Jul 2023
Miriam: Exploiting Elastic Kernels for Real-time Multi-DNN Inference on Edge GPU
Zhihe Zhao
Neiwen Ling
Nan Guan
Guoliang Xing
34
11
0
10 Jul 2023
Breaking On-device Training Memory Wall: A Systematic Survey
Shitian Li
Chunlin Tian
Kahou Tam
Ruirui Ma
Li Li
36
2
0
17 Jun 2023
Adaptive Scheduling for Edge-Assisted DNN Serving
Jian He
Chen-Shun Yang
Zhaoyuan He
Ghufran Baig
L. Qiu
19
0
0
19 Apr 2023
AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments
Hao Wen
Yuanchun Li
Zunshuai Zhang
Shiqi Jiang
Xiaozhou Ye
Ouyang Ye
Yaqin Zhang
Yunxin Liu
90
29
0
13 Mar 2023
TFormer: A Transmission-Friendly ViT Model for IoT Devices
Zhichao Lu
Chuntao Ding
Felix Juefei Xu
Vishnu Boddeti
Shangguang Wang
Yun Yang
23
13
0
15 Feb 2023
DynaMIX: Resource Optimization for DNN-Based Real-Time Applications on a Multi-Tasking System
Minkyoung Cho
Kang G. Shin
29
2
0
03 Feb 2023
SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference
Alind Khare
A. Agrawal
Aditya Annavajjala
Payman Behnam
Myungjin Lee
Hugo Latapie
Alexey Tumanov
FedML
13
2
0
26 Jan 2023
Mind Your Heart: Stealthy Backdoor Attack on Dynamic Deep Neural Network in Edge Computing
Tian Dong
Ziyuan Zhang
Han Qiu
Tianwei Zhang
Hewu Li
T. Wang
AAML
28
6
0
22 Dec 2022
On-device Training: A First Overview on Existing Systems
Shuai Zhu
Thiemo Voigt
Jeonggil Ko
Fatemeh Rahimian
34
14
0
01 Dec 2022
Edge Video Analytics: A Survey on Applications, Systems and Enabling Techniques
Renjie Xu
S. Razavi
Rong Zheng
44
15
0
28 Nov 2022
ROMA: Run-Time Object Detection To Maximize Real-Time Accuracy
JunKyu Lee
Blesson Varghese
Hans Vandierendonck
ObjD
36
4
0
28 Oct 2022
Towards Transmission-Friendly and Robust CNN Models over Cloud and Device
Chuntao Ding
Zhichao Lu
F. Xu
Vishnu Boddeti
Yidong Li
Jiannong Cao
27
14
0
20 Jul 2022
A Survey on Collaborative DNN Inference for Edge Intelligence
Weiqing Ren
Yuben Qu
Chao Dong
Yuqian Jing
Hao Sun
Qihui Wu
Song Guo
36
49
0
16 Jul 2022
STI: Turbocharge NLP Inference at the Edge via Elastic Pipelining
Liwei Guo
Wonkyo Choe
F. Lin
24
14
0
11 Jul 2022
Smart Multi-tenant Federated Learning
Weiming Zhuang
Yonggang Wen
Shuai Zhang
FedML
36
2
0
09 Jul 2022
CPrune: Compiler-Informed Model Pruning for Efficient Target-Aware DNN Execution
Taeho Kim
Yongin Kwon
Jemin Lee
Taeho Kim
Sangtae Ha
35
2
0
04 Jul 2022
Turbo: Opportunistic Enhancement for Edge Video Analytics
Yan Lu
Shiqi Jiang
Ting Cao
Yuanchao Shu
42
29
0
29 Jun 2022
Boosting DNN Cold Inference on Edge Devices
Rongjie Yi
Ting Cao
Ao Zhou
Xiao Ma
Shangguang Wang
Mengwei Xu
151
6
0
15 Jun 2022
Multi-DNN Accelerators for Next-Generation AI Systems
Stylianos I. Venieris
C. Bouganis
Nicholas D. Lane
38
7
0
19 May 2022
FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics
Md. Adnan Arefeen
Sumaiya Tabassum Nimi
M. Y. S. Uddin
22
10
0
22 Mar 2022
YONO: Modeling Multiple Heterogeneous Neural Networks on Microcontrollers
Young D. Kwon
Jagmohan Chauhan
Cecilia Mascolo
24
13
0
08 Mar 2022
Resource-Efficient Deep Learning: A Survey on Model-, Arithmetic-, and Implementation-Level Techniques
JunKyu Lee
L. Mukhanov
A. S. Molahosseini
U. Minhas
Yang Hua
Jesus Martinez del Rincon
K. Dichev
Cheol-Ho Hong
Hans Vandierendonck
44
29
0
30 Dec 2021
Virtuoso: Video-based Intelligence for real-time tuning on SOCs
Jayoung Lee
Pengcheng Wang
Ran Xu
Venkateswara Dasari
Noah Weston
Yin Li
S. Bagchi
Somali Chaterji
34
2
0
24 Dec 2021
LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision
Rui Han
Qinglong Zhang
C. Liu
Guoren Wang
Jian Tang
L. Chen
21
44
0
18 Dec 2021
CANS: Communication Limited Camera Network Self-Configuration for Intelligent Industrial Surveillance
Jingzheng Tu
Qimin Xu
Cailian Chen
40
2
0
13 Sep 2021
SensiX++: Bringing MLOPs and Multi-tenant Model Serving to Sensory Edge Devices
Chulhong Min
Akhil Mathur
Utku Günay Acer
A. Montanari
F. Kawsar
30
11
0
08 Sep 2021
Efficient Visual Recognition with Deep Neural Networks: A Survey on Recent Advances and New Directions
Yang Wu
Dingheng Wang
Xiaotong Lu
Fan Yang
Guoqi Li
W. Dong
Jianbo Shi
29
18
0
30 Aug 2021
Leveraging Transprecision Computing for Machine Vision Applications at the Edge
U. Minhas
L. Mukhanov
G. Karakonstantis
Hans Vandierendonck
Roger Francis Woods
32
5
0
29 Aug 2021
A Field Guide to Federated Optimization
Jianyu Wang
Zachary B. Charles
Zheng Xu
Gauri Joshi
H. B. McMahan
...
Mi Zhang
Tong Zhang
Chunxiang Zheng
Chen Zhu
Wennan Zhu
FedML
187
412
0
14 Jul 2021
How to Reach Real-Time AI on Consumer Devices? Solutions for Programmable and Custom Architectures
Stylianos I. Venieris
Ioannis Panopoulos
Ilias Leontiadis
I. Venieris
33
6
0
21 Jun 2021
1
2
Next