ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.05316
  4. Cited By
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge
  Computing

Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing

4 October 2019
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
ArXiv (abs)PDFHTML

Papers citing "Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing"

50 / 138 papers shown
Title
Intelligent Orchestration of Distributed Large Foundation Model Inference at the Edge
Intelligent Orchestration of Distributed Large Foundation Model Inference at the Edge
Fernando Koch
Aladin Djuhera
Alecio Binotto
74
0
0
01 Jul 2025
Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router
Route-and-Reason: Scaling Large Language Model Reasoning with Reinforced Model Router
Chenyang Shao
Xinyang Liu
Yutang Lin
Fengli Xu
Yong Li
MoELRM
68
0
0
06 Jun 2025
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation
Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation
Li Zhong
Ahmed Ghazal
Jun-Jun Wan
Frederik Zilly
Patrick Mackens
Joachim E. Vollrath
Bogdan Sorin Coseriu
247
0
0
23 May 2025
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
61
1
0
14 May 2025
Federated Learning for Cyber Physical Systems: A Comprehensive Survey
Federated Learning for Cyber Physical Systems: A Comprehensive Survey
Minh K. Quan
P. Pathirana
M. Wijayasundara
S. Setunge
Dinh C. Nguyen
Christopher G. Brinton
David J. Love
H. Vincent Poor
AI4CE
108
0
0
08 May 2025
Onboard Optimization and Learning: A Survey
Onboard Optimization and Learning: A Survey
Monirul Islam Pavel
Siyi Hu
Mahardhika Pratama
Ryszard Kowalczyk
66
0
0
07 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
124
0
0
05 May 2025
Hyperflows: Pruning Reveals the Importance of Weights
Hyperflows: Pruning Reveals the Importance of Weights
Eugen Barbulescu
Antonio Alexoaie
60
0
0
06 Apr 2025
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time
Zhaojun Nan
Yunchu Han
Sheng Zhou
Zhisheng Niu
128
0
0
27 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
102
6
0
08 Mar 2025
Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market
Songyuan Li
Jia Hu
Geyong Min
Haojun Huang
Jiwei Huang
87
0
0
06 Mar 2025
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence
Yufeng Diao
Yichi Zhang
Changyang She
Philip Guodong Zhao
Emma Liying Li
96
0
0
24 Feb 2025
Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference Services
Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference Services
Zhipeng Cheng
Xiaoyu Xia
Hong Wang
Minghui Liwang
Ning Chen
Xuwei Fan
Xianbin Wang
92
0
0
22 Feb 2025
InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems
InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems
Habib Larian
Faramarz Safi-Esfahani
100
2
0
17 Feb 2025
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Linyi Jiang
Silvery Fu
Yifei Zhu
Bo Li
ViT
461
0
0
14 Feb 2025
Vision-Language Models for Edge Networks: A Comprehensive Survey
Vision-Language Models for Edge Networks: A Comprehensive Survey
Ahmed Sharshar
Latif U. Khan
Waseem Ullah
Mohsen Guizani
VLM
160
3
0
11 Feb 2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Divya J. Bajpai
M. Hanawal
150
1
0
02 Feb 2025
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
Xiaolin Li
Binhua Huang
B. Cardiff
Deepu John
71
0
0
31 Jan 2025
Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence
Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence
Liekang Zeng
Shengyuan Ye
Xu Chen
Xiaoxi Zhang
Ju Ren
Jian Tang
Yang Yang
Xuemin
Shen
145
3
0
08 Jan 2025
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Xubin Wang
Weijia Jia
169
2
0
08 Jan 2025
Energy Optimization of Multi-task DNN Inference in MEC-assisted XR Devices: A Lyapunov-Guided Reinforcement Learning Approach
Energy Optimization of Multi-task DNN Inference in MEC-assisted XR Devices: A Lyapunov-Guided Reinforcement Learning Approach
Yanzan Sun
Jiacheng Qiu
Guangjin Pan
Shugong Xu
Shunqing Zhang
Xiaoyun Wang
Shuangfeng Han
65
0
0
07 Jan 2025
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for
  Low-Precision Edge AI
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI
Harideep Nair
P. Vellaisamy
Albert Chen
Joseph Finn
Anna Li
Manav Trivedi
J. Shen
53
3
0
23 Dec 2024
Data Generation for Hardware-Friendly Post-Training Quantization
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
482
0
0
29 Oct 2024
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty
  Estimation
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation
Gleb I. Radchenko
Victoria Andrea Fill
49
0
0
11 Oct 2024
Distributed Inference on Mobile Edge and Cloud: An Early Exit based
  Clustering Approach
Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach
Divya J. Bajpai
M. Hanawal
FedML
58
0
0
06 Oct 2024
ParallelSFL: A Novel Split Federated Learning Framework Tackling
  Heterogeneity Issues
ParallelSFL: A Novel Split Federated Learning Framework Tackling Heterogeneity Issues
Yunming Liao
Yang Xu
Hongli Xu
Zhiwei Yao
Liusheng Huang
C. Qiao
FedML
71
9
0
02 Oct 2024
Learning the Optimal Path and DNN Partition for Collaborative Edge
  Inference
Learning the Optimal Path and DNN Partition for Collaborative Edge Inference
Yin Huang
Letian Zhang
Jie Xu
69
1
0
02 Oct 2024
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against
  Adversarial Noise
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against Adversarial Noise
Muneeba Asif
Mohammad Kumail Kazmi
M. Rahman
S. R. Hasan
Soamar Homsi
AAML
55
0
0
25 Sep 2024
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge
  Intelligence
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence
Xin Yuan
Ning Li
Quan Chen
Wenchao Xu
Zhaoxin Zhang
Song Guo
61
0
0
25 Sep 2024
Automated and Holistic Co-design of Neural Networks and ASICs for
  Enabling In-Pixel Intelligence
Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence
Shubha R. Kharel
Prashansa Mukim
Piotr Maj
Grzegorz W. Deptuch
Shinjae Yoo
Yihui Ren
Soumyajit Mandal
68
0
0
18 Jul 2024
Latency optimized Deep Neural Networks (DNNs): An Artificial
  Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)
Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)
Seyed Nima Omidsajedi
Rekha Reddy
Jianming Yi
Jan Herbst
Christoph Lipps
Hans D. Schotten
43
0
0
16 Jul 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A
  Model-Based Reinforcement Learning Approach
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
88
10
0
03 Jun 2024
Online Resource Allocation for Edge Intelligence with Colocated Model
  Retraining and Inference
Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference
Huaiguang Cai
Zhi Zhou
Qianyi Huang
71
4
0
25 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
79
5
0
23 May 2024
Edge Intelligence Optimization for Large Language Model Inference with
  Batching and Quantization
Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization
Xinyuan Zhang
Jiang Liu
Zehui Xiong
Yudong Huang
Gaochang Xie
Ran Zhang
52
5
0
12 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic
  Review
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
Federico Nicolás Peccia
Oliver Bringmann
90
0
0
06 May 2024
Collaborative Satellite Computing through Adaptive DNN Task Splitting
  and Offloading
Collaborative Satellite Computing through Adaptive DNN Task Splitting and Offloading
Shifeng Peng
Xuefeng Hou
Zhishu Shen
Qiushi Zheng
Jiong Jin
Atsushi Tagami
Jingling Yuan
21
2
0
06 May 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge
  Intelligence in Networked Systems
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
89
7
0
20 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
109
0
0
16 Apr 2024
Collaborative Edge AI Inference over Cloud-RAN
Collaborative Edge AI Inference over Cloud-RAN
Pengfei Zhang
Dingzhu Wen
Guangxu Zhu
Qimei Chen
Kaifeng Han
Yuanming Shi
103
6
0
09 Apr 2024
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN
  Inference at the Edge
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Hasanul Mahmud
Peng Kang
Kevin Desai
P. Lama
Sushil Prasad
94
3
0
11 Mar 2024
HeteGen: Heterogeneous Parallel Inference for Large Language Models on
  Resource-Constrained Devices
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices
Xuanlei Zhao
Bin Jia
Hao Zhou
Ziming Liu
Shenggan Cheng
Yang You
36
5
0
02 Mar 2024
Selective Task offloading for Maximum Inference Accuracy and Energy
  efficient Real-Time IoT Sensing Systems
Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems
Abdelkarim Ben Sada
Amar Khelloufi
Abdenacer Naouri
Huansheng Ning
Sahraoui Dhelim
38
1
0
24 Feb 2024
Attention-aware Semantic Communications for Collaborative Inference
Attention-aware Semantic Communications for Collaborative Inference
Jiwoong Im
Nayoung Kwon
Taewoo Park
Jiheon Woo
Jaeho Lee
Yongjune Kim
76
2
0
23 Feb 2024
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
S. Hor
Ying Qian
Mert Pilanci
Amin Arbabian
60
0
0
06 Feb 2024
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond
  the Memory Budget
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget
Kun Wang
Jiani Cao
Zimu Zhou
Zhenjiang Li
57
7
0
30 Jan 2024
The Security and Privacy of Mobile Edge Computing: An Artificial
  Intelligence Perspective
The Security and Privacy of Mobile Edge Computing: An Artificial Intelligence Perspective
Cheng Wang
Zenghui Yuan
Pan Zhou
Zichuan Xu
Ruixuan Li
Dapeng Wu
45
24
0
03 Jan 2024
Energy-Efficient Power Control for Multiple-Task Split Inference in
  UAVs: A Tiny Learning-Based Approach
Energy-Efficient Power Control for Multiple-Task Split Inference in UAVs: A Tiny Learning-Based Approach
Chenxi Zhao
Min Sheng
Junyu Liu
Tianshu Chu
Jiandong Li
45
3
0
31 Dec 2023
Mobility and Cost Aware Inference Accelerating Algorithm for Edge
  Intelligence
Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence
Xin Yuan
Ning Li
Kang Wei
Wenchao Xu
Quan Chen
Hao Chen
Song Guo
57
1
0
27 Dec 2023
High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile
  Edge Computing
High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing
Xin Yuan
Ning Li
Tuo Zhang
Muqing Li
Yuwen Chen
José-Fernán Martínez Ortega
Song Guo
64
0
0
26 Dec 2023
123
Next