ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1910.05316
  4. Cited By
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge
  Computing

Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing

4 October 2019
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
ArXivPDFHTML

Papers citing "Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing"

50 / 135 papers shown
Title
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
28
0
0
14 May 2025
Federated Learning for Cyber Physical Systems: A Comprehensive Survey
Federated Learning for Cyber Physical Systems: A Comprehensive Survey
Minh K. Quan
P. Pathirana
M. Wijayasundara
S. Setunge
Dinh C. Nguyen
Christopher G. Brinton
David J. Love
H. Vincent Poor
AI4CE
54
0
0
08 May 2025
Onboard Optimization and Learning: A Survey
Onboard Optimization and Learning: A Survey
Monirul Islam Pavel
Siyi Hu
Mahardhika Pratama
Ryszard Kowalczyk
26
0
0
07 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
27
0
0
05 May 2025
Hyperflows: Pruning Reveals the Importance of Weights
Hyperflows: Pruning Reveals the Importance of Weights
Eugen Barbulescu
Antonio Alexoaie
31
0
0
06 Apr 2025
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time
Zhaojun Nan
Yunchu Han
Sheng Zhou
Zhisheng Niu
46
0
0
27 Mar 2025
Adaptive Orchestration for Inference of Large Foundation Models at the Edge
Adaptive Orchestration for Inference of Large Foundation Models at the Edge
Fernando Koch
Aladin Djuhera
Alecio Binotto
34
0
0
19 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
60
1
0
08 Mar 2025
Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market
Songyuan Li
Jia Hu
Geyong Min
Haojun Huang
Jiwei Huang
63
0
0
06 Mar 2025
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence
Yufeng Diao
Yichi Zhang
Changyang She
Philip Guodong Zhao
Emma Liying Li
67
0
0
24 Feb 2025
Privacy-Aware Joint DNN Model Deployment and Partition Optimization for Delay-Efficient Collaborative Edge Inference
Privacy-Aware Joint DNN Model Deployment and Partition Optimization for Delay-Efficient Collaborative Edge Inference
Zhipeng Cheng
Xiaoyu Xia
Hong Wang
Minghui Liwang
Ning Chen
Xuwei Fan
Xianbin Wang
54
0
0
22 Feb 2025
InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems
InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems
Habib Larian
Faramarz Safi-Esfahani
49
0
0
17 Feb 2025
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Linyi Jiang
Silvery Fu
Yifei Zhu
Bo Li
ViT
203
0
0
14 Feb 2025
Vision-Language Models for Edge Networks: A Comprehensive Survey
Vision-Language Models for Edge Networks: A Comprehensive Survey
Ahmed Sharshar
Latif U. Khan
Waseem Ullah
Mohsen Guizani
VLM
70
3
0
11 Feb 2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Divya J. Bajpai
M. Hanawal
80
0
0
02 Feb 2025
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
Xiaolin Li
Binhua Huang
B. Cardiff
Deepu John
46
0
0
31 Jan 2025
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Xubin Wang
Weijia Jia
36
0
0
08 Jan 2025
Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence
Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence
Liekang Zeng
Shengyuan Ye
Xu Chen
Xiaoxi Zhang
Ju Ren
Jian Tang
Yang Yang
Xuemin
Shen
60
2
0
08 Jan 2025
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for
  Low-Precision Edge AI
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI
Harideep Nair
P. Vellaisamy
Albert Chen
Joseph Finn
Anna Li
Manav Trivedi
J. Shen
27
2
0
23 Dec 2024
Data Generation for Hardware-Friendly Post-Training Quantization
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
193
0
0
29 Oct 2024
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty
  Estimation
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation
Gleb I. Radchenko
Victoria Andrea Fill
31
0
0
11 Oct 2024
Distributed Inference on Mobile Edge and Cloud: An Early Exit based
  Clustering Approach
Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach
Divya J. Bajpai
M. Hanawal
FedML
32
0
0
06 Oct 2024
ParallelSFL: A Novel Split Federated Learning Framework Tackling
  Heterogeneity Issues
ParallelSFL: A Novel Split Federated Learning Framework Tackling Heterogeneity Issues
Yunming Liao
Yang Xu
Hongli Xu
Zhiwei Yao
Liusheng Huang
C. Qiao
FedML
45
6
0
02 Oct 2024
Learning the Optimal Path and DNN Partition for Collaborative Edge
  Inference
Learning the Optimal Path and DNN Partition for Collaborative Edge Inference
Yin Huang
Letian Zhang
Jie Xu
25
1
0
02 Oct 2024
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against
  Adversarial Noise
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against Adversarial Noise
Muneeba Asif
Mohammad Kumail Kazmi
M. Rahman
S. R. Hasan
Soamar Homsi
AAML
28
0
0
25 Sep 2024
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge
  Intelligence
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence
Xin Yuan
Ning Li
Quan Chen
Wenchao Xu
Zhaoxin Zhang
Song Guo
38
0
0
25 Sep 2024
Automated and Holistic Co-design of Neural Networks and ASICs for
  Enabling In-Pixel Intelligence
Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence
Shubha R. Kharel
Prashansa Mukim
Piotr Maj
Grzegorz W. Deptuch
Shinjae Yoo
Yihui Ren
Soumyajit Mandal
44
0
0
18 Jul 2024
Latency optimized Deep Neural Networks (DNNs): An Artificial
  Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)
Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)
Seyed Nima Omidsajedi
Rekha Reddy
Jianming Yi
Jan Herbst
Christoph Lipps
Hans D. Schotten
13
0
0
16 Jul 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A
  Model-Based Reinforcement Learning Approach
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
42
9
0
03 Jun 2024
Online Resource Allocation for Edge Intelligence with Colocated Model
  Retraining and Inference
Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference
Huaiguang Cai
Zhi Zhou
Qianyi Huang
45
3
0
25 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
52
4
0
23 May 2024
Edge Intelligence Optimization for Large Language Model Inference with
  Batching and Quantization
Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization
Xinyuan Zhang
Jiang Liu
Zehui Xiong
Yudong Huang
Gaochang Xie
Ran Zhang
23
5
0
12 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic
  Review
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
Federico Nicolás Peccia
Oliver Bringmann
36
0
0
06 May 2024
Collaborative Satellite Computing through Adaptive DNN Task Splitting
  and Offloading
Collaborative Satellite Computing through Adaptive DNN Task Splitting and Offloading
Shifeng Peng
Xuefeng Hou
Zhishu Shen
Qiushi Zheng
Jiong Jin
Atsushi Tagami
Jingling Yuan
13
2
0
06 May 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge
  Intelligence in Networked Systems
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
35
6
0
20 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
57
0
0
16 Apr 2024
Collaborative Edge AI Inference over Cloud-RAN
Collaborative Edge AI Inference over Cloud-RAN
Pengfei Zhang
Dingzhu Wen
Guangxu Zhu
Qimei Chen
Kaifeng Han
Yuanming Shi
58
5
0
09 Apr 2024
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN
  Inference at the Edge
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Hasanul Mahmud
Peng Kang
Kevin Desai
P. Lama
Sushil Prasad
17
3
0
11 Mar 2024
HeteGen: Heterogeneous Parallel Inference for Large Language Models on
  Resource-Constrained Devices
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices
Xuanlei Zhao
Bin Jia
Hao Zhou
Ziming Liu
Shenggan Cheng
Yang You
27
4
0
02 Mar 2024
Selective Task offloading for Maximum Inference Accuracy and Energy
  efficient Real-Time IoT Sensing Systems
Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems
Abdelkarim Ben Sada
Amar Khelloufi
Abdenacer Naouri
Huansheng Ning
Sahraoui Dhelim
30
1
0
24 Feb 2024
Attention-aware Semantic Communications for Collaborative Inference
Attention-aware Semantic Communications for Collaborative Inference
Jiwoong Im
Nayoung Kwon
Taewoo Park
Jiheon Woo
Jaeho Lee
Yongjune Kim
46
2
0
23 Feb 2024
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
S. Hor
Ying Qian
Mert Pilanci
Amin Arbabian
23
0
0
06 Feb 2024
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond
  the Memory Budget
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget
Kun Wang
Jiani Cao
Zimu Zhou
Zhenjiang Li
27
5
0
30 Jan 2024
The Security and Privacy of Mobile Edge Computing: An Artificial
  Intelligence Perspective
The Security and Privacy of Mobile Edge Computing: An Artificial Intelligence Perspective
Cheng Wang
Zenghui Yuan
Pan Zhou
Zichuan Xu
Ruixuan Li
Dapeng Wu
19
23
0
03 Jan 2024
Energy-Efficient Power Control for Multiple-Task Split Inference in
  UAVs: A Tiny Learning-Based Approach
Energy-Efficient Power Control for Multiple-Task Split Inference in UAVs: A Tiny Learning-Based Approach
Chenxi Zhao
Min Sheng
Junyu Liu
Tianshu Chu
Jiandong Li
20
2
0
31 Dec 2023
Mobility and Cost Aware Inference Accelerating Algorithm for Edge
  Intelligence
Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence
Xin Yuan
Ning Li
Kang Wei
Wenchao Xu
Quan Chen
Hao Chen
Song Guo
31
0
0
27 Dec 2023
High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile
  Edge Computing
High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing
Xin Yuan
Ning Li
Tuo Zhang
Muqing Li
Yuwen Chen
José-Fernán Martínez Ortega
Song Guo
33
0
0
26 Dec 2023
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO
  Guarantees via DNN Re-alignment
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
33
11
0
17 Dec 2023
Towards A Flexible Accuracy-Oriented Deep Learning Module Inference
  Latency Prediction Framework for Adaptive Optimization Algorithms
Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
Jingran Shen
Nikos Tziritas
Georgios Theodoropoulos
18
0
0
11 Dec 2023
Green Edge AI: A Contemporary Survey
Green Edge AI: A Contemporary Survey
Yuyi Mao
X. Yu
Kaibin Huang
Ying-Jun Angela Zhang
Jun Zhang
41
17
0
01 Dec 2023
123
Next