Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1910.05316
Cited By
Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing
4 October 2019
En Li
Liekang Zeng
Zhi Zhou
Xu Chen
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Edge AI: On-Demand Accelerating Deep Neural Network Inference via Edge Computing"
50 / 135 papers shown
Title
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
28
0
0
14 May 2025
Federated Learning for Cyber Physical Systems: A Comprehensive Survey
Minh K. Quan
P. Pathirana
M. Wijayasundara
S. Setunge
Dinh C. Nguyen
Christopher G. Brinton
David J. Love
H. Vincent Poor
AI4CE
54
0
0
08 May 2025
Onboard Optimization and Learning: A Survey
Monirul Islam Pavel
Siyi Hu
Mahardhika Pratama
Ryszard Kowalczyk
26
0
0
07 May 2025
A Wireless Collaborated Inference Acceleration Framework for Plant Disease Recognition
Hele Zhu
Xinyi Huang
Haojia Gao
Mengfei Jiang
Haohua Que
Lei Mu
27
0
0
05 May 2025
Hyperflows: Pruning Reveals the Importance of Weights
Eugen Barbulescu
Antonio Alexoaie
31
0
0
06 Apr 2025
Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time
Zhaojun Nan
Yunchu Han
Sheng Zhou
Zhisheng Niu
46
0
0
27 Mar 2025
Adaptive Orchestration for Inference of Large Foundation Models at the Edge
Fernando Koch
Aladin Djuhera
Alecio Binotto
34
0
0
19 Mar 2025
Empowering Edge Intelligence: A Comprehensive Survey on On-Device AI Models
Xubin Wang
Zhiqing Tang
Jianxiong Guo
Tianhui Meng
Chenhao Wang
Tian-sheng Wang
Weijia Jia
60
1
0
08 Mar 2025
Dynamic Pricing for On-Demand DNN Inference in the Edge-AI Market
Songyuan Li
Jia Hu
Geyong Min
Haojun Huang
Jiwei Huang
63
0
0
06 Mar 2025
Aligning Task- and Reconstruction-Oriented Communications for Edge Intelligence
Yufeng Diao
Yichi Zhang
Changyang She
Philip Guodong Zhao
Emma Liying Li
67
0
0
24 Feb 2025
Privacy-Aware Joint DNN Model Deployment and Partition Optimization for Delay-Efficient Collaborative Edge Inference
Zhipeng Cheng
Xiaoyu Xia
Hong Wang
Minghui Liwang
Ning Chen
Xuwei Fan
Xianbin Wang
54
0
0
22 Feb 2025
InTec: integrated things-edge computing: a framework for distributing machine learning pipelines in edge AI systems
Habib Larian
Faramarz Safi-Esfahani
49
0
0
17 Feb 2025
Janus: Collaborative Vision Transformer Under Dynamic Network Environment
Linyi Jiang
Silvery Fu
Yifei Zhu
Bo Li
ViT
203
0
0
14 Feb 2025
Vision-Language Models for Edge Networks: A Comprehensive Survey
Ahmed Sharshar
Latif U. Khan
Waseem Ullah
Mohsen Guizani
VLM
70
3
0
11 Feb 2025
BEEM: Boosting Performance of Early Exit DNNs using Multi-Exit Classifiers as Experts
Divya J. Bajpai
M. Hanawal
80
0
0
02 Feb 2025
DCentNet: Decentralized Multistage Biomedical Signal Classification using Early Exits
Xiaolin Li
Binhua Huang
B. Cardiff
Deepu John
46
0
0
31 Jan 2025
Optimizing Edge AI: A Comprehensive Survey on Data, Model, and System Strategies
Xubin Wang
Weijia Jia
36
0
0
08 Jan 2025
Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence
Liekang Zeng
Shengyuan Ye
Xu Chen
Xiaoxi Zhang
Ju Ren
Jian Tang
Yang Yang
Xuemin
Shen
60
2
0
08 Jan 2025
tuGEMM: Area-Power-Efficient Temporal Unary GEMM Architecture for Low-Precision Edge AI
Harideep Nair
P. Vellaisamy
Albert Chen
Joseph Finn
Anna Li
Manav Trivedi
J. Shen
27
2
0
23 Dec 2024
Data Generation for Hardware-Friendly Post-Training Quantization
Lior Dikstein
Ariel Lapid
Arnon Netzer
H. Habi
MQ
193
0
0
29 Oct 2024
Edge AI Collaborative Learning: Bayesian Approaches to Uncertainty Estimation
Gleb I. Radchenko
Victoria Andrea Fill
31
0
0
11 Oct 2024
Distributed Inference on Mobile Edge and Cloud: An Early Exit based Clustering Approach
Divya J. Bajpai
M. Hanawal
FedML
32
0
0
06 Oct 2024
ParallelSFL: A Novel Split Federated Learning Framework Tackling Heterogeneity Issues
Yunming Liao
Yang Xu
Hongli Xu
Zhiwei Yao
Liusheng Huang
C. Qiao
FedML
45
6
0
02 Oct 2024
Learning the Optimal Path and DNN Partition for Collaborative Edge Inference
Yin Huang
Letian Zhang
Jie Xu
25
1
0
02 Oct 2024
SHEATH: Defending Horizontal Collaboration for Distributed CNNs against Adversarial Noise
Muneeba Asif
Mohammad Kumail Kazmi
M. Rahman
S. R. Hasan
Soamar Homsi
AAML
28
0
0
25 Sep 2024
A QoE-Aware Split Inference Accelerating Algorithm for NOMA-based Edge Intelligence
Xin Yuan
Ning Li
Quan Chen
Wenchao Xu
Zhaoxin Zhang
Song Guo
38
0
0
25 Sep 2024
Automated and Holistic Co-design of Neural Networks and ASICs for Enabling In-Pixel Intelligence
Shubha R. Kharel
Prashansa Mukim
Piotr Maj
Grzegorz W. Deptuch
Shinjae Yoo
Yihui Ren
Soumyajit Mandal
44
0
0
18 Jul 2024
Latency optimized Deep Neural Networks (DNNs): An Artificial Intelligence approach at the Edge using Multiprocessor System on Chip (MPSoC)
Seyed Nima Omidsajedi
Rekha Reddy
Jianming Yi
Jan Herbst
Christoph Lipps
Hans D. Schotten
13
0
0
16 Jul 2024
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
42
9
0
03 Jun 2024
Online Resource Allocation for Edge Intelligence with Colocated Model Retraining and Inference
Huaiguang Cai
Zhi Zhou
Qianyi Huang
45
3
0
25 May 2024
CEEBERT: Cross-Domain Inference in Early Exit BERT
Divya J. Bajpai
M. Hanawal
LRM
52
4
0
23 May 2024
Edge Intelligence Optimization for Large Language Model Inference with Batching and Quantization
Xinyuan Zhang
Jiang Liu
Zehui Xiong
Yudong Huang
Gaochang Xie
Ran Zhang
23
5
0
12 May 2024
Embedded Distributed Inference of Deep Neural Networks: A Systematic Review
Federico Nicolás Peccia
Oliver Bringmann
36
0
0
06 May 2024
Collaborative Satellite Computing through Adaptive DNN Task Splitting and Offloading
Shifeng Peng
Xuefeng Hou
Zhishu Shen
Qiushi Zheng
Jiong Jin
Atsushi Tagami
Jingling Yuan
13
2
0
06 May 2024
Socialized Learning: A Survey of the Paradigm Shift for Edge Intelligence in Networked Systems
Xiaofei Wang
Yunfeng Zhao
Chao Qiu
Qinghua Hu
Victor C. M. Leung
35
6
0
20 Apr 2024
I/O in Machine Learning Applications on HPC Systems: A 360-degree Survey
Noah Lewis
J. L. Bez
Suren Byna
57
0
0
16 Apr 2024
Collaborative Edge AI Inference over Cloud-RAN
Pengfei Zhang
Dingzhu Wen
Guangxu Zhu
Qimei Chen
Kaifeng Han
Yuanming Shi
58
5
0
09 Apr 2024
A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge
Hasanul Mahmud
Peng Kang
Kevin Desai
P. Lama
Sushil Prasad
17
3
0
11 Mar 2024
HeteGen: Heterogeneous Parallel Inference for Large Language Models on Resource-Constrained Devices
Xuanlei Zhao
Bin Jia
Hao Zhou
Ziming Liu
Shenggan Cheng
Yang You
27
4
0
02 Mar 2024
Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems
Abdelkarim Ben Sada
Amar Khelloufi
Abdenacer Naouri
Huansheng Ning
Sahraoui Dhelim
30
1
0
24 Feb 2024
Attention-aware Semantic Communications for Collaborative Inference
Jiwoong Im
Nayoung Kwon
Taewoo Park
Jiheon Woo
Jaeho Lee
Yongjune Kim
46
2
0
23 Feb 2024
Adaptive Inference: Theoretical Limits and Unexplored Opportunities
S. Hor
Ying Qian
Mert Pilanci
Amin Arbabian
23
0
0
06 Feb 2024
SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget
Kun Wang
Jiani Cao
Zimu Zhou
Zhenjiang Li
27
5
0
30 Jan 2024
The Security and Privacy of Mobile Edge Computing: An Artificial Intelligence Perspective
Cheng Wang
Zenghui Yuan
Pan Zhou
Zichuan Xu
Ruixuan Li
Dapeng Wu
19
23
0
03 Jan 2024
Energy-Efficient Power Control for Multiple-Task Split Inference in UAVs: A Tiny Learning-Based Approach
Chenxi Zhao
Min Sheng
Junyu Liu
Tianshu Chu
Jiandong Li
20
2
0
31 Dec 2023
Mobility and Cost Aware Inference Accelerating Algorithm for Edge Intelligence
Xin Yuan
Ning Li
Kang Wei
Wenchao Xu
Quan Chen
Hao Chen
Song Guo
31
0
0
27 Dec 2023
High Efficiency Inference Accelerating Algorithm for NOMA-based Mobile Edge Computing
Xin Yuan
Ning Li
Tuo Zhang
Muqing Li
Yuwen Chen
José-Fernán Martínez Ortega
Song Guo
33
0
0
26 Dec 2023
Graft: Efficient Inference Serving for Hybrid Deep Learning with SLO Guarantees via DNN Re-alignment
Jing Wu
Lin Wang
Qirui Jin
Fangming Liu
33
11
0
17 Dec 2023
Towards A Flexible Accuracy-Oriented Deep Learning Module Inference Latency Prediction Framework for Adaptive Optimization Algorithms
Jingran Shen
Nikos Tziritas
Georgios Theodoropoulos
18
0
0
11 Dec 2023
Green Edge AI: A Contemporary Survey
Yuyi Mao
X. Yu
Kaibin Huang
Ying-Jun Angela Zhang
Jun Zhang
41
17
0
01 Dec 2023
1
2
3
Next