Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.02616
Cited By
Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach
3 June 2024
Yuxuan Chen
Rongpeng Li
Xiaoxue Yu
Zhifeng Zhao
Honggang Zhang
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Adaptive Layer Splitting for Wireless LLM Inference in Edge Computing: A Model-Based Reinforcement Learning Approach"
4 / 4 papers shown
Title
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
15
0
0
14 May 2025
Adaptive Orchestration for Inference of Large Foundation Models at the Edge
Fernando Koch
Aladin Djuhera
Alecio Binotto
29
0
0
19 Mar 2025
Malware Detection at the Edge with Lightweight LLMs: A Performance Evaluation
Christian Rondanini
B. Carminati
E. Ferrari
Antonio Gaudiano
Ashish Kundu
51
0
0
06 Mar 2025
The Landscape and Challenges of HPC Research and LLMs
Le Chen
Nesreen K. Ahmed
Akashnil Dutta
Arijit Bhattacharjee
Sixing Yu
...
Vy A. Vo
J. P. Muñoz
Ted Willke
Tim Mattson
Ali Jannesari
AI4CE
34
20
0
03 Feb 2024
1