Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2412.12687
Cited By
Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models
17 December 2024
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Tony Q. S. Quek
Seong-Lyun Kim
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models"
3 / 3 papers shown
Title
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission
Seungeun Oh
Jinhyuk Kim
Jihong Park
Seung-Woo Ko
Jinho Choi
Tony Q. S. Quek
Seong-Lyun Kim
27
0
0
17 May 2025
The Larger the Merrier? Efficient Large AI Model Inference in Wireless Edge Networks
Zhonghao Lyu
Ming Xiao
Jie Xu
Mikael Skoglund
Marco Di Renzo
36
0
0
14 May 2025
A Novel Hat-Shaped Device-Cloud Collaborative Inference Framework for Large Language Models
Zuan Xie
Yang Xu
Hongli Xu
Yunming Liao
Zhiwei Yao
73
0
0
23 Mar 2025
1