Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2505.21594
Cited By
Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits
27 May 2025
Yeshwanth Venkatesha
Souvik Kundu
Priyadarshini Panda
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Fast and Cost-effective Speculative Edge-Cloud Decoding with Early Exits"
3 / 3 papers shown
Title
Small Language Models: Survey, Measurements, and Insights
Zhenyan Lu
Xiang Li
Dongqi Cai
Rongjie Yi
Fangming Liu
Xiwen Zhang
Nicholas D. Lane
Mengwei Xu
ObjD
LRM
152
58
0
24 Sep 2024
Mobile Edge Intelligence for Large Language Models: A Contemporary Survey
Guanqiao Qu
Qiyuan Chen
Wei Wei
Zheng Lin
Xianhao Chen
Kaibin Huang
152
56
0
09 Jul 2024
Decoding Speculative Decoding
Minghao Yan
Saurabh Agarwal
Shivaram Venkataraman
LRM
92
10
0
02 Feb 2024
1