2411.01438
SkyServe: Serving AI Models across Regions and Clouds with Spot Instances
3 November 2024
Ziming Mao, Tian Xia, Zhanghao Wu, Wei-Lin Chiang, Tyler Griggs, Romil Bhardwaj, Zongheng Yang, S. Shenker, Ion Stoica
Papers citing "SkyServe: Serving AI Models across Regions and Clouds with Spot Instances"
2 / 2 papers shown
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
Yinsicheng Jiang, Yao Fu, Yeqi Huang, Ping Nie, Zhan Lu, ..., Dayou Du, Tairan Xu, Kai Zou, Edoardo Ponti, Luo Mai
MoE
17
0
0
16 May 2025
Prompt Inversion Attack against Collaborative Inference of Large Language Models
Wenjie Qu, Yuguang Zhou, Yongji Wu, Tingsong Xiao, Binhang Yuan, Heng Chang, Jiaheng Zhang
76
0
0
12 Mar 2025