2505.12566
Cited By
HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing
18 May 2025
Leyang Xue, Yao Fu, Luo Mai, Mahesh K. Marina
Papers citing "HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing"
No papers