2502.08773
Universal Model Routing for Efficient LLM Inference
12 February 2025
Wittawat Jitkrittum, Harikrishna Narasimhan, A. S. Rawat, Jeevesh Juneja, Zifeng Wang, Chen-Yu Lee, Pradeep Shenoy, Rina Panigrahy, A. Menon, Sanjiv Kumar
Papers citing "Universal Model Routing for Efficient LLM Inference"
4 / 4 papers shown
Title
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
Zhihong Pan, Kai Zhang, Yuze Zhao, Yupeng Han
LRM
55
0
0
26 May 2025
The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Yiqun Zhang, Hao Li, Chenxu Wang, L. Chen, Qiaosheng Zhang, ..., Xinrun Wang, Jia Xu, Lei Bai, Wanli Ouyang, Shuyue Hu
67
0
0
26 May 2025
Resource-efficient Inference with Foundation Model Programs
Lunyiu Nie, Zhimin Ding, Kevin Yu, Marco Cheung, C. Jermaine, S. Chaudhuri
57
0
0
09 Apr 2025
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs
Zhongzhan Huang, Guoming Ling, Vincent S. Liang, Yupei Lin, Yandong Chen, Shanshan Zhong, Hefeng Wu
LRM
148
7
0
08 Mar 2025