Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.02764
Cited By
When Does Confidence-Based Cascade Deferral Suffice?
6 July 2023
Wittawat Jitkrittum
Neha Gupta
A. Menon
Harikrishna Narasimhan
A. S. Rawat
Surinder Kumar
Re-assign community
ArXiv
PDF
HTML
Papers citing
"When Does Confidence-Based Cascade Deferral Suffice?"
8 / 8 papers shown
Title
Bi-directional Model Cascading with Proxy Confidence
David Warren
Mark Dras
44
0
0
27 Apr 2025
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble
Zhijun Chen
Jingzheng Li
Pengpeng Chen
Zhuoran Li
Kai Sun
Yuankai Luo
Qianren Mao
Dingqi Yang
Hailong Sun
Philip S. Yu
ELM
52
4
0
25 Feb 2025
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
60
2
0
17 Feb 2025
Cost-Saving LLM Cascades with Early Abstention
Michael J. Zellinger
Rex Liu
Matt Thomson
111
0
0
13 Feb 2025
Predicting Probabilities of Error to Combine Quantization and Early Exiting: QuEE
Florence Regol
Joud Chataoui
Bertrand Charpentier
Mark J. Coates
Pablo Piantanida
Stephan Gunnemann
42
0
0
20 Jun 2024
Rejection via Learning Density Ratios
Alexander Soen
Hisham Husain
Philip Schulz
Vu-Linh Nguyen
47
2
0
29 May 2024
Language Model Cascades: Token-level uncertainty and beyond
Neha Gupta
Harikrishna Narasimhan
Wittawat Jitkrittum
A. S. Rawat
A. Menon
Sanjiv Kumar
UQLM
50
42
0
15 Apr 2024
BabyBear: Cheap inference triage for expensive language models
Leila Khalili
Yao You
John Bohannon
28
9
0
24 May 2022
1