Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2310.12963
Cited By
AutoMix: Automatically Mixing Language Models
19 October 2023
Pranjal Aggarwal
Aman Madaan
Ankit Anand
Srividya Pranavi Potharaju
Swaroop Mishra
Pei Zhou
Aditya Gupta
Dheeraj Rajagopal
Karthik Kappaganthu
Yiming Yang
Shyam Upadhyay
Manaal Faruqui
Mausam
Re-assign community
ArXiv
PDF
HTML
Papers citing
"AutoMix: Automatically Mixing Language Models"
29 / 29 papers shown
Title
Dynamically Learned Test-Time Model Routing in Language Model Zoos with Service Level Guarantees
Herbert Woisetschläger
Ryan Zhang
Shiqiang Wang
Hans-Arno Jacobsen
7
0
0
26 May 2025
Learnware of Language Models: Specialized Small Language Models Can Do Big
Zhi-Hao Tan
Zi-Chen Zhao
Hao-Yu Shi
Xin-Yu Zhang
Peng Tan
Yang Yu
Zhi Zhou
78
0
0
19 May 2025
UncertainSAM: Fast and Efficient Uncertainty Quantification of the Segment Anything Model
Timo Kaiser
Thomas Norrenbrock
Bodo Rosenhahn
67
0
0
08 May 2025
Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation
Jingzhi Fang
Yanyan Shen
Yijiao Wang
Lei Chen
59
2
0
21 Mar 2025
Life-Cycle Routing Vulnerabilities of LLM Router
Qiqi Lin
Xiaoyang Ji
Shengfang Zhai
Qingni Shen
Zhi-Li Zhang
Yuejian Fang
Yansong Gao
AAML
70
1
0
09 Mar 2025
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble
Zhijun Chen
Jingzheng Li
Pengpeng Chen
Zhuoran Li
Kai Sun
Yuankai Luo
Qianren Mao
Dingqi Yang
Hailong Sun
Philip S. Yu
ELM
91
10
0
25 Feb 2025
Cost-Saving LLM Cascades with Early Abstention
Michael J. Zellinger
Rex Liu
Matt Thomson
127
0
0
13 Feb 2025
PickLLM: Context-Aware RL-Assisted Large Language Model Routing
Dimitrios Sikeridis
Dennis Ramdass
Pranay Pareek
103
1
0
12 Dec 2024
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
91
2
0
14 Oct 2024
Efficiently Deploying LLMs with Controlled Risk
Michael J. Zellinger
Matt Thomson
74
1
0
03 Oct 2024
What is the Role of Small Models in the LLM Era: A Survey
Lihu Chen
Gaël Varoquaux
ALM
136
26
0
10 Sep 2024
Cost-Effective Online Multi-LLM Selection with Versatile Reward Models
Xiangxiang Dai
Jin Li
Xutong Liu
Anqi Yu
J. C. Lui
75
8
0
26 May 2024
Optimising Calls to Large Language Models with Uncertainty-Based Two-Tier Selection
Guillem Ramírez
Alexandra Birch
Ivan Titov
62
8
0
03 May 2024
RouterBench: A Benchmark for Multi-LLM Routing System
Qitian Jason Hu
Jacob Bieker
Xiuyu Li
Nan Jiang
Benjamin Keigwin
Gaurav Ranganath
Kurt Keutzer
Shriyash Kaustubh Upadhyay
79
42
0
18 Mar 2024
Fly-Swat or Cannon? Cost-Effective Language Model Choice via Meta-Modeling
Marija vSakota
Maxime Peyrard
Robert West
42
48
0
11 Aug 2023
Faith and Fate: Limits of Transformers on Compositionality
Nouha Dziri
Ximing Lu
Melanie Sclar
Xiang Lorraine Li
Liwei Jian
...
Sean Welleck
Xiang Ren
Allyson Ettinger
Zaïd Harchaoui
Yejin Choi
ReLM
LRM
63
358
0
29 May 2023
Large Language Models are Better Reasoners with Self-Verification
Yixuan Weng
Minjun Zhu
Fei Xia
Bin Li
Shizhu He
Shengping Liu
Bin Sun
Kang Liu
Jun Zhao
ReLM
LRM
43
204
0
19 Dec 2022
RARR: Researching and Revising What Language Models Say, Using Language Models
Luyu Gao
Zhuyun Dai
Panupong Pasupat
Anthony Chen
Arun Tejasvi Chaganty
...
Vincent Zhao
Ni Lao
Hongrae Lee
Da-Cheng Juan
Kelvin Guu
HILM
KELM
76
257
0
17 Oct 2022
Measuring and Narrowing the Compositionality Gap in Language Models
Ofir Press
Muru Zhang
Sewon Min
Ludwig Schmidt
Noah A. Smith
M. Lewis
ReLM
KELM
LRM
98
595
0
07 Oct 2022
DocPrompting: Generating Code by Retrieving the Docs
Shuyan Zhou
Uri Alon
Frank F. Xu
Zhiruo Wang
Zhengbao Jiang
Graham Neubig
LLMAG
53
135
0
13 Jul 2022
SCROLLS: Standardized CompaRison Over Long Language Sequences
Uri Shaham
Elad Segal
Maor Ivgi
Avia Efrat
Ori Yoran
...
Ankit Gupta
Wenhan Xiong
Mor Geva
Jonathan Berant
Omer Levy
RALM
60
136
0
10 Jan 2022
The Efficiency Misnomer
Daoyuan Chen
Liuyi Yao
Dawei Gao
Ashish Vaswani
Yaliang Li
74
101
0
25 Oct 2021
MuTual: A Dataset for Multi-Turn Dialogue Reasoning
Leyang Cui
Yu-Huan Wu
Shujie Liu
Yue Zhang
Ming Zhou
LRM
38
150
0
09 Apr 2020
FastBERT: a Self-distilling BERT with Adaptive Inference Time
Weijie Liu
Peng Zhou
Zhe Zhao
Zhiruo Wang
Haotang Deng
Qi Ju
73
356
0
05 Apr 2020
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
117
7,386
0
02 Oct 2019
Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks
Nils Reimers
Iryna Gurevych
571
11,979
0
27 Aug 2019
CoQA: A Conversational Question Answering Challenge
Siva Reddy
Danqi Chen
Christopher D. Manning
RALM
HAI
80
1,188
0
21 Aug 2018
The NarrativeQA Reading Comprehension Challenge
Tomás Kociský
Jonathan Richard Schwarz
Phil Blunsom
Chris Dyer
Karl Moritz Hermann
Gábor Melis
Edward Grefenstette
88
759
0
19 Dec 2017
Learning Finite-State Controllers for Partially Observable Environments
Nicolas Meuleau
L. Peshkin
Kee-Eung Kim
L. Kaelbling
52
240
0
23 Jan 2013
1