2205.11747
Cited By
BabyBear: Cheap inference triage for expensive language models
24 May 2022
Leila Khalili
Yao You
John Bohannon
Papers citing "BabyBear: Cheap inference triage for expensive language models"
8 / 8 papers shown
Title
A Unified Approach to Routing and Cascading for LLMs
Jasper Dekoninck
Maximilian Baader
Martin Vechev
117
2
0
14 Oct 2024
Scaling Laws for Neural Machine Translation
Behrooz Ghorbani
Orhan Firat
Markus Freitag
Ankur Bapna
M. Krikun
Xavier Garcia
Ciprian Chelba
Colin Cherry
77
102
0
16 Sep 2021
CascadeBERT: Accelerating Inference of Pre-trained Language Models via Calibrated Complete Models Cascade
Lei Li
Yankai Lin
Deli Chen
Shuhuai Ren
Peng Li
Jie Zhou
Xu Sun
83
52
0
29 Dec 2020
Wisdom of Committees: An Overlooked Approach To Faster and More Accurate Models
Xiaofang Wang
Dan Kondratyuk
Eric Christiansen
Kris Kitani
Y. Alon
Elad Eban
54
49
0
03 Dec 2020
Turkish Text Classification: From Lexicon Analysis to Bidirectional Transformer
Deniz Kavi
25
1
0
21 Aug 2020
SemEval-2017 Task 4: Sentiment Analysis in Twitter
Sara Rosenthal
N. Farra
Preslav Nakov
VLM
92
798
0
02 Dec 2019
DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter
Victor Sanh
Lysandre Debut
Julien Chaumond
Thomas Wolf
234
7,520
0
02 Oct 2019
SpanBERT: Improving Pre-training by Representing and Predicting Spans
Mandar Joshi
Danqi Chen
Yinhan Liu
Daniel S. Weld
Luke Zettlemoyer
Omer Levy
147
1,967
0
24 Jul 2019