Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2406.15627
Cited By
v1
v2
v3 (latest)
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
21 June 2024
Roman Vashurin
Ekaterina Fadeeva
Artem Vazhentsev
Akim Tsvigun
Daniil Vasilev
Rui Xing
Abdelrahman Boda Sadallah
Lyudmila Rvanova
Sergey Petrakov
Alexander Panchenko
Timothy Baldwin
Timothy Baldwin
Maxim Panov
Artem Shelmanov
Artem Shelmanov
HILM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph"
10 / 60 papers shown
Title
The Right Tool for the Job: Matching Model and Instance Complexities
Roy Schwartz
Gabriel Stanovsky
Swabha Swayamdipta
Jesse Dodge
Noah A. Smith
107
169
0
16 Apr 2020
Pitfalls of In-Domain Uncertainty Estimation and Ensembling in Deep Learning
Arsenii Ashukha
Alexander Lyzhov
Dmitry Molchanov
Dmitry Vetrov
UQCV
FedML
84
319
0
15 Feb 2020
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
445
20,298
0
23 Oct 2019
Mitigating Uncertainty in Document Classification
Xuchao Zhang
Fanglan Chen
Chang-Tien Lu
Naren Ramakrishnan
55
43
0
17 Jul 2019
Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan
Shay B. Cohen
Mirella Lapata
AILaw
143
1,682
0
27 Aug 2018
CoQA: A Conversational Question Answering Challenge
Siva Reddy
Danqi Chen
Christopher D. Manning
RALM
HAI
111
1,205
0
21 Aug 2018
A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks
Kimin Lee
Kibok Lee
Honglak Lee
Jinwoo Shin
OODD
187
2,060
0
10 Jul 2018
Understanding Measures of Uncertainty for Adversarial Example Detection
Lewis Smith
Y. Gal
UQCV
91
366
0
22 Mar 2018
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
Mandar Joshi
Eunsol Choi
Daniel S. Weld
Luke Zettlemoyer
RALM
213
2,676
0
09 May 2017
Deep Bayesian Active Learning with Image Data
Y. Gal
Riashat Islam
Zoubin Ghahramani
BDL
UQCV
70
1,735
0
08 Mar 2017
Previous
1
2