Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2407.01992
Cited By
Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?
2 July 2024
Nishant Balepur
Rachel Rudinger
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Is Your Large Language Model Knowledgeable or a Choices-Only Cheater?"
8 / 8 papers shown
Title
What the HellaSwag? On the Validity of Common-Sense Reasoning Benchmarks
Pavel Chizhov
Mattia Nee
Pierre-Carl Langlais
Ivan P. Yamshchikov
ReLM
ELM
LRM
39
1
0
10 Apr 2025
It is Too Many Options: Pitfalls of Multiple-Choice Questions in Generative AI and Medical Education
Shrutika Singh
Anton Alyakin
Daniel Alber
Jaden Stryker
Ai Phuong S Tong
...
Mathew de la Paz
Miguel Hernandez-Rovira
Ki Yun Park
Eric Leuthardt
E. Oermann
AI4MH
AI4Ed
ELM
64
1
0
13 Mar 2025
AtmosSci-Bench: Evaluating the Recent Advance of Large Language Model for Atmospheric Science
Chenyue Li
Wen Deng
Mengqian Lu
Binhang Yuan
ELM
AI4Cl
LRM
90
0
0
03 Feb 2025
TQA-Bench: Evaluating LLMs for Multi-Table Question Answering with Scalable Context and Symbolic Extension
Zipeng Qiu
You Peng
Guangxin He
Binhang Yuan
Chen Wang
LMTD
106
2
0
29 Nov 2024
Improving Model Evaluation using SMART Filtering of Benchmark Datasets
Vipul Gupta
Candace Ross
David Pantoja
R. Passonneau
Megan Ung
Adina Williams
76
1
0
26 Oct 2024
Plausibly Problematic Questions in Multiple-Choice Benchmarks for Commonsense Reasoning
Shramay Palta
Nishant Balepur
Peter Rankel
Sarah Wiegreffe
Marine Carpuat
Rachel Rudinger
ELM
31
4
0
06 Oct 2024
Leveraging Large Language Models for Multiple Choice Question Answering
Joshua Robinson
Christopher Rytting
David Wingate
ELM
143
186
0
22 Oct 2022
Better Distractions: Transformer-based Distractor Generation and Multiple Choice Question Filtering
J. Offerijns
Suzan Verberne
Tessa Verhoef
18
26
0
19 Oct 2020
1