arXiv: 2401.07955
A Study on Large Language Models' Limitations in Multiple-Choice Question Answering

15 January 2024
Aisha Khatun
Daniel G. Brown
    ELM

Papers citing "A Study on Large Language Models' Limitations in Multiple-Choice Question Answering"

10 / 10 papers shown
Shaping Shared Languages: Human and Large Language Models' Inductive Biases in Emergent Communication
Tom Kouwenhoven
Max Peeperkorn
R. D. Kleijn
Tessa Verhoef
69
0
0
06 Mar 2025
Exploring Language Model Generalization in Low-Resource Extractive QA
Saptarshi Sengupta
Wenpeng Yin
Preslav Nakov
Shreya Ghosh
Suhang Wang
27
0
0
27 Sep 2024
Kalahi: A handcrafted, grassroots cultural LLM evaluation suite for Filipino
Jann Railey Montalan
Jian Gang Ngui
Wei Qi Leong
Yosephine Susanto
Hamsawardhini Rengarajan
William-Chandra Tjhi
Alham Fikri Aji
41
3
0
20 Sep 2024
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions
Sarah Wiegreffe
Oyvind Tafjord
Yonatan Belinkov
Hanna Hajishirzi
Ashish Sabharwal
50
5
0
21 Jul 2024
OLMES: A Standard for Language Model Evaluations
Yuling Gu
Oyvind Tafjord
Bailey Kuehl
Dany Haddad
Jesse Dodge
Hannaneh Hajishirzi
ELM
40
14
0
12 Jun 2024
TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability
Aisha Khatun
Daniel G. Brown
HILM
23
2
0
04 Jun 2024
PertEval: Unveiling Real Knowledge Capacity of LLMs with Knowledge-Invariant Perturbations
Jiatong Li
Renjun Hu
Kunzhe Huang
Zhuang Yan
Qi Liu
Mengxiao Zhu
Xing Shi
Wei Lin
KELM
54
5
0
30 May 2024
Is Temperature the Creativity Parameter of Large Language Models?
Max Peeperkorn
Tom Kouwenhoven
Daniel G. Brown
Anna K. Jordanous
37
45
0
01 May 2024
Pragmatic Competence Evaluation of Large Language Models for Korean
Dojun Park
Jiwoo Lee
Hyeyun Jeong
Seohyun Park
Sungeun Lee
ELM
41
2
0
19 Mar 2024
Leveraging Large Language Models for Multiple Choice Question Answering
Joshua Robinson
Christopher Rytting
David Wingate
ELM
148
186
0
22 Oct 2022