LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models

2 April 2023

Christian van Onzenoodt

Timo Ropinski

Papers citing "LLMMaps -- A Visual Metaphor for Stratified Evaluation of Large Language Models"

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention. Jinhao Duan, Fei Kong, Hao-Ran Cheng, James Diffenderfer, B. Kailkhura, Lichao Sun, Xiaofeng Zhu, Xiaoshuang Shi, Kaidi Xu. 13 Mar 2025.
Blue Noise Plots. Christian van Onzenoodt, Gurprit Singh, Timo Ropinski, Tobias Ritschel. 08 Feb 2021.
Understanding the Capabilities, Limitations, and Societal Impact of Large Language Models. Alex Tamkin, Miles Brundage, Jack Clark, Deep Ganguli. Tags: AILaw, ELM. 04 Feb 2021.
PubMedQA: A Dataset for Biomedical Research Question Answering. Qiao Jin, Bhuwan Dhingra, Zhengping Liu, William W. Cohen, Xinghua Lu. 13 Sep 2019.
Language Models as Knowledge Bases? Fabio Petroni, Tim Rocktäschel, Patrick Lewis, A. Bakhtin, Yuxiang Wu, Alexander H. Miller, Sebastian Riedel. Tags: KELM, AI4MH. 03 Sep 2019.
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding. Alex Wang, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, Samuel R. Bowman. Tags: ELM. 20 Apr 2018.