ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2406.08723
  4. Cited By
ECBD: Evidence-Centered Benchmark Design for NLP

ECBD: Evidence-Centered Benchmark Design for NLP

13 June 2024
Yu Lu Liu
Su Lin Blodgett
Jackie Chi Kit Cheung
Q. Vera Liao
Alexandra Olteanu
Ziang Xiao
ArXivPDFHTML

Papers citing "ECBD: Evidence-Centered Benchmark Design for NLP"

5 / 5 papers shown
Title
TRAIL: Trace Reasoning and Agentic Issue Localization
TRAIL: Trace Reasoning and Agentic Issue Localization
Darshan Deshpande
Varun Gangal
Hersh Mehta
Jitin Krishnan
Anand Kannappan
Rebecca Qian
30
0
0
13 May 2025
TALES: Text Adventure Learning Environment Suite
TALES: Text Adventure Learning Environment Suite
Christopher Zhang Cui
Xingdi Yuan
Ziang Xiao
Prithviraj Ammanabrolu
Marc-Alexandre Côté
LLMAG
LRM
52
1
0
19 Apr 2025
Challenges in Measuring Bias via Open-Ended Language Generation
Challenges in Measuring Bias via Open-Ended Language Generation
Afra Feyza Akyürek
Muhammed Yusuf Kocyigit
Sejin Paik
Derry Wijaya
43
22
0
23 May 2022
Just What do You Think You're Doing, Dave?' A Checklist for Responsible
  Data Use in NLP
Just What do You Think You're Doing, Dave?' A Checklist for Responsible Data Use in NLP
Anna Rogers
Timothy Baldwin
Kobi Leins
104
64
0
14 Sep 2021
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language
  Understanding
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding
Alex Jinpeng Wang
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
299
6,996
0
20 Apr 2018
1