TuringAdvice: A Generative and Dynamic Evaluation of Language Use

7 April 2020
Rowan Zellers
Ari Holtzman
Elizabeth Clark
Lianhui Qin
Ali Farhadi
Yejin Choi
ELM, LRM

Papers citing "TuringAdvice: A Generative and Dynamic Evaluation of Language Use"

11 / 11 papers shown
Debate, Deliberate, Decide (D3): A Cost-Aware Adversarial Framework for Reliable and Interpretable LLM Evaluation
Chaithanya Bandi
Abir Harrasse
Hari Bandi
LLMAG, ELM
408
11
0
07 Oct 2024
Towards Human-Centred Explainability Benchmarks For Text Classification
Viktor Schlegel
Erick Mendez Guzman
Riza Batista-Navarro
287
5
0
10 Nov 2022
AI and the Everything in the Whole Wide World Benchmark
Inioluwa Deborah Raji
Emily M. Bender
Amandalynne Paullada
Emily L. Denton
A. Hanna
289
426
0
26 Nov 2021
TellMeWhy: A Dataset for Answering Why-Questions in Narratives
Findings (Findings), 2021
Yash Kumar Lal
Nathanael Chambers
Raymond J. Mooney
Niranjan Balasubramanian
362
56
0
11 Jun 2021
What Will it Take to Fix Benchmarking in Natural Language Understanding?
North American Chapter of the Association for Computational Linguistics (NAACL), 2021
Samuel R. Bowman
George E. Dahl
ELM, ALM
328
203
0
05 Apr 2021
Help! Need Advice on Identifying Advice
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2020
Venkata S Govindarajan
Benjamin Chen
Rebecca Warholic
K. Erk
Junyi Jessy Li
162
22
0
06 Oct 2020
Measuring Massive Multitask Language Understanding
International Conference on Learning Representations (ICLR), 2020
Dan Hendrycks
Collin Burns
Steven Basart
Andy Zou
Mantas Mazeika
Basel Alomair
Jacob Steinhardt
ELM, RALM
3.9K
7,430
0
07 Sep 2020
Forecasting AI Progress: A Research Agenda
Ross Gruetzemacher
Florian E. Dorner
Niko Bernaola-Alvarez
Charlie Giattino
D. Manheim
AI4TS
202
39
0
04 Aug 2020
Evaluation of Text Generation: A Survey
Asli Celikyilmaz
Elizabeth Clark
Jianfeng Gao
ELM, LM&MA
404
440
0
26 Jun 2020
Experience Grounds Language
Yonatan Bisk
Ari Holtzman
Jesse Thomason
Jacob Andreas
Yoshua Bengio
...
Angeliki Lazaridou
Jonathan May
Aleksandr Nisnevich
Nicolas Pinto
Joseph P. Turian
654
420
0
21 Apr 2020
Machine learning as a model for cultural learning: Teaching an algorithm what it means to be fat
Sociological Methods & Research (SMR), 2020
Alina Arseniev-Koehler
J. Foster
325
57
0
24 Mar 2020