Back to Square One: Artifact Detection, Training and Commonsense
Disentanglement in the Winograd Schema

Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema

16 April 2021

Papers citing "Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema"

17 / 17 papers shown

Title
MASS: Overcoming Language Bias in Image-Text Matching Jiwan Chung Seungwon Lim Sangkyu Lee Youngjae Yu VLM 32 0 0 20 Jan 2025
WinoPron: Revisiting English Winogender Schemas for Consistency, Coverage, and Grammatical Case Vagrant Gautam Julius Steuer Eileen Bingert Ray Johns Anne Lauscher Dietrich Klakow 48 3 0 09 Sep 2024
Picturing Ambiguity: A Visual Twist on the Winograd Schema Challenge Brendan Park Madeline Janecek Naser Ezzati-Jivan Yifeng Li Ali Emami 37 0 0 25 May 2024
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models Sreyan Ghosh Ashish Seth Sonal Kumar Utkarsh Tyagi Chandra Kiran Reddy Evuru S. Ramaneswaran S. Sakshi Oriol Nieto R. Duraiswami Dinesh Manocha AuLLM VLM CoGe 35 21 0 12 Oct 2023
Causal interventions expose implicit situation models for commonsense language understanding Takateru Yamakoshi James L. McClelland A. Goldberg Robert D. Hawkins 22 6 0 06 Jun 2023
Event knowledge in large language models: the gap between the impossible and the unlikely Carina Kauf Anna A. Ivanova Giulia Rambelli Emmanuele Chersoni Jingyuan Selena She Zawad Chowdhury Evelina Fedorenko Alessandro Lenci 37 67 0 02 Dec 2022
Measuring Reliability of Large Language Models through Semantic Consistency Harsh Raj Domenic Rosati S. Majumdar HILM 22 30 0 10 Nov 2022
CIKQA: Learning Commonsense Inference with a Unified Knowledge-in-the-loop QA Paradigm Hongming Zhang Yintong Huo Yanai Elazar Yangqiu Song Yoav Goldberg Dan Roth LRM 30 3 0 12 Oct 2022
State-of-the-art generalisation research in NLP: A taxonomy and review Dieuwke Hupkes Mario Giulianelli Verna Dankers Mikel Artetxe Yanai Elazar ... Leila Khalatbari Maria Ryskina Rita Frieske Ryan Cotterell Zhijing Jin 114 93 0 06 Oct 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations Roy Schwartz Gabriel Stanovsky 29 25 0 27 Apr 2022
Testing the Ability of Language Models to Interpret Figurative Language Emmy Liu Chenxuan Cui Kenneth Zheng Graham Neubig ELM LRM 17 65 0 26 Apr 2022
A Theoretically Grounded Benchmark for Evaluating Machine Commonsense Henrique M. Dinis Santos Ke Shen Alice M. Mulvehill Yasaman Razeghi D. McGuinness Mayank Kejriwal ELM LRM 15 4 0 23 Mar 2022
Commonsense Knowledge in Word Associations and ConceptNet Chunhua Liu Trevor Cohn Lea Frermann 30 7 0 20 Sep 2021
Measuring and Improving Consistency in Pretrained Language Models Yanai Elazar Nora Kassner Shauli Ravfogel Abhilasha Ravichander Eduard H. Hovy Hinrich Schütze Yoav Goldberg HILM 263 346 0 01 Feb 2021
The Sensitivity of Language Models and Humans to Winograd Schema Perturbations Mostafa Abdou Vinit Ravishankar Maria Barrett Yonatan Belinkov Desmond Elliott Anders Søgaard ReLM LRM 62 34 0 04 May 2020
Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets Mor Geva Yoav Goldberg Jonathan Berant 242 320 0 21 Aug 2019
Hypothesis Only Baselines in Natural Language Inference Adam Poliak Jason Naradowsky Aparajita Haldar Rachel Rudinger Benjamin Van Durme 190 576 0 02 May 2018