ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.02324
  4. Cited By
Annotation Artifacts in Natural Language Inference Data

Annotation Artifacts in Natural Language Inference Data

6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
ArXivPDFHTML

Papers citing "Annotation Artifacts in Natural Language Inference Data"

50 / 783 papers shown
Title
Shortcut Learning of Large Language Models in Natural Language
  Understanding
Shortcut Learning of Large Language Models in Natural Language Understanding
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Xia Hu
KELM
OffRL
47
84
0
25 Aug 2022
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in
  Natural Language Understanding Dataset
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
Zhihua Jin
Xingbo Wang
Furui Cheng
Chunhui Sun
Qun Liu
Huamin Qu
34
9
0
17 Aug 2022
Counterfactual Supervision-based Information Bottleneck for
  Out-of-Distribution Generalization
Counterfactual Supervision-based Information Bottleneck for Out-of-Distribution Generalization
Bin Deng
Kui Jia
OOD
30
1
0
16 Aug 2022
MENLI: Robust Evaluation Metrics from Natural Language Inference
MENLI: Robust Evaluation Metrics from Natural Language Inference
Yanran Chen
Steffen Eger
37
16
0
15 Aug 2022
Measuring Causal Effects of Data Statistics on Language Model's
  `Factual' Predictions
Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Amir Feder
Abhilasha Ravichander
Marius Mosbach
Yonatan Belinkov
Hinrich Schütze
Yoav Goldberg
CML
SyDa
MILM
38
55
0
28 Jul 2022
DataPerf: Benchmarks for Data-Centric AI Development
DataPerf: Benchmarks for Data-Centric AI Development
Mark Mazumder
Colby R. Banbury
Xiaozhe Yao
Bojan Karlavs
W. G. Rojas
...
Carole-Jean Wu
Cody Coleman
Andrew Y. Ng
Peter Mattson
Vijay Janapa Reddi
VLM
48
101
0
20 Jul 2022
MRCLens: an MRC Dataset Bias Detection Toolkit
MRCLens: an MRC Dataset Bias Detection Toolkit
Yifan Zhong
Haohan Wang
Eric Xing
29
0
0
18 Jul 2022
Breaking Correlation Shift via Conditional Invariant Regularizer
Breaking Correlation Shift via Conditional Invariant Regularizer
Mingyang Yi
Ruoyu Wang
Jiacheng Sun
Zhenguo Li
Zhi-Ming Ma
OODD
25
5
0
14 Jul 2022
Probing Classifiers are Unreliable for Concept Removal and Detection
Probing Classifiers are Unreliable for Concept Removal and Detection
Abhinav Kumar
Chenhao Tan
Amit Sharma
AAML
36
21
0
08 Jul 2022
When Does Group Invariant Learning Survive Spurious Correlations?
When Does Group Invariant Learning Survive Spurious Correlations?
Yimeng Chen
Ruibin Xiong
Zhiming Ma
Yanyan Lan
OOD
CML
43
21
0
29 Jun 2022
Unveiling Transformers with LEGO: a synthetic reasoning task
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang
A. Backurs
Sébastien Bubeck
Ronen Eldan
Suriya Gunasekar
Tal Wagner
LRM
41
86
0
09 Jun 2022
Revealing Single Frame Bias for Video-and-Language Learning
Revealing Single Frame Bias for Video-and-Language Learning
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
24
111
0
07 Jun 2022
Does Your Model Classify Entities Reasonably? Diagnosing and Mitigating
  Spurious Correlations in Entity Typing
Does Your Model Classify Entities Reasonably? Diagnosing and Mitigating Spurious Correlations in Entity Typing
Nan Xu
Fei Wang
Bangzheng Li
Mingtao Dong
Muhao Chen
34
20
0
25 May 2022
RobustLR: Evaluating Robustness to Logical Perturbation in Deductive
  Reasoning
RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning
Soumya Sanyal
Zeyi Liao
Xiang Ren
ELM
ReLM
LRM
61
20
0
25 May 2022
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious
  Feature-Label Correlation
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation
Yanrui Du
Jing Yang
Yan Chen
Jing Liu
Sendong Zhao
Qiaoqiao She
Huaqin Wu
Haifeng Wang
Bing Qin
44
9
0
25 May 2022
ER-Test: Evaluating Explanation Regularization Methods for Language
  Models
ER-Test: Evaluating Explanation Regularization Methods for Language Models
Brihi Joshi
Aaron Chan
Ziyi Liu
Shaoliang Nie
Maziar Sanjabi
Hamed Firooz
Xiang Ren
AAML
38
6
0
25 May 2022
Partial-input baselines show that NLI models can ignore context, but
  they don't
Partial-input baselines show that NLI models can ignore context, but they don't
Neha Srikanth
Rachel Rudinger
33
4
0
24 May 2022
On the Paradox of Learning to Reason from Data
On the Paradox of Learning to Reason from Data
Honghua Zhang
Liunian Harold Li
Tao Meng
Kai-Wei Chang
Guy Van den Broeck
NAI
ReLM
OOD
LRM
140
105
0
23 May 2022
Logical Reasoning with Span-Level Predictions for Interpretable and
  Robust NLI Models
Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models
Joe Stacey
Pasquale Minervini
Haim Dubossarsky
Marek Rei
ReLM
LRM
40
14
0
23 May 2022
An Empirical Investigation of Commonsense Self-Supervision with
  Knowledge Graphs
An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs
Jiarui Zhang
Filip Ilievski
Kaixin Ma
Jonathan M Francis
A. Oltramari
SSL
26
5
0
21 May 2022
Improving Multi-Task Generalization via Regularizing Spurious
  Correlation
Improving Multi-Task Generalization via Regularizing Spurious Correlation
Ziniu Hu
Zhe Zhao
Xinyang Yi
Tiansheng Yao
Lichan Hong
Yizhou Sun
Ed H. Chi
OOD
LRM
98
29
0
19 May 2022
Automated Crossword Solving
Automated Crossword Solving
Eric Wallace
Nicholas Tomlin
Albert Xu
Kevin Kaichuang Yang
Eshaan Pathak
Matthew Ginsberg
Dan Klein
50
12
0
19 May 2022
Are Prompt-based Models Clueless?
Are Prompt-based Models Clueless?
Pride Kavumba
Ryo Takahashi
Yusuke Oda
VLM
142
13
0
19 May 2022
Falsesum: Generating Document-level NLI Examples for Recognizing Factual
  Inconsistency in Summarization
Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization
Prasetya Ajie Utama
Joshua Bambrick
N. Moosavi
Iryna Gurevych
HILM
29
42
0
12 May 2022
e-CARE: a New Dataset for Exploring Explainable Causal Reasoning
e-CARE: a New Dataset for Exploring Explainable Causal Reasoning
Li Du
Xiao Ding
Kai Xiong
Ting Liu
Bing Qin
CML
28
62
0
12 May 2022
Clinical Prompt Learning with Frozen Language Models
Clinical Prompt Learning with Frozen Language Models
Niall Taylor
Yi Zhang
Dan W Joyce
A. Nevado-Holgado
Andrey Kormilitzin
VLM
LM&MA
16
31
0
11 May 2022
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot
  with Multi-Source Learning
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning
Oscar Sainz
Itziar Gonzalez-Dios
Oier López de Lacalle
Bonan Min
Eneko Agirre
38
49
0
03 May 2022
Don't Blame the Annotator: Bias Already Starts in the Annotation
  Instructions
Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions
Mihir Parmar
Swaroop Mishra
Mor Geva
Chitta Baral
38
55
0
01 May 2022
Learning to Split for Automatic Bias Detection
Learning to Split for Automatic Bias Detection
Yujia Bao
Regina Barzilay
26
20
0
28 Apr 2022
On the Limitations of Dataset Balancing: The Lost Battle Against
  Spurious Correlations
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
42
26
0
27 Apr 2022
Testing the Ability of Language Models to Interpret Figurative Language
Testing the Ability of Language Models to Interpret Figurative Language
Emmy Liu
Chenxuan Cui
Kenneth Zheng
Graham Neubig
ELM
LRM
25
65
0
26 Apr 2022
Event Detection Explorer: An Interactive Tool for Event Detection
  Exploration
Event Detection Explorer: An Interactive Tool for Event Detection Exploration
Wenlong Zhang
Bhagyashree Ingale
Hamza Shabir
Tianyi Li
Tian Shi
Ping Wang
31
4
0
26 Apr 2022
Generalized Quantifiers as a Source of Error in Multilingual NLU
  Benchmarks
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks
Ruixiang Cui
Daniel Hershcovich
Anders Søgaard
30
13
0
22 Apr 2022
Stretching Sentence-pair NLI Models to Reason over Long Documents and
  Clusters
Stretching Sentence-pair NLI Models to Reason over Long Documents and Clusters
Tal Schuster
Sihao Chen
S. Buthpitiya
Alex Fabrikant
Donald Metzler
31
41
0
15 Apr 2022
Towards Fine-grained Causal Reasoning and QA
Towards Fine-grained Causal Reasoning and QA
Linyi Yang
Zhen Wang
Yuxiang Wu
Jie Yang
Yue Zhang
43
15
0
15 Apr 2022
Automatic Fake News Detection: Are current models "fact-checking" or
  "gut-checking"?
Automatic Fake News Detection: Are current models "fact-checking" or "gut-checking"?
Ian Kelk
B. Basseri
Wee Yi Lee
Richard Qiu
Christy Tanner
34
5
0
14 Apr 2022
Fast Few-shot Debugging for NLU Test Suites
Fast Few-shot Debugging for NLU Test Suites
Christopher Malon
Kai Li
E. Kruus
30
4
0
13 Apr 2022
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning
  Tasks
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Peter Clark
Chitta Baral
Ashwin Kalyan
AIMat
ReLM
ELM
LRM
41
104
0
12 Apr 2022
KOBEST: Korean Balanced Evaluation of Significant Tasks
KOBEST: Korean Balanced Evaluation of Significant Tasks
Dohyeong Kim
Myeongjun Jang
D. Kwon
Eric Davis
ALM
16
23
0
09 Apr 2022
Informativeness and Invariance: Two Perspectives on Spurious
  Correlations in Natural Language
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language
Jacob Eisenstein
CML
40
25
0
09 Apr 2022
Invariance Learning based on Label Hierarchy
Invariance Learning based on Label Hierarchy
S. Toyota
Kenji Fukumizu
OOD
28
1
0
29 Mar 2022
Generating Data to Mitigate Spurious Correlations in Natural Language
  Inference Datasets
Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets
Yuxiang Wu
Matt Gardner
Pontus Stenetorp
Pradeep Dasigi
42
67
0
24 Mar 2022
A Rationale-Centric Framework for Human-in-the-loop Machine Learning
A Rationale-Centric Framework for Human-in-the-loop Machine Learning
Jinghui Lu
Linyi Yang
Brian Mac Namee
Yue Zhang
29
39
0
24 Mar 2022
Multilingual CheckList: Generation and Evaluation
Multilingual CheckList: Generation and Evaluation
Karthikeyan K
Shaily Bhatt
Pankaj Singh
Somak Aditya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhary
ELM
29
1
0
24 Mar 2022
Word Order Does Matter (And Shuffled Language Models Know It)
Word Order Does Matter (And Shuffled Language Models Know It)
Vinit Ravishankar
Mostafa Abdou
Artur Kulmizev
Anders Søgaard
22
44
0
21 Mar 2022
Leveraging Expert Guided Adversarial Augmentation For Improving
  Generalization in Named Entity Recognition
Leveraging Expert Guided Adversarial Augmentation For Improving Generalization in Named Entity Recognition
Aaron Reich
Jiaao Chen
Aastha Agrawal
Yanzhe Zhang
Diyi Yang
AAML
30
5
0
21 Mar 2022
Differentiable Reasoning over Long Stories -- Assessing Systematic
  Generalisation in Neural Models
Differentiable Reasoning over Long Stories -- Assessing Systematic Generalisation in Neural Models
Wanshui Li
Pasquale Minervini
OOD
22
1
0
20 Mar 2022
How Many Data Samples is an Additional Instruction Worth?
How Many Data Samples is an Additional Instruction Worth?
Ravsehaj Singh Puri
Swaroop Mishra
Mihir Parmar
Chitta Baral
25
17
0
17 Mar 2022
An Analysis of Negation in Natural Language Understanding Corpora
An Analysis of Negation in Natural Language Understanding Corpora
Md Mosharaf Hossain
Dhivya Chinnappa
Eduardo Blanco
16
42
0
16 Mar 2022
SciNLI: A Corpus for Natural Language Inference on Scientific Text
SciNLI: A Corpus for Natural Language Inference on Scientific Text
Mobashir Sadat
Cornelia Caragea
AILaw
32
35
0
13 Mar 2022
Previous
123...678...141516
Next