Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.02324
Cited By
Annotation Artifacts in Natural Language Inference Data
6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Annotation Artifacts in Natural Language Inference Data"
50 / 783 papers shown
Title
Shortcut Learning of Large Language Models in Natural Language Understanding
Mengnan Du
Fengxiang He
Na Zou
Dacheng Tao
Xia Hu
KELM
OffRL
47
84
0
25 Aug 2022
ShortcutLens: A Visual Analytics Approach for Exploring Shortcuts in Natural Language Understanding Dataset
Zhihua Jin
Xingbo Wang
Furui Cheng
Chunhui Sun
Qun Liu
Huamin Qu
34
9
0
17 Aug 2022
Counterfactual Supervision-based Information Bottleneck for Out-of-Distribution Generalization
Bin Deng
Kui Jia
OOD
30
1
0
16 Aug 2022
MENLI: Robust Evaluation Metrics from Natural Language Inference
Yanran Chen
Steffen Eger
37
16
0
15 Aug 2022
Measuring Causal Effects of Data Statistics on Language Model's `Factual' Predictions
Yanai Elazar
Nora Kassner
Shauli Ravfogel
Amir Feder
Abhilasha Ravichander
Marius Mosbach
Yonatan Belinkov
Hinrich Schütze
Yoav Goldberg
CML
SyDa
MILM
38
55
0
28 Jul 2022
DataPerf: Benchmarks for Data-Centric AI Development
Mark Mazumder
Colby R. Banbury
Xiaozhe Yao
Bojan Karlavs
W. G. Rojas
...
Carole-Jean Wu
Cody Coleman
Andrew Y. Ng
Peter Mattson
Vijay Janapa Reddi
VLM
48
101
0
20 Jul 2022
MRCLens: an MRC Dataset Bias Detection Toolkit
Yifan Zhong
Haohan Wang
Eric Xing
29
0
0
18 Jul 2022
Breaking Correlation Shift via Conditional Invariant Regularizer
Mingyang Yi
Ruoyu Wang
Jiacheng Sun
Zhenguo Li
Zhi-Ming Ma
OODD
25
5
0
14 Jul 2022
Probing Classifiers are Unreliable for Concept Removal and Detection
Abhinav Kumar
Chenhao Tan
Amit Sharma
AAML
36
21
0
08 Jul 2022
When Does Group Invariant Learning Survive Spurious Correlations?
Yimeng Chen
Ruibin Xiong
Zhiming Ma
Yanyan Lan
OOD
CML
43
21
0
29 Jun 2022
Unveiling Transformers with LEGO: a synthetic reasoning task
Yi Zhang
A. Backurs
Sébastien Bubeck
Ronen Eldan
Suriya Gunasekar
Tal Wagner
LRM
41
86
0
09 Jun 2022
Revealing Single Frame Bias for Video-and-Language Learning
Jie Lei
Tamara L. Berg
Joey Tianyi Zhou
24
111
0
07 Jun 2022
Does Your Model Classify Entities Reasonably? Diagnosing and Mitigating Spurious Correlations in Entity Typing
Nan Xu
Fei Wang
Bangzheng Li
Mingtao Dong
Muhao Chen
34
20
0
25 May 2022
RobustLR: Evaluating Robustness to Logical Perturbation in Deductive Reasoning
Soumya Sanyal
Zeyi Liao
Xiang Ren
ELM
ReLM
LRM
61
20
0
25 May 2022
Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation
Yanrui Du
Jing Yang
Yan Chen
Jing Liu
Sendong Zhao
Qiaoqiao She
Huaqin Wu
Haifeng Wang
Bing Qin
44
9
0
25 May 2022
ER-Test: Evaluating Explanation Regularization Methods for Language Models
Brihi Joshi
Aaron Chan
Ziyi Liu
Shaoliang Nie
Maziar Sanjabi
Hamed Firooz
Xiang Ren
AAML
38
6
0
25 May 2022
Partial-input baselines show that NLI models can ignore context, but they don't
Neha Srikanth
Rachel Rudinger
33
4
0
24 May 2022
On the Paradox of Learning to Reason from Data
Honghua Zhang
Liunian Harold Li
Tao Meng
Kai-Wei Chang
Guy Van den Broeck
NAI
ReLM
OOD
LRM
140
105
0
23 May 2022
Logical Reasoning with Span-Level Predictions for Interpretable and Robust NLI Models
Joe Stacey
Pasquale Minervini
Haim Dubossarsky
Marek Rei
ReLM
LRM
40
14
0
23 May 2022
An Empirical Investigation of Commonsense Self-Supervision with Knowledge Graphs
Jiarui Zhang
Filip Ilievski
Kaixin Ma
Jonathan M Francis
A. Oltramari
SSL
26
5
0
21 May 2022
Improving Multi-Task Generalization via Regularizing Spurious Correlation
Ziniu Hu
Zhe Zhao
Xinyang Yi
Tiansheng Yao
Lichan Hong
Yizhou Sun
Ed H. Chi
OOD
LRM
98
29
0
19 May 2022
Automated Crossword Solving
Eric Wallace
Nicholas Tomlin
Albert Xu
Kevin Kaichuang Yang
Eshaan Pathak
Matthew Ginsberg
Dan Klein
50
12
0
19 May 2022
Are Prompt-based Models Clueless?
Pride Kavumba
Ryo Takahashi
Yusuke Oda
VLM
142
13
0
19 May 2022
Falsesum: Generating Document-level NLI Examples for Recognizing Factual Inconsistency in Summarization
Prasetya Ajie Utama
Joshua Bambrick
N. Moosavi
Iryna Gurevych
HILM
29
42
0
12 May 2022
e-CARE: a New Dataset for Exploring Explainable Causal Reasoning
Li Du
Xiao Ding
Kai Xiong
Ting Liu
Bing Qin
CML
28
62
0
12 May 2022
Clinical Prompt Learning with Frozen Language Models
Niall Taylor
Yi Zhang
Dan W Joyce
A. Nevado-Holgado
Andrey Kormilitzin
VLM
LM&MA
16
31
0
11 May 2022
Textual Entailment for Event Argument Extraction: Zero- and Few-Shot with Multi-Source Learning
Oscar Sainz
Itziar Gonzalez-Dios
Oier López de Lacalle
Bonan Min
Eneko Agirre
38
49
0
03 May 2022
Don't Blame the Annotator: Bias Already Starts in the Annotation Instructions
Mihir Parmar
Swaroop Mishra
Mor Geva
Chitta Baral
38
55
0
01 May 2022
Learning to Split for Automatic Bias Detection
Yujia Bao
Regina Barzilay
26
20
0
28 Apr 2022
On the Limitations of Dataset Balancing: The Lost Battle Against Spurious Correlations
Roy Schwartz
Gabriel Stanovsky
42
26
0
27 Apr 2022
Testing the Ability of Language Models to Interpret Figurative Language
Emmy Liu
Chenxuan Cui
Kenneth Zheng
Graham Neubig
ELM
LRM
25
65
0
26 Apr 2022
Event Detection Explorer: An Interactive Tool for Event Detection Exploration
Wenlong Zhang
Bhagyashree Ingale
Hamza Shabir
Tianyi Li
Tian Shi
Ping Wang
31
4
0
26 Apr 2022
Generalized Quantifiers as a Source of Error in Multilingual NLU Benchmarks
Ruixiang Cui
Daniel Hershcovich
Anders Søgaard
30
13
0
22 Apr 2022
Stretching Sentence-pair NLI Models to Reason over Long Documents and Clusters
Tal Schuster
Sihao Chen
S. Buthpitiya
Alex Fabrikant
Donald Metzler
31
41
0
15 Apr 2022
Towards Fine-grained Causal Reasoning and QA
Linyi Yang
Zhen Wang
Yuxiang Wu
Jie Yang
Yue Zhang
43
15
0
15 Apr 2022
Automatic Fake News Detection: Are current models "fact-checking" or "gut-checking"?
Ian Kelk
B. Basseri
Wee Yi Lee
Richard Qiu
Christy Tanner
34
5
0
14 Apr 2022
Fast Few-shot Debugging for NLU Test Suites
Christopher Malon
Kai Li
E. Kruus
30
4
0
13 Apr 2022
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
Swaroop Mishra
Arindam Mitra
Neeraj Varshney
Bhavdeep Singh Sachdeva
Peter Clark
Chitta Baral
Ashwin Kalyan
AIMat
ReLM
ELM
LRM
41
104
0
12 Apr 2022
KOBEST: Korean Balanced Evaluation of Significant Tasks
Dohyeong Kim
Myeongjun Jang
D. Kwon
Eric Davis
ALM
16
23
0
09 Apr 2022
Informativeness and Invariance: Two Perspectives on Spurious Correlations in Natural Language
Jacob Eisenstein
CML
40
25
0
09 Apr 2022
Invariance Learning based on Label Hierarchy
S. Toyota
Kenji Fukumizu
OOD
28
1
0
29 Mar 2022
Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets
Yuxiang Wu
Matt Gardner
Pontus Stenetorp
Pradeep Dasigi
42
67
0
24 Mar 2022
A Rationale-Centric Framework for Human-in-the-loop Machine Learning
Jinghui Lu
Linyi Yang
Brian Mac Namee
Yue Zhang
29
39
0
24 Mar 2022
Multilingual CheckList: Generation and Evaluation
Karthikeyan K
Shaily Bhatt
Pankaj Singh
Somak Aditya
Sandipan Dandapat
Sunayana Sitaram
Monojit Choudhary
ELM
29
1
0
24 Mar 2022
Word Order Does Matter (And Shuffled Language Models Know It)
Vinit Ravishankar
Mostafa Abdou
Artur Kulmizev
Anders Søgaard
22
44
0
21 Mar 2022
Leveraging Expert Guided Adversarial Augmentation For Improving Generalization in Named Entity Recognition
Aaron Reich
Jiaao Chen
Aastha Agrawal
Yanzhe Zhang
Diyi Yang
AAML
30
5
0
21 Mar 2022
Differentiable Reasoning over Long Stories -- Assessing Systematic Generalisation in Neural Models
Wanshui Li
Pasquale Minervini
OOD
22
1
0
20 Mar 2022
How Many Data Samples is an Additional Instruction Worth?
Ravsehaj Singh Puri
Swaroop Mishra
Mihir Parmar
Chitta Baral
25
17
0
17 Mar 2022
An Analysis of Negation in Natural Language Understanding Corpora
Md Mosharaf Hossain
Dhivya Chinnappa
Eduardo Blanco
16
42
0
16 Mar 2022
SciNLI: A Corpus for Natural Language Inference on Scientific Text
Mobashir Sadat
Cornelia Caragea
AILaw
32
35
0
13 Mar 2022
Previous
1
2
3
...
6
7
8
...
14
15
16
Next