Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1806.00692
Cited By
v1
v2
v3 (latest)
Stress Test Evaluation for Natural Language Inference
2 June 2018
Aakanksha Naik
Abhilasha Ravichander
Norman M. Sadeh
Carolyn Rose
Graham Neubig
ELM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Stress Test Evaluation for Natural Language Inference"
49 / 149 papers shown
Title
Underspecification Presents Challenges for Credibility in Modern Machine Learning
Alexander DÁmour
Katherine A. Heller
D. Moldovan
Ben Adlam
B. Alipanahi
...
Kellie Webster
Steve Yadlowsky
T. Yun
Xiaohua Zhai
D. Sculley
OffRL
183
688
0
06 Nov 2020
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
255
47
0
24 Oct 2020
Improving Robustness by Augmenting Training Sentences with Predicate-Argument Structures
N. Moosavi
M. Boer
Prasetya Ajie Utama
Iryna Gurevych
82
13
0
23 Oct 2020
The Extraordinary Failure of Complement Coercion Crowdsourcing
Yanai Elazar
Victoria Basmov
Shauli Ravfogel
Yoav Goldberg
Reut Tsarfaty
101
6
0
12 Oct 2020
OCNLI: Original Chinese Natural Language Inference
Hai Hu
Kyle Richardson
Liang Xu
Lu Li
Sandra Kübler
L. Moss
95
118
0
12 Oct 2020
On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks
Stephen Mussmann
Robin Jia
Percy Liang
83
15
0
10 Oct 2020
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data
William Huang
Haokun Liu
Samuel R. Bowman
84
38
0
09 Oct 2020
An Empirical Study on Model-agnostic Debiasing Strategies for Robust Natural Language Inference
Tianyu Liu
Xin Zheng
Xiaoan Ding
Baobao Chang
Zhifang Sui
73
25
0
08 Oct 2020
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation
Tianlu Wang
Xuezhi Wang
Yao Qin
Ben Packer
Kang Li
Jilin Chen
Alex Beutel
Ed H. Chi
SILM
81
84
0
05 Oct 2020
TaxiNLI: Taking a Ride up the NLU Hill
Pratik M. Joshi
Somak Aditya
Aalok Sathe
Monojit Choudhury
66
36
0
30 Sep 2020
Towards Debiasing NLU Models from Unknown Biases
Prasetya Ajie Utama
N. Moosavi
Iryna Gurevych
119
155
0
25 Sep 2020
Selective Question Answering under Domain Shift
Amita Kamath
Robin Jia
Percy Liang
OOD
61
214
0
16 Jun 2020
(Re)construing Meaning in NLP
Sean Trott
Tiago Timponi Torrent
Nancy Chang
Nathan Schneider
AI4CE
48
30
0
18 May 2020
INFOTABS: Inference on Tables as Semi-structured Data
Vivek Gupta
Maitrey Mehta
Pegah Nokhiz
Vivek Srikumar
LMTD
73
112
0
13 May 2020
Towards Robustifying NLI Models Against Lexical Dataset Biases
Xiang Zhou
Joey Tianyi Zhou
64
58
0
10 May 2020
Beyond Accuracy: Behavioral Testing of NLP models with CheckList
Marco Tulio Ribeiro
Tongshuang Wu
Carlos Guestrin
Sameer Singh
ELM
219
1,112
0
08 May 2020
DQI: Measuring Data Quality in NLP
Swaroop Mishra
Anjana Arunkumar
Bhavdeep Singh Sachdeva
Chris Bryan
Chitta Baral
133
32
0
02 May 2020
Elastic weight consolidation for better bias inoculation
James Thorne
Andreas Vlachos
54
11
0
29 Apr 2020
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions
Xiang Zhou
Yixin Nie
Hao Tan
Joey Tianyi Zhou
111
41
0
28 Apr 2020
Pretrained Transformers Improve Out-of-Distribution Robustness
Dan Hendrycks
Xiaoyuan Liu
Eric Wallace
Adam Dziedzic
R. Krishnan
Basel Alomair
OOD
223
436
0
13 Apr 2020
Translation Artifacts in Cross-lingual Transfer Learning
Mikel Artetxe
Gorka Labaka
Eneko Agirre
65
121
0
09 Apr 2020
Are Natural Language Inference Models IMPPRESsive? Learning IMPlicature and PRESupposition
Paloma Jeretic
Alex Warstadt
Suvrat Bhooshan
Adina Williams
ReLM
AI4CE
72
118
0
07 Apr 2020
Evaluating Models' Local Decision Boundaries via Contrast Sets
Matt Gardner
Yoav Artzi
Victoria Basmova
Jonathan Berant
Ben Bogin
...
Sanjay Subramanian
Reut Tsarfaty
Eric Wallace
Ally Zhang
Ben Zhou
ELM
120
84
0
06 Apr 2020
TyDi QA: A Benchmark for Information-Seeking Question Answering in Typologically Diverse Languages
J. Clark
Eunsol Choi
Michael Collins
Dan Garrette
Tom Kwiatkowski
Vitaly Nikolaev
J. Palomaki
237
613
0
10 Mar 2020
HypoNLI: Exploring the Artificial Patterns of Hypothesis-only Bias in Natural Language Inference
Tianyu Liu
Xin Zheng
Baobao Chang
Zhifang Sui
113
24
0
05 Mar 2020
Stress Test Evaluation of Transformer-based Models in Natural Language Understanding Tasks
Carlos Aspillaga
Andrés Carvallo
Vladimir Araujo
ELM
75
31
0
14 Feb 2020
Adversarial Filters of Dataset Biases
Ronan Le Bras
Swabha Swayamdipta
Chandra Bhagavatula
Rowan Zellers
Matthew E. Peters
Ashish Sabharwal
Yejin Choi
155
223
0
10 Feb 2020
Stance Detection Benchmark: How Robust Is Your Stance Detection?
Benjamin Schiller
Johannes Daxenberger
Iryna Gurevych
94
98
0
06 Jan 2020
Adversarial Analysis of Natural Language Inference Systems
Tiffany Chien
Jugal Kalita
AAML
61
12
0
07 Dec 2019
Question Answering for Privacy Policies: Combining Computational and Legal Perspectives
Abhilasha Ravichander
A. Black
Shomir Wilson
Thomas B. Norton
Norman M. Sadeh
AILaw
118
112
0
03 Nov 2019
Posing Fair Generalization Tasks for Natural Language Inference
Atticus Geiger
Ignacio Cases
L. Karttunen
Christopher Potts
68
48
0
03 Nov 2019
Adversarial Music: Real World Audio Adversary Against Wake-word Detection System
Juncheng Billy Li
Shuhui Qu
Xinjian Li
Joseph Szurley
J. Zico Kolter
Florian Metze
AAML
69
67
0
31 Oct 2019
Adversarial NLI: A New Benchmark for Natural Language Understanding
Yixin Nie
Adina Williams
Emily Dinan
Joey Tianyi Zhou
Jason Weston
Douwe Kiela
210
1,014
0
31 Oct 2019
Diversify Your Datasets: Analyzing Generalization via Controlled Variance in Adversarial Datasets
Ohad Rozen
Vered Shwartz
Roee Aharoni
Ido Dagan
AAML
90
38
0
21 Oct 2019
MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity
Hai Hu
Qi Chen
Kyle Richardson
A. Mukherjee
L. Moss
Sandra Kübler
68
41
0
19 Oct 2019
SesameBERT: Attention for Anywhere
Ta-Chun Su
Hsiang-Chih Cheng
58
7
0
08 Oct 2019
Probing Natural Language Inference Models through Semantic Fragments
Kyle Richardson
Hai Hu
L. Moss
Ashish Sabharwal
90
149
0
16 Sep 2019
A Logic-Driven Framework for Consistency of Neural Models
Tao Li
Vivek Gupta
Maitrey Mehta
Vivek Srikumar
AI4CE
141
106
0
31 Aug 2019
Unlearn Dataset Bias in Natural Language Inference by Fitting the Residual
He He
Sheng Zha
Haohan Wang
88
200
0
28 Aug 2019
Can neural networks understand monotonicity reasoning?
Hitomi Yanaka
K. Mineshima
D. Bekki
Kentaro Inui
Satoshi Sekine
Lasha Abzianidze
Johan Bos
LRM
67
81
0
15 Jun 2019
SuperGLUE: A Stickier Benchmark for General-Purpose Language Understanding Systems
Alex Jinpeng Wang
Yada Pruksachatkun
Nikita Nangia
Amanpreet Singh
Julian Michael
Felix Hill
Omer Levy
Samuel R. Bowman
ELM
428
2,331
0
02 May 2019
Probing What Different NLP Tasks Teach Machines about Function Word Comprehension
Najoung Kim
Roma Patel
Adam Poliak
Alex Jinpeng Wang
Patrick Xia
...
Alexis Ross
Tal Linzen
Benjamin Van Durme
Samuel R. Bowman
Ellie Pavlick
80
107
0
25 Apr 2019
Inoculation by Fine-Tuning: A Method for Analyzing Challenge Datasets
Nelson F. Liu
Roy Schwartz
Noah A. Smith
AAML
107
106
0
04 Apr 2019
On Evaluation of Adversarial Perturbations for Sequence-to-Sequence Models
Paul Michel
Xian Li
Graham Neubig
J. Pino
AAML
89
136
0
15 Mar 2019
Adversarial attacks against Fact Extraction and VERification
James Thorne
Andreas Vlachos
FedML
AAML
77
26
0
13 Mar 2019
Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference
R. Thomas McCoy
Ellie Pavlick
Tal Linzen
159
1,244
0
04 Feb 2019
Analysis Methods in Neural Language Processing: A Survey
Yonatan Belinkov
James R. Glass
125
558
0
21 Dec 2018
What If We Simply Swap the Two Text Fragments? A Straightforward yet Effective Way to Test the Robustness of Methods to Confounding Signals in Nature Language Inference Tasks
Haohan Wang
Da-You Sun
Eric Xing
101
42
0
07 Sep 2018
Trick Me If You Can: Human-in-the-loop Generation of Adversarial Examples for Question Answering
Eric Wallace
Pedro Rodriguez
Shi Feng
Ikuya Yamada
Jordan L. Boyd-Graber
AAML
125
18
0
07 Sep 2018
Previous
1
2
3