Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.02324
Cited By
v1
v2 (latest)
Annotation Artifacts in Natural Language Inference Data
6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Annotation Artifacts in Natural Language Inference Data"
50 / 796 papers shown
Title
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks
Somnath Banerjee
Maulindu Sarkar
Punyajoy Saha
Binny Mathew
Animesh Mukherjee
TDI
59
0
0
22 Feb 2024
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
William Merrill
Zhaofeng Wu
Norihito Naka
Yoon Kim
Tal Linzen
120
9
0
21 Feb 2024
SaGE: Evaluating Moral Consistency in Large Language Models
Vamshi Krishna Bonagiri
Sreeram Vennam
Priyanshul Govil
Ponnurangam Kumaraguru
Manas Gaur
ELM
85
0
0
21 Feb 2024
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Nishant Balepur
Abhilasha Ravichander
Rachel Rudinger
ELM
120
28
0
19 Feb 2024
A synthetic data approach for domain generalization of NLI models
Mohammad Javad Hosseini
Andrey Petrov
Alex Fabrikant
Annie Louis
SyDa
84
10
0
19 Feb 2024
Punctuation Restoration Improves Structure Understanding Without Supervision
Junghyun Min
Minho Lee
Woochul Lee
Yeonsoo Lee
157
1
0
13 Feb 2024
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models
Marc Braun
Jenny Kunz
40
3
0
07 Feb 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Soumya Sanyal
Tianyi Xiao
Jiacheng Liu
Wenya Wang
Xiang Ren
LRM
ReLM
129
12
0
06 Feb 2024
Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models
Sara Rajaee
Christof Monz
80
4
0
03 Feb 2024
Comparing Template-based and Template-free Language Model Probing
Sagi Shaier
Kevin Bennett
Lawrence E Hunter
Katharina von der Wense
ELM
96
4
0
31 Jan 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
Rheeya Uppaal
Yixuan Li
Junjie Hu
146
6
0
31 Jan 2024
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models
Erik Arakelyan
Zhaoqi Liu
Isabelle Augenstein
AAML
145
12
0
25 Jan 2024
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Yoo Yeon Sung
Ishani Mondal
Jordan L. Boyd-Graber
70
0
0
20 Jan 2024
Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Kaitlyn Zhou
Jena D. Hwang
Xiang Ren
Maarten Sap
88
68
0
12 Jan 2024
Self-Supervised Position Debiasing for Large Language Models
Zhongkun Liu
Zheng Chen
Mengqi Zhang
Zhaochun Ren
Pengjie Ren
Zhumin Chen
76
1
0
02 Jan 2024
Zero-Shot Fact-Checking with Semantic Triples and Knowledge Graphs
Moy Yuan
Andreas Vlachos
92
7
0
19 Dec 2023
Discovering Highly Influential Shortcut Reasoning: An Automated Template-Free Approach
Daichi Haraguchi
Kiyoaki Shirai
Naoya Inoue
Natthawut Kertkeidkachorn
LRM
33
0
0
15 Dec 2023
PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments
Daiki Asami
Saku Sugawara
30
1
0
14 Dec 2023
Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference
Dat Thanh Nguyen
30
0
0
14 Dec 2023
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training
Jaehyung Kim
Yuning Mao
Rui Hou
Hanchao Yu
Davis Liang
Pascale Fung
Qifan Wang
Fuli Feng
Lifu Huang
Madian Khabsa
AAML
60
4
0
07 Dec 2023
Improving Bias Mitigation through Bias Experts in Natural Language Understanding
Eojin Jeon
Mingyu Lee
Juhyeong Park
Yeachan Kim
Wing-Lam Mok
SangKeun Lee
42
2
0
06 Dec 2023
Eliciting Latent Knowledge from Quirky Language Models
Alex Troy Mallen
Madeline Brumley
Julia Kharchenko
Nora Belrose
HILM
RALM
KELM
88
33
0
02 Dec 2023
The Case for Scalable, Data-Driven Theory: A Paradigm for Scientific Progress in NLP
Julian Michael
49
1
0
01 Dec 2023
WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models
Youssef Benchekroun
Megi Dervishi
Mark Ibrahim
Jean-Baptiste Gaya
Xavier Martinet
Grégoire Mialon
Thomas Scialom
Emmanuel Dupoux
Dieuwke Hupkes
Pascal Vincent
LRM
63
8
0
27 Nov 2023
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney
Shachar Don-Yehiya
Leshem Choshen
Omri Abend
63
7
0
20 Nov 2023
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study
Maike Zufle
Verna Dankers
Ivan Titov
91
0
0
16 Nov 2023
Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals
Yanai Elazar
Bhargavi Paranjape
Hao Peng
Sarah Wiegreffe
Khyathi Raghavi
Vivek Srikumar
Sameer Singh
Noah A. Smith
AAML
OOD
65
0
0
16 Nov 2023
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation
Yikun Wang
Rui Zheng
Haoming Li
Qi Zhang
Tao Gui
Fei Liu
OffRL
58
4
0
15 Nov 2023
Formal Proofs as Structured Explanations: Proposing Several Tasks on Explainable Natural Language Inference
Lasha Abzianidze
LRM
XAI
130
0
0
15 Nov 2023
Using Natural Language Explanations to Improve Robustness of In-context Learning
Xuanli He
Yuxiang Wu
Oana-Maria Camburu
Pasquale Minervini
Pontus Stenetorp
AAML
68
1
0
13 Nov 2023
Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study
Maarten De Raedt
Semere Kiros Bitew
Fréderic Godin
Thomas Demeester
Chris Develder
82
4
0
11 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability
Jishnu Ray Chowdhury
Cornelia Caragea
78
5
0
08 Nov 2023
Data Factors for Better Compositional Generalization
Xiang Zhou
Yichen Jiang
Mohit Bansal
CoGe
OOD
71
5
0
08 Nov 2023
Principles from Clinical Research for NLP Model Generalization
Aparna Elangovan
Jiayuan He
Yuan Li
Karin Verspoor
CML
104
4
0
07 Nov 2023
Measuring Adversarial Datasets
Yuanchen Bai
Raoyi Huang
Vijay Viswanathan
Tzu-Sheng Kuo
Tongshuang Wu
83
1
0
06 Nov 2023
Invariant-Feature Subspace Recovery: A New Class of Provable Domain Generalization Algorithms
Haoxiang Wang
Gargi Balasubramaniam
Haozhe Si
Bo Li
Han Zhao
OOD
79
2
0
02 Nov 2023
Construction Artifacts in Metaphor Identification Datasets
Joanne Boisson
Luis Espinosa-Anke
Jose Camacho-Collados
28
3
0
01 Nov 2023
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models
Xiaoyue Wang
Xin Liu
Lijie Wang
Yaoxiang Wang
Jinsong Su
Hua Wu
74
2
0
01 Nov 2023
Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck
Josh Magnus Ludan
Qing Lyu
Yue Yang
Liam Dugan
Mark Yatskar
Chris Callison-Burch
76
5
0
30 Oct 2023
Group Robust Classification Without Any Group Information
Christos Tsirigotis
João Monteiro
Pau Rodríguez
David Vazquez
Aaron Courville
OOD
89
23
0
28 Oct 2023
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes
Á. Lelkes
Eric Loreaux
Tal Schuster
Ming-Jun Chen
Alvin Rajkomar
78
2
0
27 Oct 2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
Shayne Longpre
Robert Mahari
Anthony Chen
Naana Obeng-Marnu
Damien Sileo
...
K. Bollacker
Tongshuang Wu
Luis Villa
Sandy Pentland
Sara Hooker
95
65
0
25 Oct 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining
Ting-Rui Chiang
Dani Yogatama
57
1
0
25 Oct 2023
On the Foundations of Shortcut Learning
Katherine Hermann
Hossein Mobahi
Thomas Fel
M. C. Mozer
VLM
134
33
0
24 Oct 2023
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning
Zheyuan Zhang
Shane Storks
Fengyuan Hu
Sungryull Sohn
Moontae Lee
Honglak Lee
Joyce Chai
LRM
73
4
0
24 Oct 2023
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification
Yingjie Zhu
Jiasheng Si
Yibo Zhao
Haiyang Zhu
Deyu Zhou
Yulan He
91
7
0
23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization
Mohammad Reza Ghasemi Madani
Pasquale Minervini
91
4
0
22 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets
Sagnik Ray Choudhury
Jushaan Kalra
43
0
0
20 Oct 2023
Ecologically Valid Explanations for Label Variation in NLI
Nan-Jiang Jiang
Chenhao Tan
M. Marneffe
FAtt
72
6
0
20 Oct 2023
How Much Consistency Is Your Accuracy Worth?
Jacob K. Johnson
Ana Marasović
53
1
0
20 Oct 2023
Previous
1
2
3
4
5
6
...
14
15
16
Next