v1v2 (latest)

Annotation Artifacts in Natural Language Inference Data

6 March 2018

Papers citing "Annotation Artifacts in Natural Language Inference Data"

50 / 796 papers shown

Title
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks Somnath Banerjee Maulindu Sarkar Punyajoy Saha Binny Mathew Animesh Mukherjee TDI 59 0 0 22 Feb 2024
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment William Merrill Zhaofeng Wu Norihito Naka Yoon Kim Tal Linzen 120 9 0 21 Feb 2024
SaGE: Evaluating Moral Consistency in Large Language Models Vamshi Krishna Bonagiri Sreeram Vennam Priyanshul Govil Ponnurangam Kumaraguru Manas Gaur ELM 85 0 0 21 Feb 2024
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question? Nishant Balepur Abhilasha Ravichander Rachel Rudinger ELM 120 28 0 19 Feb 2024
A synthetic data approach for domain generalization of NLI models Mohammad Javad Hosseini Andrey Petrov Alex Fabrikant Annie Louis SyDa 84 10 0 19 Feb 2024
Punctuation Restoration Improves Structure Understanding Without Supervision Junghyun Min Minho Lee Woochul Lee Yeonsoo Lee 157 1 0 13 Feb 2024
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models Marc Braun Jenny Kunz 40 3 0 07 Feb 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification Soumya Sanyal Tianyi Xiao Jiacheng Liu Wenya Wang Xiang Ren LRM ReLM 129 12 0 06 Feb 2024
Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models Sara Rajaee Christof Monz 80 4 0 03 Feb 2024
Comparing Template-based and Template-free Language Model Probing Sagi Shaier Kevin Bennett Lawrence E Hunter Katharina von der Wense ELM 96 4 0 31 Jan 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation? Rheeya Uppaal Yixuan Li Junjie Hu 146 6 0 31 Jan 2024
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models Erik Arakelyan Zhaoqi Liu Isabelle Augenstein AAML 145 12 0 25 Jan 2024
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation Yoo Yeon Sung Ishani Mondal Jordan L. Boyd-Graber 70 0 0 20 Jan 2024
Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty Kaitlyn Zhou Jena D. Hwang Xiang Ren Maarten Sap 88 68 0 12 Jan 2024
Self-Supervised Position Debiasing for Large Language Models Zhongkun Liu Zheng Chen Mengqi Zhang Zhaochun Ren Pengjie Ren Zhumin Chen 76 1 0 02 Jan 2024
Zero-Shot Fact-Checking with Semantic Triples and Knowledge Graphs Moy Yuan Andreas Vlachos 92 7 0 19 Dec 2023
Discovering Highly Influential Shortcut Reasoning: An Automated Template-Free Approach Daichi Haraguchi Kiyoaki Shirai Naoya Inoue Natthawut Kertkeidkachorn LRM 33 0 0 15 Dec 2023
PROPRES: Investigating the Projectivity of Presupposition with Various Triggers and Environments Daiki Asami Saku Sugawara 30 1 0 14 Dec 2023
Dissecting vocabulary biases datasets through statistical testing and automated data augmentation for artifact mitigation in Natural Language Inference Dat Thanh Nguyen 30 0 0 14 Dec 2023
RoAST: Robustifying Language Models via Adversarial Perturbation with Selective Training Jaehyung Kim Yuning Mao Rui Hou Hanchao Yu Davis Liang Pascale Fung Qifan Wang Fuli Feng Lifu Huang Madian Khabsa AAML 60 4 0 07 Dec 2023
Improving Bias Mitigation through Bias Experts in Natural Language Understanding Eojin Jeon Mingyu Lee Juhyeong Park Yeachan Kim Wing-Lam Mok SangKeun Lee 42 2 0 06 Dec 2023
Eliciting Latent Knowledge from Quirky Language Models Alex Troy Mallen Madeline Brumley Julia Kharchenko Nora Belrose HILM RALM KELM 88 33 0 02 Dec 2023
The Case for Scalable, Data-Driven Theory: A Paradigm for Scientific Progress in NLP Julian Michael 49 1 0 01 Dec 2023
WorldSense: A Synthetic Benchmark for Grounded Reasoning in Large Language Models Youssef Benchekroun Megi Dervishi Mark Ibrahim Jean-Baptiste Gaya Xavier Martinet Grégoire Mialon Thomas Scialom Emmanuel Dupoux Dieuwke Hupkes Pascal Vincent LRM 63 8 0 27 Nov 2023
Human Learning by Model Feedback: The Dynamics of Iterative Prompting with Midjourney Shachar Don-Yehiya Leshem Choshen Omri Abend 63 7 0 20 Nov 2023
Latent Feature-based Data Splits to Improve Generalisation Evaluation: A Hate Speech Detection Case Study Maike Zufle Verna Dankers Ivan Titov 91 0 0 16 Nov 2023
Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals Yanai Elazar Bhargavi Paranjape Hao Peng Sarah Wiegreffe Khyathi Raghavi Vivek Srikumar Sameer Singh Noah A. Smith AAML OOD 65 0 0 16 Nov 2023
Rescue: Ranking LLM Responses with Partial Ordering to Improve Response Generation Yikun Wang Rui Zheng Haoming Li Qi Zhang Tao Gui Fei Liu OffRL 58 4 0 15 Nov 2023
Formal Proofs as Structured Explanations: Proposing Several Tasks on Explainable Natural Language Inference Lasha Abzianidze LRM XAI 130 0 0 15 Nov 2023
Using Natural Language Explanations to Improve Robustness of In-context Learning Xuanli He Yuxiang Wu Oana-Maria Camburu Pasquale Minervini Pontus Stenetorp AAML 68 1 0 13 Nov 2023
Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study Maarten De Raedt Semere Kiros Bitew Fréderic Godin Thomas Demeester Chris Develder 82 4 0 11 Nov 2023
Recursion in Recursion: Two-Level Nested Recursion for Length Generalization with Scalability Jishnu Ray Chowdhury Cornelia Caragea 78 5 0 08 Nov 2023
Data Factors for Better Compositional Generalization Xiang Zhou Yichen Jiang Mohit Bansal CoGe OOD 71 5 0 08 Nov 2023
Principles from Clinical Research for NLP Model Generalization Aparna Elangovan Jiayuan He Yuan Li Karin Verspoor CML 104 4 0 07 Nov 2023
Measuring Adversarial Datasets Yuanchen Bai Raoyi Huang Vijay Viswanathan Tzu-Sheng Kuo Tongshuang Wu 83 1 0 06 Nov 2023
Invariant-Feature Subspace Recovery: A New Class of Provable Domain Generalization Algorithms Haoxiang Wang Gargi Balasubramaniam Haozhe Si Bo Li Han Zhao OOD 79 2 0 02 Nov 2023
Construction Artifacts in Metaphor Identification Datasets Joanne Boisson Luis Espinosa-Anke Jose Camacho-Collados 28 3 0 01 Nov 2023
IBADR: an Iterative Bias-Aware Dataset Refinement Framework for Debiasing NLU models Xiaoyue Wang Xin Liu Lijie Wang Yaoxiang Wang Jinsong Su Hua Wu 74 2 0 01 Nov 2023
Interpretable-by-Design Text Understanding with Iteratively Generated Concept Bottleneck Josh Magnus Ludan Qing Lyu Yue Yang Liam Dugan Mark Yatskar Chris Callison-Burch 76 5 0 30 Oct 2023
Group Robust Classification Without Any Group Information Christos Tsirigotis João Monteiro Pau Rodríguez David Vazquez Aaron Courville OOD 89 23 0 28 Oct 2023
SDOH-NLI: a Dataset for Inferring Social Determinants of Health from Clinical Notes Á. Lelkes Eric Loreaux Tal Schuster Ming-Jun Chen Alvin Rajkomar 78 2 0 27 Oct 2023
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI Shayne Longpre Robert Mahari Anthony Chen Naana Obeng-Marnu Damien Sileo ... K. Bollacker Tongshuang Wu Luis Villa Sandy Pentland Sara Hooker 95 65 0 25 Oct 2023
The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining Ting-Rui Chiang Dani Yogatama 57 1 0 25 Oct 2023
On the Foundations of Shortcut Learning Katherine Hermann Hossein Mobahi Thomas Fel M. C. Mozer VLM 134 33 0 24 Oct 2023
From Heuristic to Analytic: Cognitively Motivated Strategies for Coherent Physical Commonsense Reasoning Zheyuan Zhang Shane Storks Fengyuan Hu Sungryull Sohn Moontae Lee Honglak Lee Joyce Chai LRM 73 4 0 24 Oct 2023
EXPLAIN, EDIT, GENERATE: Rationale-Sensitive Counterfactual Data Augmentation for Multi-hop Fact Verification Yingjie Zhu Jiasheng Si Yibo Zhao Haiyang Zhu Deyu Zhou Yulan He 91 7 0 23 Oct 2023
REFER: An End-to-end Rationale Extraction Framework for Explanation Regularization Mohammad Reza Ghasemi Madani Pasquale Minervini 91 4 0 22 Oct 2023
Implications of Annotation Artifacts in Edge Probing Test Datasets Sagnik Ray Choudhury Jushaan Kalra 43 0 0 20 Oct 2023
Ecologically Valid Explanations for Label Variation in NLI Nan-Jiang Jiang Chenhao Tan M. Marneffe FAtt 72 6 0 20 Oct 2023
How Much Consistency Is Your Accuracy Worth? Jacob K. Johnson Ana Marasović 53 1 0 20 Oct 2023