What Can We Learn from Collective Human Opinions on Natural Language Inference Data?

7 October 2020

Papers citing "What Can We Learn from Collective Human Opinions on Natural Language Inference Data?"

30 / 30 papers shown

Title
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation Liaoyaqi Wang Zhengping Jiang Anqi Liu Benjamin Van Durme 61 0 0 02 May 2025
Validating LLM-as-a-Judge Systems in the Absence of Gold Labels Luke M. Guerdan Solon Barocas Kenneth Holstein Hanna M. Wallach Zhiwei Steven Wu Alexandra Chouldechova ALM ELM 263 0 0 13 Mar 2025
FOReCAst: The Future Outcome Reasoning and Confidence Assessment Benchmark Zhangdie Yuan Zifeng Ding Andreas Vlachos AI4TS 82 0 0 27 Feb 2025
Fine-grained Fallacy Detection with Human Label Variation Alan Ramponi Agnese Daffara Sara Tonelli 58 1 0 20 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study Kemal Kurniawan Meladel Mistica Timothy Baldwin Jey Han Lau 67 0 0 03 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs Tongshuang Wu Haiyi Zhu Maya Albayrak Alexis Axon Amanda Bertsch ... Ying-Jui Tseng Patricia Vaidos Zhijin Wu Wei Wu Chenyang Yang 88 31 0 10 Jan 2025
Conformalized Credal Regions for Classification with Ambiguous Ground Truth Michele Caprio David Stutz Shuo Li Arnaud Doucet UQCV 70 4 0 07 Nov 2024
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions Michael J.Q. Zhang W. Bradley Knox Eunsol Choi 50 4 0 17 Oct 2024
Can Humans Identify Domains? Maria Barrett Max Müller-Eberstein Elisa Bassignana Amalie Brogaard Pauli Mike Zhang Rob van der Goot 47 1 0 02 Apr 2024
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes Timothee Mickus Elaine Zosa Raúl Vázquez Teemu Vahtola Jörg Tiedemann Vincent Segonne Alessandro Raganato Marianna Apidianaki HILM LRM 43 21 0 12 Mar 2024
Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments Liesbeth Allein Maria Mihaela Trucscva Marie-Francine Moens 45 1 0 27 Nov 2023
Collective Human Opinions in Semantic Textual Similarity Yuxia Wang Shimin Tao Ning Xie Hao Yang Timothy Baldwin Karin Verspoor 29 4 0 08 Aug 2023
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference Animesh Nighojkar Antonio Laverghetta John Licato 36 4 0 16 Jun 2023
Deep Model Compression Also Helps Models Capture Ambiguity Hancheol Park Jong C. Park 33 1 0 12 Jun 2023
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation Nan-Jiang Jiang Chenhao Tan M. Marneffe 35 2 0 24 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging Aarne Talman H. Çelikkanat Sami Virpioja Markus Heinonen Jörg Tiedemann BDL UQCV 26 7 0 10 Apr 2023
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design Valentina Pyatkin Frances Yung Merel C. J. Scholman Reut Tsarfaty Ido Dagan Vera Demberg 27 12 0 03 Apr 2023
Investigating Multi-source Active Learning for Natural Language Inference Ard Snijders Douwe Kiela Katerina Margatina 24 7 0 14 Feb 2023
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization Zhenyu Lu 20 1 0 16 Dec 2022
Sources of Noise in Dialogue and How to Deal with Them Derek Chen Zhou Yu 29 2 0 06 Dec 2022
The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation Barbara Plank 30 97 0 04 Nov 2022
Stop Measuring Calibration When Humans Disagree Joris Baan Wilker Aziz Barbara Plank Raquel Fernández 29 53 0 28 Oct 2022
Investigating Reasons for Disagreement in Natural Language Inference Nan-Jiang Jiang M. Marneffe 27 26 0 07 Sep 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness Shujian Zhang Chengyue Gong Xingchao Liu Pengcheng He Weizhu Chen Mingyuan Zhou 27 26 0 10 May 2022
Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments Christopher Hidey Fei Liu Rahul Goel 32 4 0 10 Apr 2022
What Makes Reading Comprehension Questions Difficult? Saku Sugawara Nikita Nangia Alex Warstadt Sam Bowman ELM RALM 20 13 0 12 Mar 2022
Natural Language Deduction through Search over Statement Compositions Kaj Bostrom Zayne Sprague Swarat Chaudhuri Greg Durrett ReLM LRM 27 46 0 16 Jan 2022
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair Jason Phang Angelica Chen William Huang Samuel R. Bowman AAML 28 13 0 16 Nov 2021
Can Transformer Language Models Predict Psychometric Properties? Antonio Laverghetta Animesh Nighojkar Jamshidbek Mirzakhalov John Licato LM&MA 38 14 0 12 Jun 2021
ANLIzing the Adversarial Natural Language Inference Dataset Adina Williams Tristan Thrush Douwe Kiela AAML 183 46 0 24 Oct 2020