Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2010.03532
Cited By
What Can We Learn from Collective Human Opinions on Natural Language Inference Data?
7 October 2020
Yixin Nie
Xiang Zhou
Joey Tianyi Zhou
Re-assign community
ArXiv
PDF
HTML
Papers citing
"What Can We Learn from Collective Human Opinions on Natural Language Inference Data?"
30 / 30 papers shown
Title
Always Tell Me The Odds: Fine-grained Conditional Probability Estimation
Liaoyaqi Wang
Zhengping Jiang
Anqi Liu
Benjamin Van Durme
61
0
0
02 May 2025
Validating LLM-as-a-Judge Systems in the Absence of Gold Labels
Luke M. Guerdan
Solon Barocas
Kenneth Holstein
Hanna M. Wallach
Zhiwei Steven Wu
Alexandra Chouldechova
ALM
ELM
263
0
0
13 Mar 2025
FOReCAst: The Future Outcome Reasoning and Confidence Assessment Benchmark
Zhangdie Yuan
Zifeng Ding
Andreas Vlachos
AI4TS
82
0
0
27 Feb 2025
Fine-grained Fallacy Detection with Human Label Variation
Alan Ramponi
Agnese Daffara
Sara Tonelli
58
1
0
20 Feb 2025
Training and Evaluating with Human Label Variation: An Empirical Study
Kemal Kurniawan
Meladel Mistica
Timothy Baldwin
Jey Han Lau
67
0
0
03 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
88
31
0
10 Jan 2025
Conformalized Credal Regions for Classification with Ambiguous Ground Truth
Michele Caprio
David Stutz
Shuo Li
Arnaud Doucet
UQCV
70
4
0
07 Nov 2024
Modeling Future Conversation Turns to Teach LLMs to Ask Clarifying Questions
Michael J.Q. Zhang
W. Bradley Knox
Eunsol Choi
50
4
0
17 Oct 2024
Can Humans Identify Domains?
Maria Barrett
Max Müller-Eberstein
Elisa Bassignana
Amalie Brogaard Pauli
Mike Zhang
Rob van der Goot
47
1
0
02 Apr 2024
SemEval-2024 Shared Task 6: SHROOM, a Shared-task on Hallucinations and Related Observable Overgeneration Mistakes
Timothee Mickus
Elaine Zosa
Raúl Vázquez
Teemu Vahtola
Jörg Tiedemann
Vincent Segonne
Alessandro Raganato
Marianna Apidianaki
HILM
LRM
43
21
0
12 Mar 2024
Interpretation modeling: Social grounding of sentences by reasoning over their implicit moral judgments
Liesbeth Allein
Maria Mihaela Trucscva
Marie-Francine Moens
45
1
0
27 Nov 2023
Collective Human Opinions in Semantic Textual Similarity
Yuxia Wang
Shimin Tao
Ning Xie
Hao Yang
Timothy Baldwin
Karin Verspoor
29
4
0
08 Aug 2023
No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference
Animesh Nighojkar
Antonio Laverghetta
John Licato
36
4
0
16 Jun 2023
Deep Model Compression Also Helps Models Capture Ambiguity
Hancheol Park
Jong C. Park
33
1
0
12 Jun 2023
Understanding and Predicting Human Label Variation in Natural Language Inference through Explanation
Nan-Jiang Jiang
Chenhao Tan
M. Marneffe
35
2
0
24 Apr 2023
Uncertainty-Aware Natural Language Inference with Stochastic Weight Averaging
Aarne Talman
H. Çelikkanat
Sami Virpioja
Markus Heinonen
Jörg Tiedemann
BDL
UQCV
26
7
0
10 Apr 2023
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design
Valentina Pyatkin
Frances Yung
Merel C. J. Scholman
Reut Tsarfaty
Ido Dagan
Vera Demberg
27
12
0
03 Apr 2023
Investigating Multi-source Active Learning for Natural Language Inference
Ard Snijders
Douwe Kiela
Katerina Margatina
24
7
0
14 Feb 2023
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization
Zhenyu Lu
20
1
0
16 Dec 2022
Sources of Noise in Dialogue and How to Deal with Them
Derek Chen
Zhou Yu
29
2
0
06 Dec 2022
The 'Problem' of Human Label Variation: On Ground Truth in Data, Modeling and Evaluation
Barbara Plank
30
97
0
04 Nov 2022
Stop Measuring Calibration When Humans Disagree
Joris Baan
Wilker Aziz
Barbara Plank
Raquel Fernández
29
53
0
28 Oct 2022
Investigating Reasons for Disagreement in Natural Language Inference
Nan-Jiang Jiang
M. Marneffe
27
26
0
07 Sep 2022
ALLSH: Active Learning Guided by Local Sensitivity and Hardness
Shujian Zhang
Chengyue Gong
Xingchao Liu
Pengcheng He
Weizhu Chen
Mingyuan Zhou
27
26
0
10 May 2022
Reducing Model Jitter: Stable Re-training of Semantic Parsers in Production Environments
Christopher Hidey
Fei Liu
Rahul Goel
32
4
0
10 Apr 2022
What Makes Reading Comprehension Questions Difficult?
Saku Sugawara
Nikita Nangia
Alex Warstadt
Sam Bowman
ELM
RALM
20
13
0
12 Mar 2022
Natural Language Deduction through Search over Statement Compositions
Kaj Bostrom
Zayne Sprague
Swarat Chaudhuri
Greg Durrett
ReLM
LRM
27
46
0
16 Jan 2022
Adversarially Constructed Evaluation Sets Are More Challenging, but May Not Be Fair
Jason Phang
Angelica Chen
William Huang
Samuel R. Bowman
AAML
28
13
0
16 Nov 2021
Can Transformer Language Models Predict Psychometric Properties?
Antonio Laverghetta
Animesh Nighojkar
Jamshidbek Mirzakhalov
John Licato
LM&MA
38
14
0
12 Jun 2021
ANLIzing the Adversarial Natural Language Inference Dataset
Adina Williams
Tristan Thrush
Douwe Kiela
AAML
183
46
0
24 Oct 2020
1