Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1803.02324
Cited By
Annotation Artifacts in Natural Language Inference Data
6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Annotation Artifacts in Natural Language Inference Data"
50 / 783 papers shown
Title
The Group Robustness is in the Details: Revisiting Finetuning under Spurious Correlations
Tyler LaBonte
John C. Hill
Xinchen Zhang
Vidya Muthukumar
Abhishek Kumar
AAML
41
0
0
19 Jul 2024
LitSearch: A Retrieval Benchmark for Scientific Literature Search
Anirudh Ajith
Mengzhou Xia
Alexis Chevalier
Tanya Goyal
Danqi Chen
Tianyu Gao
RALM
53
11
0
10 Jul 2024
An LLM Feature-based Framework for Dialogue Constructiveness Assessment
Lexin Zhou
Youmna Farag
Andreas Vlachos
50
2
0
20 Jun 2024
LLMs Are Prone to Fallacies in Causal Inference
Nitish Joshi
Abulhair Saparov
Yixin Wang
He He
53
10
0
18 Jun 2024
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding
Ukyo Honda
Tatsushi Oka
Peinan Zhang
Masato Mita
52
1
0
17 Jun 2024
MoE-RBench
\texttt{MoE-RBench}
MoE-RBench
: Towards Building Reliable Language Models with Sparse Mixture-of-Experts
Guanjie Chen
Xinyu Zhao
Tianlong Chen
Yu Cheng
MoE
83
5
0
17 Jun 2024
KGPA: Robustness Evaluation for Large Language Models via Cross-Domain Knowledge Graphs
Aihua Pei
Zehua Yang
Shunan Zhu
Ruoxi Cheng
Ju Jia
Lina Wang
45
1
0
16 Jun 2024
ECBD: Evidence-Centered Benchmark Design for NLP
Yu Lu Liu
Su Lin Blodgett
Jackie Chi Kit Cheung
Q. Vera Liao
Alexandra Olteanu
Ziang Xiao
36
10
0
13 Jun 2024
DCA-Bench: A Benchmark for Dataset Curation Agents
Benhao Huang
Yingzhuo Yu
Jin Huang
Xingjian Zhang
Jiaqi Ma
36
1
0
11 Jun 2024
mCSQA: Multilingual Commonsense Reasoning Dataset with Unified Creation Strategy by Language Models and Humans
Yusuke Sakai
Hidetaka Kamigaito
Taro Watanabe
LRM
46
3
0
06 Jun 2024
Are We Done with MMLU?
Aryo Pradipta Gema
Joshua Ong Jun Leang
Giwon Hong
Alessio Devoto
Alberto Carlo Maria Mancino
...
R. McHardy
Joshua Harris
Jean Kaddour
Emile van Krieken
Pasquale Minervini
ELM
60
31
0
06 Jun 2024
What Makes Language Models Good-enough?
Daiki Asami
Saku Sugawara
37
1
0
06 Jun 2024
Sharpness-Aware Minimization Enhances Feature Quality via Balanced Learning
Jacob Mitchell Springer
Vaishnavh Nagarajan
Aditi Raghunathan
44
5
0
30 May 2024
Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation
Chengwei Dai
Kun Li
Wei Zhou
Song Hu
LRM
52
3
0
30 May 2024
ConSiDERS-The-Human Evaluation Framework: Rethinking Human Evaluation for Generative Large Language Models
Aparna Elangovan
Ling Liu
Lei Xu
S. Bodapati
Dan Roth
ELM
35
9
0
28 May 2024
Views Can Be Deceiving: Improved SSL Through Feature Space Augmentation
Kimia Hamidieh
Haoran Zhang
Swami Sankaranarayanan
Marzyeh Ghassemi
52
0
0
28 May 2024
Resolving Word Vagueness with Scenario-guided Adapter for Natural Language Inference
Yuqi Liu
Mengyu Li
Di Liang
Ximing Li
Fausto Giunchiglia
Lan Huang
Xiaoyue Feng
Renchu Guan
42
3
0
21 May 2024
A Novel Cartography-Based Curriculum Learning Method Applied on RoNLI: The First Romanian Natural Language Inference Corpus
Eduard Poesina
Cornelia Caragea
Radu Tudor Ionescu
23
5
0
20 May 2024
Elements of World Knowledge (EWOK): A cognition-inspired framework for evaluating basic world knowledge in language models
Anna A. Ivanova
Aalok Sathe
Benjamin Lipkin
Unnathi Kumar
S. Radkani
...
Leshem Choshen
Roger Levy
Evelina Fedorenko
Josh Tenenbaum
Jacob Andreas
46
24
0
15 May 2024
Logical Negation Augmenting and Debiasing for Prompt-based Methods
Yitian Li
Jidong Tian
Hao He
Yaohui Jin
43
0
0
08 May 2024
Philosophy of Cognitive Science in the Age of Deep Learning
Raphaël Millière
AI4CE
NAI
43
3
0
07 May 2024
A Philosophical Introduction to Language Models - Part II: The Way Forward
Raphael Milliere
Cameron Buckner
LRM
66
14
0
06 May 2024
Examining the robustness of LLM evaluation to the distributional assumptions of benchmarks
Melissa Ailem
Katerina Marazopoulou
Charlotte Siska
James Bono
59
15
0
25 Apr 2024
Does It Make Sense to Explain a Black Box With Another Black Box?
J. Delaunay
Luis Galárraga
Christine Largouet
AAML
21
1
0
23 Apr 2024
Improving Group Robustness on Spurious Correlation Requires Preciser Group Inference
Yujin Han
Difan Zou
AAML
52
3
0
22 Apr 2024
Explanation based Bias Decoupling Regularization for Natural Language Inference
Jianxiang Zang
Hui Liu
16
0
0
20 Apr 2024
From Form(s) to Meaning: Probing the Semantic Depths of Language Models Using Multisense Consistency
Xenia Ohmer
Elia Bruni
Dieuwke Hupkes
AI4CE
39
6
0
18 Apr 2024
How often are errors in natural language reasoning due to paraphrastic variability?
Neha Srikanth
Marine Carpuat
Rachel Rudinger
LRM
35
2
0
17 Apr 2024
MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents
Liyan Tang
Philippe Laban
Greg Durrett
HILM
SyDa
43
78
0
16 Apr 2024
MSciNLI: A Diverse Benchmark for Scientific Natural Language Inference
Mobashir Sadat
Cornelia Caragea
40
4
0
11 Apr 2024
XNLIeu: a dataset for cross-lingual NLI in Basque
Maite Heredia
Julen Etxaniz
Muitze Zulaika
X. Saralegi
Jeremy Barnes
A. Soroa
18
0
0
10 Apr 2024
Two Heads are Better than One: Nested PoE for Robust Defense Against Multi-Backdoors
Victoria Graf
Qin Liu
Muhao Chen
AAML
40
8
0
02 Apr 2024
Fairness in Large Language Models: A Taxonomic Survey
Zhibo Chu
Zichong Wang
Wenbin Zhang
AILaw
48
33
0
31 Mar 2024
Debiasing surgeon: fantastic weights and how to find them
Rémi Nahon
Ivan Luiz De Moura Matos
Van-Tam Nguyen
Enzo Tartaglione
36
1
0
21 Mar 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
56
17
0
28 Feb 2024
MultiContrievers: Analysis of Dense Retrieval Representations
Seraphina Goldfarb-Tarrant
Pedro Rodriguez
Jane Dwivedi-Yu
Patrick Lewis
33
1
0
24 Feb 2024
InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks
Somnath Banerjee
Maulindu Sarkar
Punyajoy Saha
Binny Mathew
Animesh Mukherjee
TDI
34
0
0
22 Feb 2024
Can You Learn Semantics Through Next-Word Prediction? The Case of Entailment
William Merrill
Zhaofeng Wu
Norihito Naka
Yoon Kim
Tal Linzen
49
7
0
21 Feb 2024
SaGE: Evaluating Moral Consistency in Large Language Models
Vamshi Krishna Bonagiri
Sreeram Vennam
Priyanshul Govil
Ponnurangam Kumaraguru
Manas Gaur
ELM
56
0
0
21 Feb 2024
Artifacts or Abduction: How Do LLMs Answer Multiple-Choice Questions Without the Question?
Nishant Balepur
Abhilasha Ravichander
Rachel Rudinger
ELM
47
19
0
19 Feb 2024
A synthetic data approach for domain generalization of NLI models
Mohammad Javad Hosseini
Andrey Petrov
Alex Fabrikant
Annie Louis
SyDa
38
8
0
19 Feb 2024
Punctuation Restoration Improves Structure Understanding Without Supervision
Junghyun Min
Minho Lee
Woochul Lee
Yeonsoo Lee
62
1
0
13 Feb 2024
A Hypothesis-Driven Framework for the Analysis of Self-Rationalising Models
Marc Braun
Jenny Kunz
18
2
0
07 Feb 2024
Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification
Soumya Sanyal
Tianyi Xiao
Jiacheng Liu
Wenya Wang
Xiang Ren
LRM
ReLM
53
12
0
06 Feb 2024
Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models
Sara Rajaee
Christof Monz
30
3
0
03 Feb 2024
Comparing Template-based and Template-free Language Model Probing
Sagi Shaier
Kevin Bennett
Lawrence E Hunter
K. Wense
ELM
36
3
0
31 Jan 2024
How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation?
Rheeya Uppaal
Yixuan Li
Junjie Hu
37
4
0
31 Jan 2024
Semantic Sensitivities and Inconsistent Predictions: Measuring the Fragility of NLI Models
Erik Arakelyan
Zhaoqi Liu
Isabelle Augenstein
AAML
45
10
0
25 Jan 2024
How the Advent of Ubiquitous Large Language Models both Stymie and Turbocharge Dynamic Adversarial Question Generation
Yoo Yeon Sung
Ishani Mondal
Jordan L. Boyd-Graber
30
0
0
20 Jan 2024
Relying on the Unreliable: The Impact of Language Models' Reluctance to Express Uncertainty
Kaitlyn Zhou
Jena D. Hwang
Xiang Ren
Maarten Sap
36
54
0
12 Jan 2024
Previous
1
2
3
4
5
...
14
15
16
Next