1803.02324
Annotation Artifacts in Natural Language Inference Data
6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
Papers citing "Annotation Artifacts in Natural Language Inference Data"
50 / 783 papers shown
Title
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports
Mael Jullien
Marco Valentino
H. Frost
Paul O'Regan
Dónal Landers
André Freitas
27
28
0
05 May 2023
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
Damien Sileo
Antoine Lernould
LRM
16
25
0
05 May 2023
SCOTT: Self-Consistent Chain-of-Thought Distillation
Jamie Yap
Zhengyang Wang
Zheng Li
K. Lynch
Bing Yin
Xiang Ren
LRM
78
93
0
03 May 2023
Information Redundancy and Biases in Public Document Information Extraction Benchmarks
S. Laatiri
Pirashanth Ratnamogan
Joel Tang
Laurent Lam
William Vanhuffel
Fabien Caspani
33
1
0
28 Apr 2023
DataComp: In search of the next generation of multimodal datasets
S. Gadre
Gabriel Ilharco
Alex Fang
J. Hayase
Georgios Smyrnis
...
A. Dimakis
J. Jitsev
Y. Carmon
Vaishaal Shankar
Ludwig Schmidt
VLM
33
415
0
27 Apr 2023
GPT-NER: Named Entity Recognition via Large Language Models
Shuhe Wang
Xiaofei Sun
Xiaoya Li
Rongbin Ouyang
Fei Wu
Tianwei Zhang
Jiwei Li
Guoyin Wang
39
182
0
20 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
44
39
0
18 Apr 2023
LINGO: Visually Debiasing Natural Language Instructions to Support Task Diversity
Anjana Arunkumar
Shubham Sharma
Rakhi Agrawal
Sriramakrishnan Chandrasekaran
Chris Bryan
34
0
0
12 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
36
248
0
07 Apr 2023
Inspecting and Editing Knowledge Representations in Language Models
Evan Hernandez
Belinda Z. Li
Jacob Andreas
KELM
24
80
0
03 Apr 2023
Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models
A. Maharana
Amita Kamath
Christopher Clark
Joey Tianyi Zhou
Aniruddha Kembhavi
40
3
0
28 Mar 2023
Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards
Chanjun Park
Hyeonseok Moon
Seolhwa Lee
Jaehyung Seo
Sugyeong Eo
Heu-Jeoung Lim
25
2
0
20 Mar 2023
Distributionally Robust Optimization with Probabilistic Group
Soumya Suvra Ghosal
Yixuan Li
OOD
16
7
0
10 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
15
28
0
07 Mar 2023
Contrastive variational information bottleneck for aspect-based sentiment analysis
Ming-Wei Chang
Min Yang
Qingshan Jiang
Ruifeng Xu
35
4
0
06 Mar 2023
WiCE: Real-World Entailment for Claims in Wikipedia
Ryo Kamoi
Tanya Goyal
Juan Diego Rodriguez
Greg Durrett
41
81
0
02 Mar 2023
SMoA: Sparse Mixture of Adapters to Mitigate Multiple Dataset Biases
Yanchen Liu
Jing Yang
Yan Chen
Jing Liu
Huaqin Wu
MoE
47
2
0
28 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
49
4
0
23 Feb 2023
Parallel Sentence-Level Explanation Generation for Real-World Low-Resource Scenarios
Yu Liu
Xiaokang Chen
Qianwen Dai
LRM
22
4
0
21 Feb 2023
Empirical Investigation of Neural Symbolic Reasoning Strategies
Yoichi Aoki
Keito Kudo
Tatsuki Kuribayashi
Ana Brassard
Masashi Yoshikawa
Keisuke Sakaguchi
Kentaro Inui
29
2
0
16 Feb 2023
ScatterShot: Interactive In-context Example Curation for Text Transformation
Tongshuang Wu
Hua Shen
Daniel S. Weld
Jeffrey Heer
Marco Tulio Ribeiro
24
23
0
14 Feb 2023
Investigating Multi-source Active Learning for Natural Language Inference
Ard Snijders
Douwe Kiela
Katerina Margatina
26
7
0
14 Feb 2023
Benchmarks for Automated Commonsense Reasoning: A Survey
E. Davis
ELM
LRM
24
58
0
09 Feb 2023
Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow
Anjana Arunkumar
Swaroop Mishra
Bhavdeep Singh Sachdeva
Chitta Baral
Chris Bryan
33
0
0
09 Feb 2023
Guide the Learner: Controlling Product of Experts Debiasing Method Based on Token Attribution Similarities
Ali Modarressi
Hossein Amirkhani
Mohammad Taher Pilehvar
29
2
0
06 Feb 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar
S. Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
18
9
0
21 Jan 2023
KL Regularized Normalization Framework for Low Resource Tasks
Neeraj Kumar
Ankur Narang
Brejesh Lall
31
1
0
21 Dec 2022
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
42
25
0
20 Dec 2022
Task Ambiguity in Humans and Language Models
Alex Tamkin
Kunal Handa
Ava Shrestha
Noah D. Goodman
UQLM
44
22
0
20 Dec 2022
Evaluation for Change
Rishi Bommasani
ELM
40
0
0
20 Dec 2022
Debiasing Stance Detection Models with Counterfactual Reasoning and Adversarial Bias Learning
Jianhua Yuan
Yanyan Zhao
Bing Qin
52
4
0
20 Dec 2022
To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering
Dheeru Dua
Emma Strubell
Sameer Singh
Pat Verga
OOD
50
3
0
20 Dec 2022
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
ALM
48
363
0
19 Dec 2022
Statistical Dataset Evaluation: Reliability, Difficulty, and Validity
Chengwen Wang
Qingxiu Dong
Xiaochen Wang
Haitao Wang
Zhifang Sui
XAI
29
3
0
19 Dec 2022
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
29
3
0
18 Dec 2022
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization
Zhenyu Lu
20
1
0
16 Dec 2022
Azimuth: Systematic Error Analysis for Text Classification
Gabrielle Gauthier Melançon
Orlando Marquez Ayala
Lindsay D. Brin
Chris Tyler
Frederic Branchaud-Charron
Joseph Marinier
Karine Grande
Dieu-Thu Le
23
3
0
16 Dec 2022
Feature-Level Debiased Natural Language Understanding
Yougang Lyu
Piji Li
Yechang Yang
Maarten de Rijke
Pengjie Ren
Yukun Zhao
Dawei Yin
Z. Ren
32
10
0
11 Dec 2022
JamPatoisNLI: A Jamaican Patois Natural Language Inference Dataset
Ruth-Ann Armstrong
John Hewitt
Christopher D. Manning
33
14
0
07 Dec 2022
LawngNLI: A Long-Premise Benchmark for In-Domain Generalization from Short to Long Contexts and for Implication-Based Retrieval
William F. Bruno
Dan Roth
ELM
AILaw
25
6
0
06 Dec 2022
AGRO: Adversarial Discovery of Error-prone groups for Robust Optimization
Bhargavi Paranjape
Pradeep Dasigi
Vivek Srikumar
Luke Zettlemoyer
Hannaneh Hajishirzi
36
7
0
02 Dec 2022
Which Shortcut Solution Do Question Answering Models Prefer to Learn?
Kazutoshi Shinoda
Saku Sugawara
Akiko Aizawa
32
6
0
29 Nov 2022
AutoCAD: Automatically Generating Counterfactuals for Mitigating Shortcut Learning
Jiaxin Wen
Yeshuang Zhu
Jinchao Zhang
Jie Zhou
Minlie Huang
CML
AAML
27
8
0
29 Nov 2022
Beyond Counting Datasets: A Survey of Multilingual Dataset Construction and Necessary Resources
Xinyan Velocity Yu
Akari Asai
Trina Chatterjee
Junjie Hu
Eunsol Choi
29
21
0
28 Nov 2022
Attack on Unfair ToS Clause Detection: A Case Study using Universal Adversarial Triggers
Shanshan Xu
Irina Broda
R. Haddad
Marco Negrini
Matthias Grabmair
34
0
0
28 Nov 2022
Chroma-VAE: Mitigating Shortcut Learning with Generative Classifiers
Wanqian Yang
Polina Kirichenko
Micah Goldblum
A. Wilson
DRL
30
10
0
28 Nov 2022
Evaluation Beyond Task Performance: Analyzing Concepts in AlphaZero in Hex
Charles Lovering
Jessica Zosa Forde
George Konidaris
Ellie Pavlick
Michael L. Littman
21
7
0
26 Nov 2022
Using Focal Loss to Fight Shallow Heuristics: An Empirical Analysis of Modulated Cross-Entropy in Natural Language Inference
Frano Rajic
Ivan Stresec
Axel Marmet
Tim Postuvan
32
3
0
23 Nov 2022
Leveraging Data Recasting to Enhance Tabular Reasoning
Aashna Jena
Vivek Gupta
Manish Shrivastava
Julian Martin Eisenschlos
LMTD
30
6
0
23 Nov 2022
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
Linyi Yang
Shuibai Zhang
Libo Qin
Yafu Li
Yidong Wang
Hanmeng Liu
Jindong Wang
Xingxu Xie
Yue Zhang
ELM
52
79
0
15 Nov 2022