ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1803.02324
  4. Cited By
Annotation Artifacts in Natural Language Inference Data
v1v2 (latest)

Annotation Artifacts in Natural Language Inference Data

6 March 2018
Suchin Gururangan
Swabha Swayamdipta
Omer Levy
Roy Schwartz
Samuel R. Bowman
Noah A. Smith
ArXiv (abs)PDFHTML

Papers citing "Annotation Artifacts in Natural Language Inference Data"

50 / 796 papers shown
Title
QLoRA: Efficient Finetuning of Quantized LLMs
QLoRA: Efficient Finetuning of Quantized LLMs
Tim Dettmers
Artidoro Pagnoni
Ari Holtzman
Luke Zettlemoyer
ALM
163
2,641
0
23 May 2023
Out-of-Distribution Generalization in Text Classification: Past,
  Present, and Future
Out-of-Distribution Generalization in Text Classification: Past, Present, and Future
Linyi Yang
Yangqiu Song
Xuan Ren
Chenyang Lyu
Yidong Wang
Lingqiao Liu
Jindong Wang
Jennifer Foster
Yue Zhang
OOD
129
3
0
23 May 2023
Understanding and Mitigating Spurious Correlations in Text
  Classification with Neighborhood Analysis
Understanding and Mitigating Spurious Correlations in Text Classification with Neighborhood Analysis
Oscar Chew
Hsuan-Tien Lin
Kai-Wei Chang
Kuan-Hao Huang
80
6
0
23 May 2023
Measuring Inductive Biases of In-Context Learning with Underspecified
  Demonstrations
Measuring Inductive Biases of In-Context Learning with Underspecified Demonstrations
Chenglei Si
Dan Friedman
Nitish Joshi
Shi Feng
Danqi Chen
He He
77
48
0
22 May 2023
Distilling Robustness into Natural Language Inference Models with
  Domain-Targeted Augmentation
Distilling Robustness into Natural Language Inference Models with Domain-Targeted Augmentation
Joe Stacey
Marek Rei
57
3
0
22 May 2023
Has It All Been Solved? Open NLP Research Questions Not Solved by Large
  Language Models
Has It All Been Solved? Open NLP Research Questions Not Solved by Large Language Models
Oana Ignat
Zhijing Jin
Artem Abzaliev
Laura Biester
Santiago Castro
...
Verónica Pérez-Rosas
Siqi Shen
Zekun Wang
Winston Wu
Rada Mihalcea
LRM
136
6
0
21 May 2023
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious
  Correlation
Mitigating Backdoor Poisoning Attacks through the Lens of Spurious Correlation
Xuanli He
Xingliang Yuan
Jun Wang
Benjamin I. P. Rubinstein
Trevor Cohn
AAML
88
20
0
19 May 2023
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and
  Measurements of Performance
It Takes Two to Tango: Navigating Conceptualizations of NLP Tasks and Measurements of Performance
Arjun Subramonian
Xingdi Yuan
Hal Daumé
Su Lin Blodgett
95
18
0
15 May 2023
Measuring Consistency in Text-based Financial Forecasting Models
Measuring Consistency in Text-based Financial Forecasting Models
Linyi Yang
Yingpeng Ma
Yue Zhang
59
4
0
15 May 2023
What's the Meaning of Superhuman Performance in Today's NLU?
What's the Meaning of Superhuman Performance in Today's NLU?
Simone Tedeschi
Johan Bos
T. Declerck
Jan Hajic
Daniel Hershcovich
...
Simon Krek
Steven Schockaert
Rico Sennrich
Ekaterina Shutova
Roberto Navigli
ELMLM&MAVLMReLMLRM
96
27
0
15 May 2023
Learning to Generalize for Cross-domain QA
Learning to Generalize for Cross-domain QA
Yingjie Niu
Linyi Yang
Ruihai Dong
Yue Zhang
92
6
0
14 May 2023
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative
  Examples
SCENE: Self-Labeled Counterfactuals for Extrapolating to Negative Examples
Deqing Fu
Ameya Godbole
Robin Jia
72
8
0
13 May 2023
Say What You Mean! Large Language Models Speak Too Positively about
  Negative Commonsense Knowledge
Say What You Mean! Large Language Models Speak Too Positively about Negative Commonsense Knowledge
Jiangjie Chen
Wei Shi
Ziquan Fu
Sijie Cheng
Lei Li
Yanghua Xiao
93
51
0
10 May 2023
Explanation-based Finetuning Makes Models More Robust to Spurious Cues
Explanation-based Finetuning Makes Models More Robust to Spurious Cues
Josh Magnus Ludan
Yixuan Meng
Nguyen Tai
Saurabh Shah
Qing Lyu
Marianna Apidianaki
Chris Callison-Burch
AAMLLRM
107
21
0
08 May 2023
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial
  Reports
NLI4CT: Multi-Evidence Natural Language Inference for Clinical Trial Reports
Mael Jullien
Marco Valentino
H. Frost
Paul O'Regan
Dónal Landers
André Freitas
57
30
0
05 May 2023
MindGames: Targeting Theory of Mind in Large Language Models with
  Dynamic Epistemic Modal Logic
MindGames: Targeting Theory of Mind in Large Language Models with Dynamic Epistemic Modal Logic
Damien Sileo
Antoine Lernould
LRM
69
27
0
05 May 2023
SCOTT: Self-Consistent Chain-of-Thought Distillation
SCOTT: Self-Consistent Chain-of-Thought Distillation
Jamie Yap
Zhengyang Wang
Zheng Li
K. Lynch
Bing Yin
Xiang Ren
LRM
141
97
0
03 May 2023
Information Redundancy and Biases in Public Document Information
  Extraction Benchmarks
Information Redundancy and Biases in Public Document Information Extraction Benchmarks
S. Laatiri
Pirashanth Ratnamogan
Joel Tang
Laurent Lam
William Vanhuffel
Fabien Caspani
35
1
0
28 Apr 2023
DataComp: In search of the next generation of multimodal datasets
DataComp: In search of the next generation of multimodal datasets
S. Gadre
Gabriel Ilharco
Alex Fang
J. Hayase
Georgios Smyrnis
...
A. Dimakis
J. Jitsev
Y. Carmon
Vaishaal Shankar
Ludwig Schmidt
VLM
107
452
0
27 Apr 2023
GPT-NER: Named Entity Recognition via Large Language Models
GPT-NER: Named Entity Recognition via Large Language Models
Shuhe Wang
Xiaofei Sun
Xiaoya Li
Rongbin Ouyang
Leilei Gan
Tianwei Zhang
Jiwei Li
Guoyin Wang
97
201
0
20 Apr 2023
Exploring the Trade-Offs: Unified Large Language Models vs Local
  Fine-Tuned Models for Highly-Specific Radiology NLI Task
Exploring the Trade-Offs: Unified Large Language Models vs Local Fine-Tuned Models for Highly-Specific Radiology NLI Task
Zihao Wu
Lu Zhang
Chao-Yang Cao
Xiao-Xing Yu
Haixing Dai
...
Quanzheng Li
Dinggang Shen
Xiang Li
Dajiang Zhu
Tianming Liu
LM&MA
66
39
0
18 Apr 2023
LINGO : Visually Debiasing Natural Language Instructions to Support Task
  Diversity
LINGO : Visually Debiasing Natural Language Instructions to Support Task Diversity
Anjana Arunkumar
Shubham Sharma
Rakhi Agrawal
Sriramakrishnan Chandrasekaran
Chris Bryan
80
0
0
12 Apr 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language
  Models
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
Emilio Ferrara
SILM
121
264
0
07 Apr 2023
Inspecting and Editing Knowledge Representations in Language Models
Inspecting and Editing Knowledge Representations in Language Models
Evan Hernandez
Belinda Z. Li
Jacob Andreas
KELM
89
91
0
03 Apr 2023
Exposing and Addressing Cross-Task Inconsistency in Unified
  Vision-Language Models
Exposing and Addressing Cross-Task Inconsistency in Unified Vision-Language Models
A. Maharana
Amita Kamath
Christopher Clark
Joey Tianyi Zhou
Aniruddha Kembhavi
76
3
0
28 Mar 2023
Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural
  Language Processing Leaderboards
Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards
Chanjun Park
Hyeonseok Moon
Seolhwa Lee
Jaehyung Seo
Sugyeong Eo
Heu-Jeoung Lim
57
2
0
20 Mar 2023
Distributionally Robust Optimization with Probabilistic Group
Distributionally Robust Optimization with Probabilistic Group
Soumya Suvra Ghosal
Yixuan Li
OOD
66
10
0
10 Mar 2023
Towards Interpretable and Efficient Automatic Reference-Based
  Summarization Evaluation
Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation
Yixin Liu
Alexander R. Fabbri
Yilun Zhao
Pengfei Liu
Shafiq Joty
Chien-Sheng Wu
Caiming Xiong
Dragomir R. Radev
53
28
0
07 Mar 2023
Contrastive variational information bottleneck for aspect-based
  sentiment analysis
Contrastive variational information bottleneck for aspect-based sentiment analysis
Ming-Wei Chang
Min Yang
Qingshan Jiang
Ruifeng Xu
78
4
0
06 Mar 2023
WiCE: Real-World Entailment for Claims in Wikipedia
WiCE: Real-World Entailment for Claims in Wikipedia
Ryo Kamoi
Tanya Goyal
Juan Diego Rodriguez
Greg Durrett
103
92
0
02 Mar 2023
SMoA: Sparse Mixture of Adapters to Mitigate Multiple Dataset Biases
SMoA: Sparse Mixture of Adapters to Mitigate Multiple Dataset Biases
Yanchen Liu
Jing Yang
Yan Chen
Jing Liu
Huaqin Wu
MoE
85
2
0
28 Feb 2023
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Does Deep Learning Learn to Abstract? A Systematic Probing Framework
Shengnan An
Zeqi Lin
B. Chen
Qiang Fu
Nanning Zheng
Jian-Guang Lou
87
5
0
23 Feb 2023
Parallel Sentence-Level Explanation Generation for Real-World
  Low-Resource Scenarios
Parallel Sentence-Level Explanation Generation for Real-World Low-Resource Scenarios
Yang Liu
Xiaokang Chen
Qianwen Dai
LRM
51
4
0
21 Feb 2023
Empirical Investigation of Neural Symbolic Reasoning Strategies
Empirical Investigation of Neural Symbolic Reasoning Strategies
Yoichi Aoki
Keito Kudo
Tatsuki Kuribayashi
Ana Brassard
Masashi Yoshikawa
Keisuke Sakaguchi
Kentaro Inui
70
2
0
16 Feb 2023
ScatterShot: Interactive In-context Example Curation for Text
  Transformation
ScatterShot: Interactive In-context Example Curation for Text Transformation
Tongshuang Wu
Hua Shen
Daniel S. Weld
Jeffrey Heer
Marco Tulio Ribeiro
60
25
0
14 Feb 2023
Investigating Multi-source Active Learning for Natural Language
  Inference
Investigating Multi-source Active Learning for Natural Language Inference
Ard Snijders
Douwe Kiela
Katerina Margatina
77
7
0
14 Feb 2023
Benchmarks for Automated Commonsense Reasoning: A Survey
Benchmarks for Automated Commonsense Reasoning: A Survey
E. Davis
ELMLRM
92
63
0
09 Feb 2023
Real-Time Visual Feedback to Guide Benchmark Creation: A
  Human-and-Metric-in-the-Loop Workflow
Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow
Anjana Arunkumar
Swaroop Mishra
Bhavdeep Singh Sachdeva
Chitta Baral
Chris Bryan
56
0
0
09 Feb 2023
Guide the Learner: Controlling Product of Experts Debiasing Method Based
  on Token Attribution Similarities
Guide the Learner: Controlling Product of Experts Debiasing Method Based on Token Attribution Similarities
Ali Modarressi
Hossein Amirkhani
Mohammad Taher Pilehvar
48
2
0
06 Feb 2023
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics
  Without the Reference
Poor Man's Quality Estimation: Predicting Reference-Based MT Metrics Without the Reference
Vilém Zouhar
Shehzaad Dhuliawala
Wangchunshu Zhou
Nico Daheim
Tom Kocmi
Yuchen Eleanor Jiang
Mrinmaya Sachan
66
11
0
21 Jan 2023
KL Regularized Normalization Framework for Low Resource Tasks
KL Regularized Normalization Framework for Low Resource Tasks
Neeraj Kumar
Ankur Narang
Brejesh Lall
58
1
0
21 Dec 2022
DISCO: Distilling Counterfactuals with Large Language Models
DISCO: Distilling Counterfactuals with Large Language Models
Zeming Chen
Qiyue Gao
Antoine Bosselut
Ashish Sabharwal
Kyle Richardson
92
31
0
20 Dec 2022
Task Ambiguity in Humans and Language Models
Task Ambiguity in Humans and Language Models
Alex Tamkin
Kunal Handa
Ava Shrestha
Noah D. Goodman
UQLM
122
23
0
20 Dec 2022
Evaluation for Change
Evaluation for Change
Rishi Bommasani
ELM
64
0
0
20 Dec 2022
Debiasing Stance Detection Models with Counterfactual Reasoning and
  Adversarial Bias Learning
Debiasing Stance Detection Models with Counterfactual Reasoning and Adversarial Bias Learning
Jianhua Yuan
Yanyan Zhao
Bing Qin
117
4
0
20 Dec 2022
To Adapt or to Annotate: Challenges and Interventions for Domain
  Adaptation in Open-Domain Question Answering
To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering
Dheeru Dua
Emma Strubell
Sameer Singh
Pat Verga
OOD
94
3
0
20 Dec 2022
Unnatural Instructions: Tuning Language Models with (Almost) No Human
  Labor
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Or Honovich
Thomas Scialom
Omer Levy
Timo Schick
ALM
158
374
0
19 Dec 2022
Statistical Dataset Evaluation: Reliability, Difficulty, and Validity
Statistical Dataset Evaluation: Reliability, Difficulty, and Validity
Chengwen Wang
Qingxiu Dong
Xiaochen Wang
Haitao Wang
Zhifang Sui
XAI
57
3
0
19 Dec 2022
JEMMA: An Extensible Java Dataset for ML4Code Applications
JEMMA: An Extensible Java Dataset for ML4Code Applications
Anjan Karmakar
Miltiadis Allamanis
Romain Robbes
VLM
55
3
0
18 Dec 2022
Multi-Scales Data Augmentation Approach In Natural Language Inference
  For Artifacts Mitigation And Pre-Trained Model Optimization
Multi-Scales Data Augmentation Approach In Natural Language Inference For Artifacts Mitigation And Pre-Trained Model Optimization
Zhenyu Lu
104
1
0
16 Dec 2022
Previous
123456...141516
Next