arXiv: 2112.08313
Measure and Improve Robustness in NLP Models: A Survey
15 December 2021
Xuezhi Wang
Haohan Wang
Diyi Yang
Papers citing
"Measure and Improve Robustness in NLP Models: A Survey"
50 / 150 papers shown
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains
Jiawen Zhang
Zhenwei Zhang
Shun Zheng
Xumeng Wen
Jia Li
Jiang Bian
AI4TS
AAML
144
0
0
26 May 2025
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
Pengcheng Jiang
Lang Cao
Ruike Zhu
Minhao Jiang
Yunyi Zhang
Jimeng Sun
Jiawei Han
RALM
183
4
0
16 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
124
34
0
10 Jan 2025
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets
Vatsal Gupta
Pranshu Pandya
Tushar Kataria
Vivek Gupta
Dan Roth
AAML
110
1
0
03 Jan 2025
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
181
1
0
13 Mar 2024
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
206
87
0
06 Dec 2021
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
42
21
0
07 Nov 2021
Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features
Haohan Wang
Zeyi Huang
Hanlin Zhang
Yong Jae Lee
Eric P. Xing
OOD
172
16
0
05 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Wei Ping
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Yangqiu Song
VLM
ELM
AAML
63
222
0
04 Nov 2021
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models
Tianlu Wang
Rohit Sridhar
Diyi Yang
Xuezhi Wang
AAML
169
75
0
14 Oct 2021
RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition Models
Bill Yuchen Lin
Wenyang Gao
Jun Yan
Ryan Rene Moreno
Xiang Ren
AAML
76
42
0
12 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Yangyi Chen
Jingtong Su
Wei Wei
AAML
36
33
0
09 Sep 2021
Just Train Twice: Improving Group Robustness without Training Group Information
Emmy Liu
Behzad Haghgoo
Annie S. Chen
Aditi Raghunathan
Pang Wei Koh
Shiori Sagawa
Percy Liang
Chelsea Finn
OOD
94
562
0
19 Jul 2021
Tailor: Generating and Perturbing Text with Semantic Controls
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
179
78
0
15 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented Data
Nitish Joshi
He He
OODD
54
47
0
01 Jul 2021
Bad Characters: Imperceptible NLP Attacks
Nicholas Boucher
Ilia Shumailov
Ross J. Anderson
Nicolas Papernot
AAML
SILM
68
106
0
18 Jun 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Jiaao Chen
Dinghan Shen
Weizhu Chen
Diyi Yang
BDL
65
47
0
31 May 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests
Victor Veitch
Alexander D'Amour
Steve Yadlowsky
Jacob Eisenstein
OOD
52
93
0
31 May 2021
Exploring Misclassifications of Robust Neural Networks to Enhance Adversarial Attacks
Leo Schwinn
René Raab
A. Nguyen
Dario Zanca
Bjoern M. Eskofier
AAML
58
61
0
21 May 2021
A Survey of Data Augmentation Approaches for NLP
Steven Y. Feng
Varun Gangal
Jason W. Wei
Sarath Chandar
Soroush Vosoughi
Teruko Mitamura
Eduard H. Hovy
AIMat
106
823
0
07 May 2021
Reliability Testing for Natural Language Processing Systems
Samson Tan
Shafiq Joty
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
75
40
0
06 May 2021
Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection
Sihao Chen
Fan Zhang
Kazoo Sone
Dan Roth
HILM
85
107
0
19 Apr 2021
Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
201
407
0
07 Apr 2021
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Samson Tan
Shafiq Joty
AAML
61
35
0
17 Mar 2021
How to represent part-whole hierarchies in a neural network
Geoffrey E. Hinton
OCL
MoE
83
203
0
25 Feb 2021
On Robustness of Neural Semantic Parsers
Shuo Huang
Zhuang Li
Lizhen Qu
Lei Pan
AAML
71
16
0
02 Feb 2021
Robustness Gym: Unifying the NLP Evaluation Landscape
Karan Goel
Nazneen Rajani
Jesse Vig
Samson Tan
Jason M. Wu
Stephan Zheng
Caiming Xiong
Joey Tianyi Zhou
Christopher Ré
AAML
OffRL
OOD
186
140
0
13 Jan 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
97
249
0
01 Jan 2021
A Survey on Neural Network Interpretability
Yu Zhang
Peter Tiño
A. Leonardis
K. Tang
FaML
XAI
199
679
0
28 Dec 2020
Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals
Zhao Wang
A. Culotta
CML
OOD
64
100
0
18 Dec 2020
Semantics Altering Modifications for Evaluating Comprehension in Machine Reading
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
56
18
0
07 Dec 2020
On the Transferability of Adversarial Attacks against Neural Text Classifier
Liping Yuan
Xiaoqing Zheng
Yi Zhou
Cho-Jui Hsieh
Kai-Wei Chang
SILM
AAML
43
26
0
17 Nov 2020
Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles
Christopher Clark
Mark Yatskar
Luke Zettlemoyer
59
62
0
07 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Chunting Zhou
Graham Neubig
Jiatao Gu
Mona T. Diab
P. Guzmán
Luke Zettlemoyer
Marjan Ghazvininejad
HILM
93
200
0
05 Nov 2020
Learning to Recognize Dialect Features
Dorottya Demszky
D. Sharma
J. Clark
Vinodkumar Prabhakaran
Jacob Eisenstein
181
38
0
23 Oct 2020
Word Shape Matters: Robust Machine Translation with Visual Embedding
Haohan Wang
Peiyan Zhang
Eric Xing
174
13
0
20 Oct 2020
The Risks of Invariant Risk Minimization
Elan Rosenfeld
Pradeep Ravikumar
Andrej Risteski
OOD
76
312
0
12 Oct 2020
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data
William Huang
Haokun Liu
Samuel R. Bowman
62
38
0
09 Oct 2020
Identifying Spurious Correlations for Robust Text Classification
Zhao Wang
A. Culotta
OOD
72
78
0
06 Oct 2020
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation
Tianlu Wang
Xuezhi Wang
Yao Qin
Ben Packer
Kang Li
Jilin Chen
Alex Beutel
Ed H. Chi
SILM
74
83
0
05 Oct 2020
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Wei Ping
Shuohang Wang
Yu Cheng
Zhe Gan
R. Jia
Yue Liu
Jingjing Liu
AAML
195
116
0
05 Oct 2020
Domain Adversarial Fine-Tuning as an Effective Regularizer
Giorgos Vernikos
Katerina Margatina
Alexandra Chronopoulou
Ion Androutsopoulos
48
15
0
28 Sep 2020
Towards Debiasing NLU Models from Unknown Biases
Prasetya Ajie Utama
N. Moosavi
Iryna Gurevych
59
155
0
25 Sep 2020
Contextualized Perturbation for Textual Adversarial Attack
Dianqi Li
Yizhe Zhang
Hao Peng
Liqun Chen
Chris Brockett
Ming-Ting Sun
Bill Dolan
AAML
SILM
164
235
0
16 Sep 2020
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
Patrick Lewis
Pontus Stenetorp
Sebastian Riedel
OOD
ELM
153
187
0
06 Aug 2020
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu Tu
Garima Lalwani
Spandana Gella
He He
LRM
93
186
0
14 Jul 2020
Robustness to Spurious Correlations via Human Annotations
Megha Srivastava
Tatsunori Hashimoto
Percy Liang
CML
OOD
48
90
0
13 Jul 2020
Learning from Failure: Training Debiased Classifier from Biased Classifier
J. Nam
Hyuntak Cha
SungSoo Ahn
Jaeho Lee
Jinwoo Shin
63
149
0
06 Jul 2020
In Search of Lost Domain Generalization
Ishaan Gulrajani
David Lopez-Paz
OOD
76
1,149
0
02 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image Classification
Rohan Taori
Achal Dave
Vaishaal Shankar
Nicholas Carlini
Benjamin Recht
Ludwig Schmidt
OOD
117
546
0
01 Jul 2020