ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2112.08313
  4. Cited By
Measure and Improve Robustness in NLP Models: A Survey

Measure and Improve Robustness in NLP Models: A Survey

15 December 2021
Xuezhi Wang
Haohan Wang
Diyi Yang
ArXivPDFHTML

Papers citing "Measure and Improve Robustness in NLP Models: A Survey"

50 / 150 papers shown
Title
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains
Jiawen Zhang
Zhenwei Zhang
Shun Zheng
Xumeng Wen
Jia Li
Jiang Bian
AI4TS
AAML
144
0
0
26 May 2025
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
RAS: Retrieval-And-Structuring for Knowledge-Intensive LLM Generation
Pengcheng Jiang
Lang Cao
Ruike Zhu
Minhao Jiang
Yunyi Zhang
Jimeng Sun
Jiawei Han
RALM
183
4
0
16 Feb 2025
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
LLMs as Workers in Human-Computational Algorithms? Replicating Crowdsourcing Pipelines with LLMs
Tongshuang Wu
Haiyi Zhu
Maya Albayrak
Alexis Axon
Amanda Bertsch
...
Ying-Jui Tseng
Patricia Vaidos
Zhijin Wu
Wei Wu
Chenyang Yang
124
34
0
10 Jan 2025
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets
Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets
Vatsal Gupta
Pranshu Pandya
Tushar Kataria
Vivek Gupta
Dan Roth
AAML
110
1
0
03 Jan 2025
Specification Overfitting in Artificial Intelligence
Specification Overfitting in Artificial Intelligence
Benjamin Roth
Pedro Henrique Luz de Araujo
Yuxi Xia
Saskia Kaltenbrunner
Christoph Korab
181
1
0
13 Mar 2024
NL-Augmenter: A Framework for Task-Sensitive Natural Language
  Augmentation
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh D. Dhole
Varun Gangal
Sebastian Gehrmann
Aadesh Gupta
Zhenhao Li
...
Tianbao Xie
Usama Yaseen
Michael A. Yee
Jing Zhang
Yue Zhang
206
87
0
06 Dec 2021
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Uncertainty Calibration for Ensemble-Based Debiasing Methods
Ruibin Xiong
Yimeng Chen
Liang Pang
Xueqi Chen
Yanyan Lan
42
21
0
07 Nov 2021
Toward Learning Human-aligned Cross-domain Robust Models by Countering
  Misaligned Features
Toward Learning Human-aligned Cross-domain Robust Models by Countering Misaligned Features
Haohan Wang
Zeyi Huang
Hanlin Zhang
Yong Jae Lee
Eric P. Xing
OOD
172
16
0
05 Nov 2021
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of
  Language Models
Adversarial GLUE: A Multi-Task Benchmark for Robustness Evaluation of Language Models
Wei Ping
Chejian Xu
Shuohang Wang
Zhe Gan
Yu Cheng
Jianfeng Gao
Ahmed Hassan Awadallah
Yangqiu Song
VLM
ELM
AAML
63
222
0
04 Nov 2021
Identifying and Mitigating Spurious Correlations for Improving
  Robustness in NLP Models
Identifying and Mitigating Spurious Correlations for Improving Robustness in NLP Models
Tianlu Wang
Rohit Sridhar
Diyi Yang
Xuezhi Wang
AAML
169
75
0
14 Oct 2021
RockNER: A Simple Method to Create Adversarial Examples for Evaluating
  the Robustness of Named Entity Recognition Models
RockNER: A Simple Method to Create Adversarial Examples for Evaluating the Robustness of Named Entity Recognition Models
Bill Yuchen Lin
Wenyang Gao
Jun Yan
Ryan Rene Moreno
Xiang Ren
AAML
76
42
0
12 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Yangyi Chen
Jingtong Su
Wei Wei
AAML
36
33
0
09 Sep 2021
Just Train Twice: Improving Group Robustness without Training Group
  Information
Just Train Twice: Improving Group Robustness without Training Group Information
Emmy Liu
Behzad Haghgoo
Annie S. Chen
Aditi Raghunathan
Pang Wei Koh
Shiori Sagawa
Percy Liang
Chelsea Finn
OOD
94
562
0
19 Jul 2021
Tailor: Generating and Perturbing Text with Semantic Controls
Tailor: Generating and Perturbing Text with Semantic Controls
Alexis Ross
Tongshuang Wu
Hao Peng
Matthew E. Peters
Matt Gardner
179
78
0
15 Jul 2021
An Investigation of the (In)effectiveness of Counterfactually Augmented
  Data
An Investigation of the (In)effectiveness of Counterfactually Augmented Data
Nitish Joshi
He He
OODD
54
47
0
01 Jul 2021
Bad Characters: Imperceptible NLP Attacks
Bad Characters: Imperceptible NLP Attacks
Nicholas Boucher
Ilia Shumailov
Ross J. Anderson
Nicolas Papernot
AAML
SILM
68
106
0
18 Jun 2021
HiddenCut: Simple Data Augmentation for Natural Language Understanding
  with Better Generalization
HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Jiaao Chen
Dinghan Shen
Weizhu Chen
Diyi Yang
BDL
65
47
0
31 May 2021
Counterfactual Invariance to Spurious Correlations: Why and How to Pass
  Stress Tests
Counterfactual Invariance to Spurious Correlations: Why and How to Pass Stress Tests
Victor Veitch
Alexander DÁmour
Steve Yadlowsky
Jacob Eisenstein
OOD
52
93
0
31 May 2021
Exploring Misclassifications of Robust Neural Networks to Enhance
  Adversarial Attacks
Exploring Misclassifications of Robust Neural Networks to Enhance Adversarial Attacks
Leo Schwinn
René Raab
A. Nguyen
Dario Zanca
Bjoern M. Eskofier
AAML
58
61
0
21 May 2021
A Survey of Data Augmentation Approaches for NLP
A Survey of Data Augmentation Approaches for NLP
Steven Y. Feng
Varun Gangal
Jason W. Wei
Sarath Chandar
Soroush Vosoughi
Teruko Mitamura
Eduard H. Hovy
AIMat
106
823
0
07 May 2021
Reliability Testing for Natural Language Processing Systems
Reliability Testing for Natural Language Processing Systems
Samson Tan
Shafiq Joty
K. Baxter
Araz Taeihagh
G. Bennett
Min-Yen Kan
75
40
0
06 May 2021
Improving Faithfulness in Abstractive Summarization with Contrast
  Candidate Generation and Selection
Improving Faithfulness in Abstractive Summarization with Contrast Candidate Generation and Selection
Sihao Chen
Fan Zhang
Kazoo Sone
Dan Roth
HILM
85
107
0
19 Apr 2021
Dynabench: Rethinking Benchmarking in NLP
Dynabench: Rethinking Benchmarking in NLP
Douwe Kiela
Max Bartolo
Yixin Nie
Divyansh Kaushik
Atticus Geiger
...
Pontus Stenetorp
Robin Jia
Joey Tianyi Zhou
Christopher Potts
Adina Williams
201
407
0
07 Apr 2021
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Code-Mixing on Sesame Street: Dawn of the Adversarial Polyglots
Samson Tan
Shafiq Joty
AAML
61
35
0
17 Mar 2021
How to represent part-whole hierarchies in a neural network
How to represent part-whole hierarchies in a neural network
Geoffrey E. Hinton
OCL
MoE
83
203
0
25 Feb 2021
On Robustness of Neural Semantic Parsers
On Robustness of Neural Semantic Parsers
Shuo Huang
Zhuang Li
Zhuang Li
Lei Pan
AAML
71
16
0
02 Feb 2021
Robustness Gym: Unifying the NLP Evaluation Landscape
Robustness Gym: Unifying the NLP Evaluation Landscape
Karan Goel
Nazneen Rajani
Jesse Vig
Samson Tan
Jason M. Wu
Stephan Zheng
Caiming Xiong
Joey Tianyi Zhou
Christopher Ré
AAML
OffRL
OOD
186
140
0
13 Jan 2021
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and
  Improving Models
Polyjuice: Generating Counterfactuals for Explaining, Evaluating, and Improving Models
Tongshuang Wu
Marco Tulio Ribeiro
Jeffrey Heer
Daniel S. Weld
97
249
0
01 Jan 2021
A Survey on Neural Network Interpretability
A Survey on Neural Network Interpretability
Yu Zhang
Peter Tiño
A. Leonardis
K. Tang
FaML
XAI
199
679
0
28 Dec 2020
Robustness to Spurious Correlations in Text Classification via
  Automatically Generated Counterfactuals
Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals
Zhao Wang
A. Culotta
CML
OOD
64
100
0
18 Dec 2020
Semantics Altering Modifications for Evaluating Comprehension in Machine
  Reading
Semantics Altering Modifications for Evaluating Comprehension in Machine Reading
Viktor Schlegel
Goran Nenadic
Riza Batista-Navarro
56
18
0
07 Dec 2020
On the Transferability of Adversarial Attacksagainst Neural Text
  Classifier
On the Transferability of Adversarial Attacksagainst Neural Text Classifier
Liping Yuan
Xiaoqing Zheng
Yi Zhou
Cho-Jui Hsieh
Kai-Wei Chang
SILM
AAML
43
26
0
17 Nov 2020
Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles
Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles
Christopher Clark
Mark Yatskar
Luke Zettlemoyer
59
62
0
07 Nov 2020
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Detecting Hallucinated Content in Conditional Neural Sequence Generation
Chunting Zhou
Graham Neubig
Jiatao Gu
Mona T. Diab
P. Guzmán
Luke Zettlemoyer
Marjan Ghazvininejad
HILM
93
200
0
05 Nov 2020
Learning to Recognize Dialect Features
Learning to Recognize Dialect Features
Dorottya Demszky
D. Sharma
J. Clark
Vinodkumar Prabhakaran
Jacob Eisenstein
181
38
0
23 Oct 2020
Word Shape Matters: Robust Machine Translation with Visual Embedding
Word Shape Matters: Robust Machine Translation with Visual Embedding
Haohan Wang
Peiyan Zhang
Eric Xing
174
13
0
20 Oct 2020
The Risks of Invariant Risk Minimization
The Risks of Invariant Risk Minimization
Elan Rosenfeld
Pradeep Ravikumar
Andrej Risteski
OOD
76
312
0
12 Oct 2020
Counterfactually-Augmented SNLI Training Data Does Not Yield Better
  Generalization Than Unaugmented Data
Counterfactually-Augmented SNLI Training Data Does Not Yield Better Generalization Than Unaugmented Data
William Huang
Haokun Liu
Samuel R. Bowman
62
38
0
09 Oct 2020
Identifying Spurious Correlations for Robust Text Classification
Identifying Spurious Correlations for Robust Text Classification
Zhao Wang
A. Culotta
OOD
72
78
0
06 Oct 2020
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial
  Text Generation
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation
Tianlu Wang
Xuezhi Wang
Yao Qin
Ben Packer
Kang Li
Jilin Chen
Alex Beutel
Ed H. Chi
SILM
74
83
0
05 Oct 2020
InfoBERT: Improving Robustness of Language Models from An Information
  Theoretic Perspective
InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective
Wei Ping
Shuohang Wang
Yu Cheng
Zhe Gan
R. Jia
Yue Liu
Jingjing Liu
AAML
195
116
0
05 Oct 2020
Domain Adversarial Fine-Tuning as an Effective Regularizer
Domain Adversarial Fine-Tuning as an Effective Regularizer
Giorgos Vernikos
Katerina Margatina
Alexandra Chronopoulou
Ion Androutsopoulos
48
15
0
28 Sep 2020
Towards Debiasing NLU Models from Unknown Biases
Towards Debiasing NLU Models from Unknown Biases
Prasetya Ajie Utama
N. Moosavi
Iryna Gurevych
59
155
0
25 Sep 2020
Contextualized Perturbation for Textual Adversarial Attack
Contextualized Perturbation for Textual Adversarial Attack
Dianqi Li
Yizhe Zhang
Hao Peng
Liqun Chen
Chris Brockett
Ming-Ting Sun
Bill Dolan
AAML
SILM
164
235
0
16 Sep 2020
Question and Answer Test-Train Overlap in Open-Domain Question Answering
  Datasets
Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
Patrick Lewis
Pontus Stenetorp
Sebastian Riedel
OOD
ELM
153
187
0
06 Aug 2020
An Empirical Study on Robustness to Spurious Correlations using
  Pre-trained Language Models
An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models
Lifu Tu
Garima Lalwani
Spandana Gella
He He
LRM
93
186
0
14 Jul 2020
Robustness to Spurious Correlations via Human Annotations
Robustness to Spurious Correlations via Human Annotations
Megha Srivastava
Tatsunori Hashimoto
Percy Liang
CML
OOD
48
90
0
13 Jul 2020
Learning from Failure: Training Debiased Classifier from Biased
  Classifier
Learning from Failure: Training Debiased Classifier from Biased Classifier
J. Nam
Hyuntak Cha
SungSoo Ahn
Jaeho Lee
Jinwoo Shin
63
149
0
06 Jul 2020
In Search of Lost Domain Generalization
In Search of Lost Domain Generalization
Ishaan Gulrajani
David Lopez-Paz
OOD
76
1,149
0
02 Jul 2020
Measuring Robustness to Natural Distribution Shifts in Image
  Classification
Measuring Robustness to Natural Distribution Shifts in Image Classification
Rohan Taori
Achal Dave
Vaishaal Shankar
Nicholas Carlini
Benjamin Recht
Ludwig Schmidt
OOD
117
546
0
01 Jul 2020
123
Next