Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2205.01714
Cited By
Don't sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks
3 May 2022
Jonathan Rusert
P. Srinivasan
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Don't sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks"
16 / 16 papers shown
Title
TREATED:Towards Universal Defense against Textual Adversarial Attacks
Bin Zhu
Zhaoquan Gu
Le Wang
Zhihong Tian
AAML
45
8
0
13 Sep 2021
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo
Yanjun Qi
AAML
192
127
0
01 Sep 2021
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution
Zongyi Li
Jianhan Xu
Jiehang Zeng
Linyang Li
Xiaoqing Zheng
Qi Zhang
Kai-Wei Chang
Cho-Jui Hsieh
AAML
48
74
0
29 Aug 2021
BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks
Yannik Keller
J. Mackensen
Steffen Eger
AAML
107
30
0
02 Jun 2021
Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
Jiehang Zeng
Xiaoqing Zheng
Jianhan Xu
Linyang Li
Liping Yuan
Xuanjing Huang
AAML
70
70
0
08 May 2021
Contextualized Perturbation for Textual Adversarial Attack
Dianqi Li
Yizhe Zhang
Hao Peng
Liqun Chen
Chris Brockett
Ming-Ting Sun
Bill Dolan
AAML
SILM
169
235
0
16 Sep 2020
Defense of Word-level Adversarial Attacks via Random Substitution Encoding
Zhaoyang Wang
Hongtao Wang
AAML
SILM
35
23
0
01 May 2020
BAE: BERT-based Adversarial Examples for Text Classification
Siddhant Garg
Goutham Ramakrishnan
AAML
SILM
211
556
0
04 Apr 2020
FreeLB: Enhanced Adversarial Training for Natural Language Understanding
Chen Zhu
Yu Cheng
Zhe Gan
S. Sun
Tom Goldstein
Jingjing Liu
AAML
286
443
0
25 Sep 2019
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
337
294
0
03 Sep 2019
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILM
AAML
197
1,088
0
27 Jul 2019
White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks
Yotam Gil
Yoav Chai
O. Gorodissky
Jonathan Berant
MLAU
AAML
48
46
0
04 Apr 2019
Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems
Steffen Eger
Gözde Gül Sahin
Andreas Rucklé
Ji-Ung Lee
Claudia Schulz
Mohsen Mesgar
Krishnkant Swarnkar
Edwin Simpson
Iryna Gurevych
AAML
104
163
0
27 Mar 2019
TextBugger: Generating Adversarial Text Against Real-world Applications
Jinfeng Li
S. Ji
Tianyu Du
Bo Li
Ting Wang
SILM
AAML
216
747
0
13 Dec 2018
Shielding Google's language toxicity model against adversarial attacks
Nestor Rodriguez
S. R. Galeano
AAML
37
15
0
05 Jan 2018
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
644
13,432
0
25 Aug 2014
1