Don't sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks

3 May 2022
Jonathan Rusert
P. Srinivasan
    AAML

Papers citing "Don't sweat the small stuff, classify the rest: Sample Shielding to protect text classifiers against adversarial attacks"

16 / 16 papers shown
TREATED: Towards Universal Defense against Textual Adversarial Attacks
Bin Zhu
Zhaoquan Gu
Le Wang
Zhihong Tian
AAML
45
8
0
13 Sep 2021
Towards Improving Adversarial Training of NLP Models
Jin Yong Yoo
Yanjun Qi
AAML
192
127
0
01 Sep 2021
Searching for an Effective Defender: Benchmarking Defense against Adversarial Word Substitution
Zongyi Li
Jianhan Xu
Jiehang Zeng
Linyang Li
Xiaoqing Zheng
Qi Zhang
Kai-Wei Chang
Cho-Jui Hsieh
AAML
48
74
0
29 Aug 2021
BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks
Yannik Keller
J. Mackensen
Steffen Eger
AAML
107
30
0
02 Jun 2021
Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
Jiehang Zeng
Xiaoqing Zheng
Jianhan Xu
Linyang Li
Liping Yuan
Xuanjing Huang
AAML
70
70
0
08 May 2021
Contextualized Perturbation for Textual Adversarial Attack
Dianqi Li
Yizhe Zhang
Hao Peng
Liqun Chen
Chris Brockett
Ming-Ting Sun
Bill Dolan
AAML, SILM
169
235
0
16 Sep 2020
Defense of Word-level Adversarial Attacks via Random Substitution Encoding
Zhaoyang Wang
Hongtao Wang
AAML, SILM
35
23
0
01 May 2020
BAE: BERT-based Adversarial Examples for Text Classification
Siddhant Garg
Goutham Ramakrishnan
AAML, SILM
211
556
0
04 Apr 2020
FreeLB: Enhanced Adversarial Training for Natural Language Understanding
Chen Zhu
Yu Cheng
Zhe Gan
S. Sun
Tom Goldstein
Jingjing Liu
AAML
286
443
0
25 Sep 2019
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
337
294
0
03 Sep 2019
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILM, AAML
199
1,088
0
27 Jul 2019
White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks
Yotam Gil
Yoav Chai
O. Gorodissky
Jonathan Berant
MLAU, AAML
48
46
0
04 Apr 2019
Text Processing Like Humans Do: Visually Attacking and Shielding NLP Systems
Steffen Eger
Gözde Gül Sahin
Andreas Rucklé
Ji-Ung Lee
Claudia Schulz
Mohsen Mesgar
Krishnkant Swarnkar
Edwin Simpson
Iryna Gurevych
AAML
104
163
0
27 Mar 2019
TextBugger: Generating Adversarial Text Against Real-world Applications
Jinfeng Li
S. Ji
Tianyu Du
Bo Li
Ting Wang
SILM, AAML
216
747
0
13 Dec 2018
Shielding Google's language toxicity model against adversarial attacks
Nestor Rodriguez
S. R. Galeano
AAML
39
15
0
05 Jan 2018
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw, VLM
644
13,432
0
25 Aug 2014