Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2211.02878
Cited By
Textual Manifold-based Defense Against Natural Language Adversarial Examples
5 November 2022
D. M. Nguyen
Anh Tuan Luu
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Textual Manifold-based Defense Against Natural Language Adversarial Examples"
18 / 18 papers shown
Title
Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks
Xiaomei Zhang
Zhaoxi Zhang
Yanjun Zhang
Xufei Zheng
L. Zhang
Shengshan Hu
Shirui Pan
AAML
27
0
0
08 Apr 2025
DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising
Zhenhao Li
Huichi Zhou
Marek Rei
Lucia Specia
DiffM
29
0
0
28 Jun 2024
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training
Enes Altinisik
Safa Messaoud
Husrev Taha Sencar
Hassan Sajjad
Sanjay Chawla
AAML
48
0
0
27 May 2024
Persistent Classification: A New Approach to Stability of Data and Adversarial Examples
Brian Bell
Michael Geyer
David Glickenstein
Keaton Hamm
C. Scheidegger
Amanda S. Fernandez
Juston Moore
AAML
44
0
0
11 Apr 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
34
7
0
27 Mar 2024
Extreme Miscalibration and the Illusion of Adversarial Robustness
Vyas Raina
Samson Tan
V. Cevher
Aditya Rawal
Sheng Zha
George Karypis
AAML
41
2
0
27 Feb 2024
Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning
Shuai Zhao
Leilei Gan
Anh Tuan Luu
Jie Fu
Lingjuan Lyu
Meihuizi Jia
Jinming Wen
AAML
26
23
0
19 Feb 2024
Fooling the Textual Fooler via Randomizing Latent Representations
Duy C. Hoang
Quang H. Nguyen
Saurav Manchanda
MinLong Peng
Kok-Seng Wong
Khoa D. Doan
SILM
AAML
23
0
0
02 Oct 2023
Certifying LLM Safety against Adversarial Prompting
Aounon Kumar
Chirag Agarwal
Suraj Srinivas
Aaron Jiaxun Li
S. Feizi
Himabindu Lakkaraju
AAML
27
165
0
06 Sep 2023
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models
Shuai Zhao
Jinming Wen
Anh Tuan Luu
J. Zhao
Jie Fu
SILM
62
89
0
02 May 2023
Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method
Shuyin Xia
Guoyin Wang
Xinbo Gao
Xiaoyu Lian
19
8
0
21 Apr 2023
Masked Language Model Based Textual Adversarial Example Detection
Xiaomei Zhang
Zhaoxi Zhang
Qi Zhong
Xufei Zheng
Yanjun Zhang
Shengshan Hu
L. Zhang
AAML
28
0
0
18 Apr 2023
FreeLB: Enhanced Adversarial Training for Natural Language Understanding
Chen Zhu
Yu Cheng
Zhe Gan
S. Sun
Tom Goldstein
Jingjing Liu
AAML
232
438
0
25 Sep 2019
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
183
291
0
03 Sep 2019
Disentangling Adversarial Robustness and Generalization
David Stutz
Matthias Hein
Bernt Schiele
AAML
OOD
194
274
0
03 Dec 2018
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
245
915
0
21 Apr 2018
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer
John Wieting
Kevin Gimpel
Luke Zettlemoyer
AAML
GAN
205
712
0
17 Apr 2018
Inverting The Generator Of A Generative Adversarial Network
Antonia Creswell
Anil Anthony Bharath
GAN
171
338
0
17 Nov 2016
1