Textual Manifold-based Defense Against Natural Language Adversarial Examples

5 November 2022

Papers citing "Textual Manifold-based Defense Against Natural Language Adversarial Examples"

18 / 18 papers shown

Title
Exploring Gradient-Guided Masked Language Model to Detect Textual Adversarial Attacks Xiaomei Zhang Zhaoxi Zhang Yanjun Zhang Xufei Zheng L. Zhang Shengshan Hu Shirui Pan AAML 27 0 0 08 Apr 2025
DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising Zhenhao Li Huichi Zhou Marek Rei Lucia Specia DiffM 29 0 0 28 Jun 2024
Exploiting the Layered Intrinsic Dimensionality of Deep Models for Practical Adversarial Training Enes Altinisik Safa Messaoud Husrev Taha Sencar Hassan Sajjad Sanjay Chawla AAML 48 0 0 27 May 2024
Persistent Classification: A New Approach to Stability of Data and Adversarial Examples Brian Bell Michael Geyer David Glickenstein Keaton Hamm C. Scheidegger Amanda S. Fernandez Juston Moore AAML 44 0 0 11 Apr 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks Brian Formento Wenjie Feng Chuan-Sheng Foo Anh Tuan Luu See-Kiong Ng AAML 34 7 0 27 Mar 2024
Extreme Miscalibration and the Illusion of Adversarial Robustness Vyas Raina Samson Tan V. Cevher Aditya Rawal Sheng Zha George Karypis AAML 41 2 0 27 Feb 2024
Defending Against Weight-Poisoning Backdoor Attacks for Parameter-Efficient Fine-Tuning Shuai Zhao Leilei Gan Anh Tuan Luu Jie Fu Lingjuan Lyu Meihuizi Jia Jinming Wen AAML 26 23 0 19 Feb 2024
Fooling the Textual Fooler via Randomizing Latent Representations Duy C. Hoang Quang H. Nguyen Saurav Manchanda MinLong Peng Kok-Seng Wong Khoa D. Doan SILM AAML 23 0 0 02 Oct 2023
Certifying LLM Safety against Adversarial Prompting Aounon Kumar Chirag Agarwal Suraj Srinivas Aaron Jiaxun Li S. Feizi Himabindu Lakkaraju AAML 27 165 0 06 Sep 2023
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models Shuai Zhao Jinming Wen Anh Tuan Luu J. Zhao Jie Fu SILM 62 89 0 02 May 2023
Granular-ball computing: an efficient, robust, and interpretable adaptive multi-granularity representation and computation method Shuyin Xia Guoyin Wang Xinbo Gao Xiaoyu Lian 19 8 0 21 Apr 2023
Masked Language Model Based Textual Adversarial Example Detection Xiaomei Zhang Zhaoxi Zhang Qi Zhong Xufei Zheng Yanjun Zhang Shengshan Hu L. Zhang AAML 28 0 0 18 Apr 2023
FreeLB: Enhanced Adversarial Training for Natural Language Understanding Chen Zhu Yu Cheng Zhe Gan S. Sun Tom Goldstein Jingjing Liu AAML 232 438 0 25 Sep 2019
Certified Robustness to Adversarial Word Substitutions Robin Jia Aditi Raghunathan Kerem Göksel Percy Liang AAML 183 291 0 03 Sep 2019
Disentangling Adversarial Robustness and Generalization David Stutz Matthias Hein Bernt Schiele AAML OOD 194 274 0 03 Dec 2018
Generating Natural Language Adversarial Examples M. Alzantot Yash Sharma Ahmed Elgohary Bo-Jhang Ho Mani B. Srivastava Kai-Wei Chang AAML 245 915 0 21 Apr 2018
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks Mohit Iyyer John Wieting Kevin Gimpel Luke Zettlemoyer AAML GAN 205 712 0 17 Apr 2018
Inverting The Generator Of A Generative Adversarial Network Antonia Creswell Anil Anthony Bharath GAN 171 338 0 17 Nov 2016