SemAttack: Natural Textual Attacks via Different Semantic Spaces

SemAttack: Natural Textual Attacks via Different Semantic Spaces

3 May 2022

Papers citing "SemAttack: Natural Textual Attacks via Different Semantic Spaces"

14 / 14 papers shown

Title
aiXamine: Simplified LLM Safety and Security Fatih Deniz Dorde Popovic Yazan Boshmaf Euisuh Jeong M. Ahmad Sanjay Chawla Issa M. Khalil ELM 80 0 0 21 Apr 2025
Obfuscating IoT Device Scanning Activity via Adversarial Example Generation Haocong Li Yaxin Zhang Long Cheng Wenjia Niu Haining Wang Qiang Li AAML 41 0 0 17 Jun 2024
VertAttack: Taking advantage of Text Classifiers' horizontal vision Jonathan Rusert AAML 43 1 0 12 Apr 2024
Quantifying Uncertainty in Natural Language Explanations of Large Language Models Sree Harsha Tanneru Chirag Agarwal Himabindu Lakkaraju LRM 27 14 0 06 Nov 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF Lingfeng Shen Sihao Chen Linfeng Song Lifeng Jin Baolin Peng Haitao Mi Daniel Khashabi Dong Yu 40 21 0 28 Sep 2023
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation Sahar Sadrizadeh Ljiljana Dolamic P. Frossard AAML SILM 44 2 0 29 Aug 2023
TextShield: Beyond Successfully Detecting Adversarial Sentences in Text Classification Lingfeng Shen Ze Zhang Haiyun Jiang Ying-Cong Chen AAML 41 5 0 03 Feb 2023
TransFool: An Adversarial Attack against Neural Machine Translation Models Sahar Sadrizadeh Ljiljana Dolamic P. Frossard SILM AAML 39 12 0 02 Feb 2023
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models Wei Ping Ming-Yu Liu Chaowei Xiao P. Xu M. Patwary M. Shoeybi Bo-wen Li Anima Anandkumar Bryan Catanzaro 25 65 0 08 Feb 2022
Gradient-based Adversarial Attacks against Text Transformers Chuan Guo Alexandre Sablayrolles Hervé Jégou Douwe Kiela SILM 106 227 0 15 Apr 2021
Robust Encodings: A Framework for Combating Adversarial Typos Erik Jones Robin Jia Aditi Raghunathan Percy Liang AAML 142 102 0 04 May 2020
FreeLB: Enhanced Adversarial Training for Natural Language Understanding Chen Zhu Yu Cheng Zhe Gan S. Sun Tom Goldstein Jingjing Liu AAML 232 438 0 25 Sep 2019
Generating Natural Language Adversarial Examples M. Alzantot Yash Sharma Ahmed Elgohary Bo-Jhang Ho Mani B. Srivastava Kai-Wei Chang AAML 258 915 0 21 Apr 2018
GLUE: A Multi-Task Benchmark and Analysis Platform for Natural Language Understanding Alex Jinpeng Wang Amanpreet Singh Julian Michael Felix Hill Omer Levy Samuel R. Bowman ELM 299 6,984 0 20 Apr 2018