Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2012.14956
Cited By
Generating Natural Language Attacks in a Hard Label Black Box Setting
29 December 2020
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Generating Natural Language Attacks in a Hard Label Black Box Setting"
41 / 41 papers shown
Title
Confidence Elicitation: A New Attack Vector for Large Language Models
Brian Formento
Chuan-Sheng Foo
See-Kiong Ng
AAML
99
0
0
07 Feb 2025
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence Benchmarks
Yang Wang
Chenghua Lin
ELM
40
0
0
05 Jan 2025
NMT-Obfuscator Attack: Ignore a sentence in translation with only one word
Sahar Sadrizadeh
César Descalzo
Ljiljana Dolamic
P. Frossard
AAML
81
0
0
19 Nov 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
34
7
0
27 Mar 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng
Yuhao Zhou
Zhiheng Xi
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
55
0
0
24 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAML
SILM
31
3
0
18 Mar 2024
Evaluating Robustness of Generative Search Engine on Adversarial Factual Questions
Xuming Hu
Xiaochuan Li
Junzhe Chen
Hai-Tao Zheng
Yangning Li
...
Yasheng Wang
Qun Liu
Lijie Wen
Philip S. Yu
Zhijiang Guo
AAML
ELM
32
5
0
25 Feb 2024
HQA-Attack: Toward High Quality Black-Box Hard-Label Adversarial Attack on Text
Han Liu
Zhi Xu
Xiaotong Zhang
Feng Zhang
Fenglong Ma
Hongyang Chen
Hong Yu
Xianchao Zhang
AAML
27
7
0
02 Feb 2024
Fooling the Textual Fooler via Randomizing Latent Representations
Duy C. Hoang
Quang H. Nguyen
Saurav Manchanda
MinLong Peng
Kok-Seng Wong
Khoa D. Doan
SILM
AAML
23
0
0
02 Oct 2023
Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM
Bochuan Cao
Yu Cao
Lu Lin
Jinghui Chen
AAML
36
135
0
18 Sep 2023
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation
Sahar Sadrizadeh
Ljiljana Dolamic
P. Frossard
AAML
SILM
44
2
0
29 Aug 2023
LimeAttack: Local Explainable Method for Textual Hard-Label Adversarial Attack
HaiXiang Zhu
Zhaoqing Yang
Weiwei Shang
Yuren Wu
AAML
FAtt
10
3
0
01 Aug 2023
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks
Xinyu Zhang
Hanbin Hong
Yuan Hong
Peng Huang
Binghui Wang
Zhongjie Ba
Kui Ren
SILM
44
18
0
31 Jul 2023
On Evaluating Adversarial Robustness of Large Vision-Language Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Chongxuan Li
Ngai-man Cheung
Min Lin
VLM
AAML
MLLM
33
166
0
26 May 2023
Masked Language Model Based Textual Adversarial Example Detection
Xiaomei Zhang
Zhaoxi Zhang
Qi Zhong
Xufei Zheng
Yanjun Zhang
Shengshan Hu
L. Zhang
AAML
28
0
0
18 Apr 2023
AdvCat: Domain-Agnostic Robustness Assessment for Cybersecurity-Critical Applications with Categorical Inputs
Helene Orsini
Hongyan Bao
Yujun Zhou
Xiangrui Xu
Yufei Han
Longyang Yi
Wei Wang
Xin Gao
Xiangliang Zhang
AAML
44
1
0
13 Dec 2022
On the Security Vulnerabilities of Text-to-SQL Models
Xutan Peng
Yipeng Zhang
Jingfeng Yang
Mark Stevenson
SILM
31
10
0
28 Nov 2022
Universal Evasion Attacks on Summarization Scoring
Wenchuan Mu
Kwan Hui Lim
AAML
38
1
0
25 Oct 2022
Why Should Adversarial Perturbations be Imperceptible? Rethink the Research Paradigm in Adversarial NLP
Yangyi Chen
Hongcheng Gao
Ganqu Cui
Fanchao Qi
Longtao Huang
Zhiyuan Liu
Maosong Sun
SILM
25
45
0
19 Oct 2022
Rethinking Textual Adversarial Defense for Pre-trained Language Models
Jiayi Wang
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
AAML
SILM
28
11
0
21 Jul 2022
RAF: Recursive Adversarial Attacks on Face Recognition Using Extremely Limited Queries
Keshav Kasichainula
Hadi Mansourifar
W. Shi
AAML
34
1
0
04 Jul 2022
Practice Makes a Solver Perfect: Data Augmentation for Math Word Problem Solvers
Vivek Kumar
Rishabh Maheshwary
Vikram Pudi
AIMat
22
14
0
30 Apr 2022
Understanding, Detecting, and Separating Out-of-Distribution Samples and Adversarial Samples in Text Classification
Cheng-Han Chiang
Hung-yi Lee
OODD
23
1
0
09 Apr 2022
Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model
Jiayi Wang
Rongzhou Bao
Zhuosheng Zhang
Hai Zhao
AAML
29
4
0
19 Mar 2022
Robust Textual Embedding against Word-level Adversarial Attacks
Yichen Yang
Xiaosen Wang
Kun He
AAML
22
16
0
28 Feb 2022
Threats to Pre-trained Language Models: Survey and Taxonomy
Shangwei Guo
Chunlong Xie
Jiwei Li
Lingjuan Lyu
Tianwei Zhang
PILM
27
30
0
14 Feb 2022
TextHacker: Learning based Hybrid Local Search Algorithm for Text Hard-label Adversarial Attack
Zhen Yu
Xiaosen Wang
Wanxiang Che
Kun He
AAML
27
14
0
20 Jan 2022
Robust Natural Language Processing: Recent Advances, Challenges, and Future Directions
Marwan Omar
Soohyeon Choi
Daehun Nyang
David A. Mohaisen
32
57
0
03 Jan 2022
Effective and Imperceptible Adversarial Textual Attack via Multi-objectivization
Shengcai Liu
Ning Lu
W. Hong
Chao Qian
Ke Tang
AAML
22
15
0
02 Nov 2021
Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework
Lifan Yuan
Yichi Zhang
Yangyi Chen
Wei Wei
AAML
21
33
0
28 Oct 2021
Adversarial Examples for Evaluating Math Word Problem Solvers
Vivek Kumar
Rishabh Maheshwary
Vikram Pudi
AAML
30
33
0
13 Sep 2021
Detecting Textual Adversarial Examples through Randomized Substitution and Vote
Xiaosen Wang
Yifeng Xiong
Kun He
AAML
25
11
0
13 Sep 2021
A Strong Baseline for Query Efficient Attacks in a Black Box Setting
Rishabh Maheshwary
Saket Maheshwary
Vikram Pudi
AAML
30
30
0
10 Sep 2021
Multi-granularity Textual Adversarial Attack with Behavior Cloning
Yangyi Chen
Jingtong Su
Wei Wei
AAML
17
32
0
09 Sep 2021
Efficient Combinatorial Optimization for Word-level Adversarial Textual Attack
Shengcai Liu
Ning Lu
Cheng Chen
Ke Tang
AAML
23
32
0
06 Sep 2021
Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
Jiehang Zeng
Xiaoqing Zheng
Jianhan Xu
Linyang Li
Liping Yuan
Xuanjing Huang
AAML
26
67
0
08 May 2021
Gradient-based Adversarial Attacks against Text Transformers
Chuan Guo
Alexandre Sablayrolles
Hervé Jégou
Douwe Kiela
SILM
106
227
0
15 Apr 2021
Token-Modification Adversarial Attacks for Natural Language Processing: A Survey
Tom Roth
Yansong Gao
A. Abuadbba
Surya Nepal
Wei Liu
AAML
43
12
0
01 Mar 2021
Blacklight: Scalable Defense for Neural Networks against Query-Based Black-Box Attacks
Huiying Li
Shawn Shan
Emily Wenger
Jiayun Zhang
Haitao Zheng
Ben Y. Zhao
AAML
23
42
0
24 Jun 2020
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
258
915
0
21 Apr 2018
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
267
13,368
0
25 Aug 2014
1