arXiv:1801.04354
Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers
13 January 2018
Ji Gao
Jack Lanchantin
M. Soffa
Yanjun Qi
AAML
Papers citing
"Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers"
50 / 360 papers shown
Title
Text Generation: A Systematic Literature Review of Tasks, Evaluation, and Challenges
Jonas Becker
Jan Philip Wahle
Bela Gipp
Terry Ruas
31
9
0
24 May 2024
RAID: A Shared Benchmark for Robust Evaluation of Machine-Generated Text Detectors
Liam Dugan
Alyssa Hwang
Filip Trhlik
Josh Magnus Ludan
Andrew Zhu
Hainiu Xu
Daphne Ippolito
Christopher Callison-Burch
DeLMO
AAML
35
44
0
13 May 2024
Automated Program Repair: Emerging trends pose and expose problems for benchmarks
J. Renzullo
Pemma Reiter
Westley Weimer
Stephanie Forrest
42
1
0
08 May 2024
Revisiting character-level adversarial attacks
Elias Abad Rocamora
Yongtao Wu
Fanghui Liu
Grigorios G. Chrysos
V. Cevher
AAML
39
3
0
07 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xuebo Liu
Lidia S. Chao
Min Zhang
DeLMO
46
4
0
07 May 2024
Adversarial Attacks and Defense for Conversation Entailment Task
Zhenning Yang
Ryan Krawec
Liang-Yuan Wu
AAML
SILM
27
1
0
01 May 2024
Human-in-the-Loop Synthetic Text Data Inspection with Provenance Tracking
Hong Jin Kang
Fabrice Harel-Canada
Muhammad Ali Gulzar
Violet Peng
Miryung Kim
44
2
0
29 Apr 2024
Talking Nonsense: Probing Large Language Models' Understanding of Adversarial Gibberish Inputs
Valeriia Cherepanova
James Zou
AAML
33
4
0
26 Apr 2024
Typos that Broke the RAG's Back: Genetic Attack on RAG Pipeline by Simulating Documents in the Wild via Low-level Perturbations
Sukmin Cho
Soyeong Jeong
Jeongyeon Seo
Taeho Hwang
Jong C. Park
SILM
AAML
52
27
0
22 Apr 2024
Advancing the Robustness of Large Language Models through Self-Denoised Smoothing
Jiabao Ji
Bairu Hou
Zhen Zhang
Guanhua Zhang
Wenqi Fan
Qing Li
Yang Zhang
Gaowen Liu
Sijia Liu
Shiyu Chang
AAML
43
6
0
18 Apr 2024
GenFighter: A Generative and Evolutive Textual Attack Removal
Md Athikul Islam
Edoardo Serra
Sushil Jajodia
AAML
29
0
0
17 Apr 2024
Resilience of Large Language Models for Noisy Instructions
Bin Wang
Chengwei Wei
Zhengyuan Liu
Geyu Lin
Nancy F. Chen
49
11
0
15 Apr 2024
SpamDam: Towards Privacy-Preserving and Adversary-Resistant SMS Spam Detection
Yekai Li
Rufan Zhang
Wenxin Rong
Xianghang Mi
42
2
0
15 Apr 2024
Towards Building a Robust Toxicity Predictor
Dmitriy Bespalov
Sourav S. Bhabesh
Yi Xiang
Liutong Zhou
Yanjun Qi
AAML
116
10
0
09 Apr 2024
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods
Roopkatha Dey
Aivy Debnath
Sayak Kumar Dutta
Kaustav Ghosh
Arijit Mitra
Arghya Roy Chowdhury
Jaydip Sen
AAML
SILM
29
1
0
08 Apr 2024
Goal-guided Generative Prompt Injection Attack on Large Language Models
Chong Zhang
Mingyu Jin
Qinkai Yu
Chengzhi Liu
Haochen Xue
Xiaobo Jin
AAML
SILM
42
12
0
06 Apr 2024
Adversarial Attacks and Dimensionality in Text Classifiers
Nandish Chattopadhyay
Atreya Goswami
Anupam Chattopadhyay
SILM
AAML
21
1
0
03 Apr 2024
READ: Improving Relation Extraction from an ADversarial Perspective
Dawei Li
William Hogan
Jingbo Shang
AAML
36
0
0
02 Apr 2024
Cross-Lingual Transfer Robustness to Lower-Resource Languages on Adversarial Datasets
Shadi Manafi
Nikhil Krishnaswamy
AAML
48
0
0
29 Mar 2024
SemRoDe: Macro Adversarial Training to Learn Representations That are Robust to Word-Level Attacks
Brian Formento
Wenjie Feng
Chuan-Sheng Foo
Anh Tuan Luu
See-Kiong Ng
AAML
34
7
0
27 Mar 2024
Subspace Defense: Discarding Adversarial Perturbations by Learning a Subspace for Clean Signals
Rui Zheng
Yuhao Zhou
Zhiheng Xi
Tao Gui
Qi Zhang
Xuanjing Huang
AAML
55
0
0
24 Mar 2024
Monotonic Paraphrasing Improves Generalization of Language Model Prompting
Qin Liu
Fei Wang
Nan Xu
Tianyi Yan
Tao Meng
Muhao Chen
LRM
43
7
0
24 Mar 2024
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
J. Asl
Mohammad H. Rafiei
Manar Alohaly
Daniel Takabi
AAML
SILM
31
3
0
18 Mar 2024
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models
Hetvi Waghela
Sneha Rakshit
Jaydip Sen
AAML
31
7
0
17 Mar 2024
Generating Hard-Negative Out-of-Scope Data with ChatGPT for Intent Classification
Zhijian Li
Stefan Larson
Kevin Leach
OODD
34
1
0
08 Mar 2024
Extreme Miscalibration and the Illusion of Adversarial Robustness
Vyas Raina
Samson Tan
V. Cevher
Aditya Rawal
Sheng Zha
George Karypis
AAML
41
2
0
27 Feb 2024
Unveiling Vulnerability of Self-Attention
Khai Jiet Liong
Hongqiu Wu
Haizhen Zhao
36
0
0
26 Feb 2024
RoCoIns: Enhancing Robustness of Large Language Models through Code-Style Instructions
Yuan Zhang
Xiao Wang
Zhiheng Xi
Han Xia
Tao Gui
Qi Zhang
Xuanjing Huang
45
3
0
26 Feb 2024
ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang
Yun Tang
Wenjie Ruan
Xiaowei Huang
Siddartha Khastgir
P. Jennings
Xingyu Zhao
AAML
35
4
0
23 Feb 2024
Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment
Vyas Raina
Adian Liusie
Mark J. F. Gales
AAML
ELM
32
53
0
21 Feb 2024
Stealthy Attack on Large Language Model based Recommendation
Jinghao Zhang
Yuting Liu
Qiang Liu
Shu Wu
Guibing Guo
Liang Wang
35
13
0
18 Feb 2024
A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models
Cuong Dang
Dung D. Le
Thai Le
AAML
34
2
0
18 Feb 2024
Contrastive Instruction Tuning
Tianyi Yan
Fei Wang
James Y. Huang
Wenxuan Zhou
Fan Yin
Aram Galstyan
Wenpeng Yin
Muhao Chen
ALM
27
5
0
17 Feb 2024
PoisonedRAG: Knowledge Poisoning Attacks to Retrieval-Augmented Generation of Large Language Models
Wei Zou
Runpeng Geng
Binghui Wang
Jinyuan Jia
SILM
39
45
1
12 Feb 2024
Tighter Bounds on the Information Bottleneck with Application to Deep Learning
Nir Weingarten
Z. Yakhini
Moshe Butman
Ran Gilad-Bachrach
AAML
30
1
0
12 Feb 2024
Arabic Synonym BERT-based Adversarial Examples for Text Classification
Norah M. Alshahrani
Saied Alshahrani
Esma Wali
Jeanna Neefe Matthews
AAML
22
5
0
05 Feb 2024
Fast Adversarial Training against Textual Adversarial Attacks
Yichen Yang
Xin Liu
Kun He
AAML
16
4
0
23 Jan 2024
Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers
Tuc Nguyen
Thai Le
AAML
SILM
MoE
11
1
0
18 Jan 2024
Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text
Mazal Bethany
Brandon Wherry
Emet Bethany
Nishant Vishwamitra
Anthony Rios
Peyman Najafirad
DeLMO
36
3
0
17 Jan 2024
Stability Analysis of ChatGPT-based Sentiment Analysis in AI Quality Assurance
Tinghui Ouyang
AprilPyone Maungmaung
Koichi Konishi
Yoshiki Seo
Isao Echizen
AI4MH
23
5
0
15 Jan 2024
Tree of Attacks: Jailbreaking Black-Box LLMs Automatically
Anay Mehrotra
Manolis Zampetakis
Paul Kassianik
Blaine Nelson
Hyrum Anderson
Yaron Singer
Amin Karbasi
35
206
0
04 Dec 2023
SenTest: Evaluating Robustness of Sentence Encoders
Tanmay Chavan
Shantanu Patankar
Aditya Kane
Omkar Gokhale
Geetanjali Kale
Raviraj Joshi
24
0
0
29 Nov 2023
Improving the Robustness of Transformer-based Large Language Models with Dynamic Attention
Lujia Shen
Yuwen Pu
Shouling Ji
Changjiang Li
Xuhong Zhang
Chunpeng Ge
Ting Wang
AAML
29
3
0
29 Nov 2023
RETSim: Resilient and Efficient Text Similarity
Marina Zhang
Owen Vallis
Aysegul Bumin
Tanay Vakharia
Elie Bursztein
36
1
0
28 Nov 2023
Generating Valid and Natural Adversarial Examples with Large Language Models
Zimu Wang
Wei Wang
Qi Chen
Qiufeng Wang
Anh Nguyen
AAML
21
4
0
20 Nov 2023
Hijacking Large Language Models via Adversarial In-Context Learning
Yao Qiang
Xiangyu Zhou
Dongxiao Zhu
32
32
0
16 Nov 2023
Whispers of Doubt Amidst Echoes of Triumph in NLP Robustness
Ashim Gupta
Rishanth Rajendhran
Nathan Stringham
Vivek Srikumar
Ana Marasović
AAML
31
3
0
16 Nov 2023
DALA: A Distribution-Aware LoRA-Based Adversarial Attack against Language Models
Yibo Wang
Xiangjue Dong
James Caverlee
Philip S. Yu
29
2
0
14 Nov 2023
Robust Text Classification: Analyzing Prototype-Based Networks
Zhivar Sourati
D. Deshpande
Filip Ilievski
Kiril Gashteovski
S. Saralajew
OOD
OffRL
39
2
0
11 Nov 2023
Towards Effective Paraphrasing for Information Disguise
Anmol Agarwal
Shrey Gupta
Vamshi Krishna Bonagiri
Manas Gaur
Joseph M. Reagle
Ponnurangam Kumaraguru
35
3
0
08 Nov 2023