Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2307.16630
Cited By
Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks
31 July 2023
Xinyu Zhang
Hanbin Hong
Yuan Hong
Peng Huang
Binghui Wang
Zhongjie Ba
Kui Ren
SILM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"Text-CRS: A Generalized Certified Robustness Framework against Textual Adversarial Attacks"
22 / 22 papers shown
Title
Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study
Aryan Agrawal
Lisa Alazraki
Shahin Honarvar
Marek Rei
57
0
0
03 Apr 2025
REFINE: Inversion-Free Backdoor Defense via Model Reprogramming
Y. Chen
Shuo Shao
Enhao Huang
Yiming Li
Pin-Yu Chen
Zhanyue Qin
Kui Ren
AAML
52
3
0
22 Feb 2025
Be a Multitude to Itself: A Prompt Evolution Framework for Red Teaming
Rui Li
Peiyi Wang
Jingyuan Ma
Di Zhang
Lei Sha
Zhifang Sui
LLMAG
46
0
0
22 Feb 2025
Learning Robust and Privacy-Preserving Representations via Information Theory
Binghui Zhang
Sayedeh Leila Noorbakhsh
Yun Dong
Yuan Hong
Binghui Wang
64
0
0
15 Dec 2024
A Certified Robust Watermark For Large Language Models
Xianheng Feng
Jian-wei Liu
Kui Ren
Chun Chen
AAML
WaLM
52
0
0
29 Sep 2024
Seeing Through the Mask: Rethinking Adversarial Examples for CAPTCHAs
Yahya Jabary
Andreas Plesner
Turlan Kuzhagaliyev
Roger Wattenhofer
AAML
29
0
0
09 Sep 2024
Distributed Backdoor Attacks on Federated Graph Learning and Certified Defenses
Yuxin Yang
Qiang Li
Jinyuan Jia
Yuan Hong
Binghui Wang
AAML
FedML
60
11
0
12 Jul 2024
A One-Layer Decoder-Only Transformer is a Two-Layer RNN: With an Application to Certified Robustness
Yuhao Zhang
Aws Albarghouthi
Loris Dántoni
OffRL
31
0
0
27 May 2024
Sandwich attack: Multi-language Mixture Adaptive Attack on LLMs
Bibek Upadhayay
Vahid Behzadan
AAML
26
13
0
09 Apr 2024
Exposing the Deception: Uncovering More Forgery Clues for Deepfake Detection
Zhongjie Ba
Qingyu Liu
Zhenguang Liu
Shuang Wu
Feng Lin
Liwang Lu
Kui Ren
AAML
38
34
0
04 Mar 2024
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
Yichuan Mo
Yuji Wang
Zeming Wei
Yisen Wang
AAML
SILM
49
25
0
09 Feb 2024
Non-Exchangeable Conformal Language Generation with Nearest Neighbors
Dennis Ulmer
Chrysoula Zerva
André F. T. Martins
29
11
0
01 Feb 2024
FLTracer: Accurate Poisoning Attack Provenance in Federated Learning
Xinyu Zhang
Qingyu Liu
Zhongjie Ba
Yuan Hong
Tianhang Zheng
Feng Lin
Liwang Lu
Kui Ren
AAML
34
10
0
20 Oct 2023
Survey of Vulnerabilities in Large Language Models Revealed by Adversarial Attacks
Erfan Shayegani
Md Abdullah Al Mamun
Yu Fu
Pedram Zaree
Yue Dong
Nael B. Abu-Ghazaleh
AAML
147
146
0
16 Oct 2023
General Lipschitz: Certified Robustness Against Resolvable Semantic Transformations via Transformation-Dependent Randomized Smoothing
Dmitrii Korzh
Alireza Azadbakht
Maryam Tahmasbi
Alireza Javaheri
AAML
28
0
0
17 Aug 2023
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao
Jian Lou
Kui Ren
Zhan Qin
AAML
VLM
37
25
0
05 Aug 2023
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence
Hanbin Hong
Xinyu Zhang
Binghui Wang
Zhongjie Ba
Yuan Hong
AAML
22
2
0
10 Apr 2023
It's Morphin' Time! Combating Linguistic Discrimination with Inflectional Perturbations
Samson Tan
Shafiq R. Joty
Min-Yen Kan
R. Socher
166
103
0
09 May 2020
Certified Robustness to Adversarial Word Substitutions
Robin Jia
Aditi Raghunathan
Kerem Göksel
Percy Liang
AAML
183
291
0
03 Sep 2019
Generating Natural Language Adversarial Examples
M. Alzantot
Yash Sharma
Ahmed Elgohary
Bo-Jhang Ho
Mani B. Srivastava
Kai-Wei Chang
AAML
245
914
0
21 Apr 2018
Adversarial Example Generation with Syntactically Controlled Paraphrase Networks
Mohit Iyyer
John Wieting
Kevin Gimpel
Luke Zettlemoyer
AAML
GAN
202
711
0
17 Apr 2018
Convolutional Neural Networks for Sentence Classification
Yoon Kim
AILaw
VLM
255
13,364
0
25 Aug 2014
1