Papers
Communities
Organizations
Events
Blog
Pricing
Feedback
Contact Sales
Search
Open menu
Home
Papers
All Papers
Title
Home
Papers
1902.02918
Cited By
v1
v2 (latest)
Certified Adversarial Robustness via Randomized Smoothing
8 February 2019
Jeremy M. Cohen
Elan Rosenfeld
J. Zico Kolter
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Github (390★)
Papers citing
"Certified Adversarial Robustness via Randomized Smoothing"
50 / 1,327 papers shown
Title
Defending Large Language Models against Jailbreak Attacks via Semantic Smoothing
Jiabao Ji
Bairu Hou
Alexander Robey
George J. Pappas
Hamed Hassani
Yang Zhang
Eric Wong
Shiyu Chang
AAML
141
56
0
25 Feb 2024
Optimal Zero-Shot Detector for Multi-Armed Attacks
Federica Granese
Marco Romanelli
Pablo Piantanida
AAML
130
0
0
24 Feb 2024
Holding Secrets Accountable: Auditing Privacy-Preserving Machine Learning
Hidde Lycklama
Alexander Viand
Nicolas Küchler
Christian Knabenhans
Anwar Hithnawi
131
7
0
24 Feb 2024
ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation
Yi Zhang
Yun Tang
Wenjie Ruan
Xiaowei Huang
Siddartha Khastgir
P. Jennings
Xingyu Zhao
AAML
139
6
0
23 Feb 2024
A Robust Defense against Adversarial Attacks on Deep Learning-based Malware Detectors via (De)Randomized Smoothing
Daniel Gibert
Giulio Zizzo
Quan Le
Jordi Planes
AAML
105
4
0
23 Feb 2024
On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Yihao Zhang
Hangzhou He
Jingyu Zhu
Huanran Chen
Yifei Wang
Zeming Wei
AAML
160
17
0
23 Feb 2024
Verifiable Boosted Tree Ensembles
Stefano Calzavara
Lorenzo Cazzaro
Claudio Lucchese
Giulio Ermanno Pibiri
AAML
99
0
0
22 Feb 2024
SoK: Analyzing Adversarial Examples: A Framework to Study Adversary Knowledge
L. Fenaux
Florian Kerschbaum
AAML
145
0
0
22 Feb 2024
Verifying message-passing neural networks via topology-based bounds tightening
Christopher Hojny
Shiqiang Zhang
Juan S. Campos
Ruth Misener
AAML
150
9
0
21 Feb 2024
Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training
L. Park
Jaeuk Kim
Myung Gyo Oh
Jaewoo Park
T.-H. Kwon
AAML
162
7
0
19 Feb 2024
Endowing Pre-trained Graph Models with Provable Fairness
Zhongjian Zhang
Mengmei Zhang
Yue Yu
Cheng Yang
Jiawei Liu
Chuan Shi
71
9
0
19 Feb 2024
Trust Regions for Explanations via Black-Box Probabilistic Certification
Amit Dhurandhar
Swagatam Haldar
Dennis L. Wei
Karthikeyan N. Ramamurthy
FAtt
143
3
0
17 Feb 2024
Quantum-Inspired Analysis of Neural Network Vulnerabilities: The Role of Conjugate Variables in System Attacks
Jun-Jie Zhang
Deyu Meng
AAML
113
3
0
16 Feb 2024
Adaptive Hierarchical Certification for Segmentation using Randomized Smoothing
Alaa Anani
Tobias Lorenz
Bernt Schiele
Mario Fritz
70
2
0
13 Feb 2024
Improving Black-box Robustness with In-Context Rewriting
Kyle O'Brien
Nathan Ng
Isha Puri
Jorge Mendez
Hamid Palangi
Yoon Kim
Marzyeh Ghassemi
Tom Hartvigsen
152
7
0
13 Feb 2024
Accelerated Smoothing: A Scalable Approach to Randomized Smoothing
Devansh Bhardwaj
Kshitiz Kaushik
Sarthak Gupta
AAML
197
0
0
12 Feb 2024
A Random Ensemble of Encrypted Vision Transformers for Adversarially Robust Defense
Ryota Iijima
Sayaka Shiota
Hitoshi Kiya
120
7
0
11 Feb 2024
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang
Yake Wei
Ce Liang
Di Hu
AAML
102
10
0
09 Feb 2024
Is Adversarial Training with Compressed Datasets Effective?
Tong Chen
Raghavendra Selvan
AAML
206
1
0
08 Feb 2024
Redesigning Traffic Signs to Mitigate Machine-Learning Patch Attacks
Tsufit Shua
Liron David
Mahmood Sharif
AAML
90
0
0
07 Feb 2024
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika
Long Phan
Xuwang Yin
Andy Zou
Zifan Wang
...
Nathaniel Li
Steven Basart
Bo Li
David A. Forsyth
Dan Hendrycks
AAML
163
520
0
06 Feb 2024
Disparate Impact on Group Accuracy of Linearization for Private Inference
Saswat Das
Marco Romanelli
Ferdinando Fioretto
FedML
100
4
0
06 Feb 2024
PreGIP: Watermarking the Pretraining of Graph Neural Networks for Deep Intellectual Property Protection
Enyan Dai
Min Lin
Suhang Wang
116
3
0
06 Feb 2024
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks
Ziquan Liu
Zhuo Zhi
Ilija Bogunovic
Carsten Gerner-Beuerle
Miguel R. D. Rodrigues
AAML
108
0
0
04 Feb 2024
Your Diffusion Model is Secretly a Certifiably Robust Classifier
Huanran Chen
Yinpeng Dong
Shitong Shao
Zhongkai Hao
Xiao Yang
Hang Su
Jun Zhu
DiffM
169
19
0
04 Feb 2024
Building Guardrails for Large Language Models
Yizhen Dong
Ronghui Mu
Gao Jin
Yi Qi
Jinwei Hu
Xingyu Zhao
Jie Meng
Wenjie Ruan
Xiaowei Huang
OffRL
179
45
0
02 Feb 2024
An Information Theoretic Approach to Machine Unlearning
Jack Foster
Kyle Fogarty
Stefan Schoepf
Cengiz Öztireli
Alexandra Brintrup
MU
111
9
0
02 Feb 2024
Double-Dip: Thwarting Label-Only Membership Inference Attacks with Transfer Learning and Randomization
Arezoo Rajabi
Reeya Pimple
Aiswarya Janardhanan
Surudhi Asokraj
Bhaskar Ramasubramanian
Radha Poovendran
99
0
0
02 Feb 2024
Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance
Wenqi Wei
Ling Liu
157
26
0
02 Feb 2024
Security and Privacy Challenges of Large Language Models: A Survey
B. Das
M. H. Amini
Yanzhao Wu
PILM
ELM
163
193
0
30 Jan 2024
Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization
Guang Lin
Chao Li
Jianhai Zhang
Toshihisa Tanaka
Qibin Zhao
182
17
0
29 Jan 2024
RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees
Xun Xian
Ganghua Wang
Xuan Bi
Jayanth Srinivasa
Jayanth Srinivasa
Mingyi Hong
Jie Ding
WIGM
74
5
0
23 Jan 2024
WARM: On the Benefits of Weight Averaged Reward Models
Alexandre Ramé
Nino Vieillard
Léonard Hussenot
Robert Dadashi
Geoffrey Cideron
Olivier Bachem
Johan Ferret
244
115
0
22 Jan 2024
PuriDefense: Randomized Local Implicit Adversarial Purification for Defending Black-box Query-based Attacks
Ping Guo
Xiang Li
Zhiyuan Yang
Xi Lin
Qingchuan Zhao
Qingfu Zhang
AAML
145
4
0
19 Jan 2024
Predominant Aspects on Security for Quantum Machine Learning: Literature Review
Nicola Franco
Alona Sakhnenko
Leon Stolpmann
Daniel Thuerck
Fabian Petsch
Annika Rüll
J. M. Lorenz
91
13
0
15 Jan 2024
Adversarial Examples are Misaligned in Diffusion Model Manifolds
P. Lorenz
Ricard Durall
Jansi Keuper
DiffM
245
1
0
12 Jan 2024
DEM: A Method for Certifying Deep Neural Network Classifier Outputs in Aerospace
Guy Katz
Natan Levy
Idan Refaeli
Raz Yerushalmi
AAML
105
1
0
04 Jan 2024
Trust, But Verify: A Survey of Randomized Smoothing Techniques
Anupriya Kumari
Devansh Bhardwaj
Sukrit Jindal
Sarthak Gupta
AAML
127
2
0
19 Dec 2023
The Ultimate Combo: Boosting Adversarial Example Transferability by Composing Data Augmentations
Zebin Yun
Achi-Or Weingarten
Eyal Ronen
Mahmood Sharif
99
2
0
18 Dec 2023
The Pros and Cons of Adversarial Robustness
Yacine Izza
Sasha Rubin
AAML
69
1
0
18 Dec 2023
Adversarial Robustness on Image Classification with
k
k
k
-means
Rollin Omari
Junae Kim
Paul Montague
OOD
VLM
64
0
0
15 Dec 2023
Exploring Transferability for Randomized Smoothing
Kai Qiu
Huishuai Zhang
Zhirong Wu
Stephen Lin
AAML
72
1
0
14 Dec 2023
Robust MRI Reconstruction by Smoothed Unrolling (SMUG)
Shijun Liang
Van Hoang Minh Nguyen
Jinghan Jia
Ismail Alkhouri
Sijia Liu
S. Ravishankar
85
1
0
12 Dec 2023
May the Noise be with you: Adversarial Training without Adversarial Examples
Ayoub Arous
A. F. López-Lopera
Nael B. Abu-Ghazaleh
Ihsen Alouani
AAML
OOD
58
0
0
12 Dec 2023
QuadAttack: A Quadratic Programming Approach to Ordered Top-K Attacks
Thomas Paniagua
Ryan Grainger
Tianfu Wu
AAML
91
0
0
12 Dec 2023
Adversarial Estimation of Topological Dimension with Harmonic Score Maps
Eric C. Yeats
Cameron Darwin
Frank Liu
Hai Li
125
2
0
11 Dec 2023
Reward Certification for Policy Smoothed Reinforcement Learning
Ronghui Mu
Leandro Soriano Marcolino
Tianle Zhang
Yanghao Zhang
Xiaowei Huang
Wenjie Ruan
113
6
0
11 Dec 2023
BELT: Old-School Backdoor Attacks can Evade the State-of-the-Art Defense with Backdoor Exclusivity Lifting
Huming Qiu
Junjie Sun
Mi Zhang
Xudong Pan
Min Yang
AAML
134
5
0
08 Dec 2023
Node-aware Bi-smoothing: Certified Robustness against Graph Injection Attacks
Y. Lai
Yulin Zhu
Bailin Pan
Wei Song
AAML
107
9
0
07 Dec 2023
Indirect Gradient Matching for Adversarial Robust Distillation
Hongsin Lee
Seungju Cho
Changick Kim
AAML
FedML
136
2
0
06 Dec 2023
Previous
1
2
3
...
5
6
7
...
25
26
27
Next