Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
1608.04644
Cited By
v1
v2 (latest)
Towards Evaluating the Robustness of Neural Networks
16 August 2016
Nicholas Carlini
D. Wagner
OOD
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"Towards Evaluating the Robustness of Neural Networks"
50 / 4,015 papers shown
Title
Adversarial Purification and Fine-tuning for Robust UDC Image Restoration
Zhenbo Song
Zhenyuan Zhang
Kaihao Zhang
Wenhan Luo
Zhaoxin Fan
Jianfeng Lu
AAML
114
0
0
21 Feb 2024
QuanTest: Entanglement-Guided Testing of Quantum Neural Network Systems
Jinjing Shi
Zimeng Xiao
Heyuan Shi
Yu Jiang
Xuelong Li
AAML
86
1
0
20 Feb 2024
VGMShield: Mitigating Misuse of Video Generative Models
Yan Pang
Yang Zhang
Yang Zhang
Tianhao Wang
119
3
0
20 Feb 2024
Query-Based Adversarial Prompt Generation
Jonathan Hayase
Ema Borevkovic
Nicholas Carlini
Florian Tramèr
Milad Nasr
AAML
SILM
101
32
0
19 Feb 2024
Adversarial Feature Alignment: Balancing Robustness and Accuracy in Deep Learning via Adversarial Training
L. Park
Jaeuk Kim
Myung Gyo Oh
Jaewoo Park
T.-H. Kwon
AAML
136
5
0
19 Feb 2024
The Effectiveness of Random Forgetting for Robust Generalization
V. Ramkumar
Bahram Zonooz
Elahe Arani
AAML
68
1
0
18 Feb 2024
Maintaining Adversarial Robustness in Continuous Learning
Xiaolei Ru
Xiaowei Cao
Zijia Liu
Jack Murdoch Moore
Xin-Ya Zhang
Xia Zhu
Wenjia Wei
Gang Yan
AAML
71
4
0
17 Feb 2024
PAL: Proxy-Guided Black-Box Attack on Large Language Models
Chawin Sitawarin
Norman Mu
David Wagner
Alexandre Araujo
ELM
84
35
0
15 Feb 2024
Only My Model On My Data: A Privacy Preserving Approach Protecting one Model and Deceiving Unauthorized Black-Box Models
Weiheng Chai
Brian Testa
Huantao Ren
Asif Salekin
Senem Velipasalar
32
0
0
14 Feb 2024
Detecting Adversarial Spectrum Attacks via Distance to Decision Boundary Statistics
Wenwei Zhao
Xiaowen Li
Shangqing Zhao
Jie Xu
Yao-Hong Liu
Zhuo Lu
AAML
53
1
0
14 Feb 2024
Generating Universal Adversarial Perturbations for Quantum Classifiers
Gautham Anil
Vishnu Vinod
Apurva Narayan
AAML
80
5
0
13 Feb 2024
Faster Repeated Evasion Attacks in Tree Ensembles
Lorenzo Cascioli
Laurens Devos
Ondvrej Kuvzelka
Jesse Davis
AAML
57
0
0
13 Feb 2024
Tighter Bounds on the Information Bottleneck with Application to Deep Learning
Nir Weingarten
Z. Yakhini
Moshe Butman
Ran Gilad-Bachrach
AAML
54
1
0
12 Feb 2024
Topological safeguard for evasion attack interpreting the neural networks' behavior
Xabier Echeberria-Barrio
Amaia Gil-Lerchundi
Iñigo Mendialdua
Raul Orduna Urrutia
AAML
57
3
0
12 Feb 2024
Accelerated Smoothing: A Scalable Approach to Randomized Smoothing
Devansh Bhardwaj
Kshitiz Kaushik
Sarthak Gupta
AAML
119
0
0
12 Feb 2024
Fight Back Against Jailbreaking via Prompt Adversarial Tuning
Yichuan Mo
Yuji Wang
Zeming Wei
Yisen Wang
AAML
SILM
98
32
0
09 Feb 2024
Anomaly Unveiled: Securing Image Classification against Adversarial Patch Attacks
Nandish Chattopadhyay
Amira Guesmi
Mohamed Bennai
AAML
69
2
0
09 Feb 2024
Quantifying and Enhancing Multi-modal Robustness with Modality Preference
Zequn Yang
Yake Wei
Ce Liang
Di Hu
AAML
74
10
0
09 Feb 2024
In-Context Learning Can Re-learn Forbidden Tasks
Sophie Xhonneux
David Dobre
Jian Tang
Gauthier Gidel
Dhanya Sridhar
72
5
0
08 Feb 2024
Rapid Optimization for Jailbreaking LLMs via Subconscious Exploitation and Echopraxia
Guangyu Shen
Shuyang Cheng
Kai-xian Zhang
Guanhong Tao
Shengwei An
Lu Yan
Zhuo Zhang
Shiqing Ma
Xiangyu Zhang
78
15
0
08 Feb 2024
Adversarial Robustness Through Artifact Design
Tsufit Shua
Mahmood Sharif
AAML
72
0
0
07 Feb 2024
Enhance DNN Adversarial Robustness and Efficiency via Injecting Noise to Non-Essential Neurons
Zhenyu Liu
Garrett Gagnon
Swagath Venkataramani
Liu Liu
AAML
72
0
0
06 Feb 2024
HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
Mantas Mazeika
Long Phan
Xuwang Yin
Andy Zou
Zifan Wang
...
Nathaniel Li
Steven Basart
Bo Li
David A. Forsyth
Dan Hendrycks
AAML
112
419
0
06 Feb 2024
SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems
Oubo Ma
Yuwen Pu
L. Du
Yang Dai
Ruo Wang
Xiaolei Liu
Yingcai Wu
Shouling Ji
AAML
75
4
0
06 Feb 2024
Transcending Adversarial Perturbations: Manifold-Aided Adversarial Examples with Legitimate Semantics
Shuai Li
Xiaoyu Jiang
Xiaoguang Ma
AAML
81
0
0
05 Feb 2024
Unraveling the Key of Machine Learning Solutions for Android Malware Detection
Jiahao Liu
Jun Zeng
Fabio Pierazzi
Lorenzo Cavallaro
Zhenkai Liang
AAML
83
8
0
05 Feb 2024
A Generative Approach to Surrogate-based Black-box Attacks
Raha Moraffah
Huan Liu
AAML
112
0
0
05 Feb 2024
Exploiting Class Probabilities for Black-box Sentence-level Attacks
Raha Moraffah
Huan Liu
62
1
0
05 Feb 2024
PROSAC: Provably Safe Certification for Machine Learning Models under Adversarial Attacks
Ziquan Liu
Zhuo Zhi
Ilija Bogunovic
Carsten Gerner-Beuerle
Miguel R. D. Rodrigues
AAML
80
0
0
04 Feb 2024
Jailbreaking Attack against Multimodal Large Language Model
Zhenxing Niu
Haoxuan Ji
Xinbo Gao
Gang Hua
Rong Jin
97
76
0
04 Feb 2024
MixedNUTS: Training-Free Accuracy-Robustness Balance via Nonlinearly Mixed Classifiers
Yatong Bai
Mo Zhou
Vishal M. Patel
Somayeh Sojoudi
AAML
96
8
0
03 Feb 2024
Unlearnable Examples For Time Series
Yujing Jiang
Xingjun Ma
S. Erfani
James Bailey
AI4TS
95
1
0
03 Feb 2024
STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition
Yi Chang
Zhao Ren
Zixing Zhang
Xin Jing
Kun Qian
Xi Shao
Bin Hu
Tanja Schultz
Björn W. Schuller
AAML
75
4
0
02 Feb 2024
Trustworthy Distributed AI Systems: Robustness, Privacy, and Governance
Wenqi Wei
Ling Liu
128
20
0
02 Feb 2024
Tropical Decision Boundaries for Neural Networks Are Robust Against Adversarial Attacks
Kurt Pasque
Christopher Teska
Ruriko Yoshida
Keiji Miura
Jefferson Huang
AAML
98
2
0
01 Feb 2024
Comparing Spectral Bias and Robustness For Two-Layer Neural Networks: SGD vs Adaptive Random Fourier Features
Aku Kammonen
Lisi Liang
Anamika Pandey
Raúl Tempone
78
3
0
01 Feb 2024
Game-Theoretic Unlearnable Example Generator
Shuang Liu
Yihan Wang
Xiao-Shan Gao
AAML
78
9
0
31 Jan 2024
AdvGPS: Adversarial GPS for Multi-Agent Perception Attack
Jinlong Li
Baolu Li
Xinyu Liu
Jianwu Fang
Felix Juefei Xu
Qing Guo
Hongkai Yu
78
5
0
30 Jan 2024
Systematically Assessing the Security Risks of AI/ML-enabled Connected Healthcare Systems
Mohammed Elnawawy
Mohammadreza Hallajiyan
Gargi Mitra
Shahrear Iqbal
Karthik Pattabiraman
65
6
0
30 Jan 2024
Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization
Guang Lin
Chao Li
Jianhai Zhang
Toshihisa Tanaka
Qibin Zhao
124
15
0
29 Jan 2024
Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training
Shruthi Gowda
Bahram Zonooz
Elahe Arani
AAML
94
3
0
26 Jan 2024
Boosting the Transferability of Adversarial Examples via Local Mixup and Adaptive Step Size
Junlin Liu
Xinchen Lyu
AAML
70
4
0
24 Jan 2024
A Training-Free Defense Framework for Robust Learned Image Compression
Myungseo Song
Jinyoung Choi
Bohyung Han
AAML
106
4
0
22 Jan 2024
Unraveling Attacks in Machine Learning-based IoT Ecosystems: A Survey and the Open Libraries Behind Them
Chao-Jung Liu
Boxi Chen
Wei Shao
Chris Zhang
Kelvin Wong
Yi Zhang
102
3
0
22 Jan 2024
Cloud-based XAI Services for Assessing Open Repository Models Under Adversarial Attacks
Zerui Wang
Yan Liu
AAML
60
2
0
22 Jan 2024
How Robust Are Energy-Based Models Trained With Equilibrium Propagation?
Siddharth Mansingh
Michal Kucer
Garrett Kenyon
Juston S. Moore
Michael Teti
AAML
110
1
0
21 Jan 2024
Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts
Kiyoon Kim
Shreyank N. Gowda
Panagiotis Eustratiadis
Antreas Antoniou
Robert B Fisher
110
2
0
21 Jan 2024
CARE: Ensemble Adversarial Robustness Evaluation Against Adaptive Attackers for Security Applications
Hangsheng Zhang
Jiqiang Liu
Jinsong Dong
AAML
64
1
0
20 Jan 2024
PuriDefense: Randomized Local Implicit Adversarial Purification for Defending Black-box Query-based Attacks
Ping Guo
Xiang Li
Zhiyuan Yang
Xi Lin
Qingchuan Zhao
Qingfu Zhang
AAML
107
4
0
19 Jan 2024
Cross-Modality Perturbation Synergy Attack for Person Re-identification
Yunpeng Gong
Zhun Zhong
Zhiming Luo
Yansong Qu
Rongrong Ji
Min Jiang
AAML
139
26
0
18 Jan 2024
Previous
1
2
3
...
11
12
13
...
79
80
81
Next