ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1706.06083
  4. Cited By
Towards Deep Learning Models Resistant to Adversarial Attacks

Towards Deep Learning Models Resistant to Adversarial Attacks

19 June 2017
A. Madry
Aleksandar Makelov
Ludwig Schmidt
Dimitris Tsipras
Adrian Vladu
    SILM
    OOD
ArXivPDFHTML

Papers citing "Towards Deep Learning Models Resistant to Adversarial Attacks"

50 / 6,508 papers shown
Title
Towards Effective and Sparse Adversarial Attack on Spiking Neural Networks via Breaking Invisible Surrogate Gradients
Li Lun
Kunyu Feng
Qinglong Ni
Ling Liang
Yuan Wang
Ying Li
Dunshan Yu
Xiaoxin Cui
AAML
81
0
0
05 Mar 2025
CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP
Songlong Xing
Zhengyu Zhao
N. Sebe
AAML
62
1
0
05 Mar 2025
Adversarial Training for Multimodal Large Language Models against Jailbreak Attacks
Adversarial Training for Multimodal Large Language Models against Jailbreak Attacks
Liming Lu
Shuchao Pang
Siyuan Liang
Haotian Zhu
Xiyu Zeng
Aishan Liu
Yunhuai Liu
Yongbin Zhou
AAML
51
2
0
05 Mar 2025
Task-Agnostic Attacks Against Vision Foundation Models
Brian Pulfer
Yury Belousov
Vitaliy Kinakh
Teddy Furon
S. Voloshynovskiy
AAML
77
0
0
05 Mar 2025
Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments
Hanyu Duan
Yi Yang
Ahmed Abbasi
Kar Yan Tam
OOD
97
0
0
05 Mar 2025
LLM-Safety Evaluations Lack Robustness
Tim Beyer
Sophie Xhonneux
Simon Geisler
Gauthier Gidel
Leo Schwinn
Stephan Günnemann
ALM
ELM
215
0
0
04 Mar 2025
S4D-Bio Audio Monitoring of Bone Cement Disintegration in Pulsating Fluid Jet Surgery under Laboratory Conditions
Melanie Schaller
Sergej Hloch
Akash Nag
Dagmar Klichova
Nick Janssen
Frank Pude
Michal Zelenak
Bodo Rosenhahn
74
0
0
04 Mar 2025
DDAD: A Two-pronged Adversarial Defense Based on Distributional Discrepancy
Jiacheng Zhang
Benjamin I. P. Rubinstein
Junge Zhang
Feng Liu
71
0
0
04 Mar 2025
STAR: Stability-Inducing Weight Perturbation for Continual Learning
Masih Eskandar
Tooba Imtiaz
Davin Hill
Zifeng Wang
Jennifer Dy
CLL
36
0
0
03 Mar 2025
AutoAdvExBench: Benchmarking autonomous exploitation of adversarial example defenses
Nicholas Carlini
Javier Rando
Edoardo Debenedetti
Milad Nasr
F. Tramèr
AAML
ELM
47
2
0
03 Mar 2025
Same Question, Different Words: A Latent Adversarial Framework for Prompt Robustness
Tingchen Fu
Fazl Barez
AAML
65
0
0
03 Mar 2025
Generalizable Prompt Learning of CLIP: A Brief Overview
Generalizable Prompt Learning of CLIP: A Brief Overview
Fangming Cui
Yonggang Zhang
Xuan Wang
Xule Wang
Liang Xiao
VPVLM
VLM
200
0
0
03 Mar 2025
Adversarial Agents: Black-Box Evasion Attacks with Reinforcement Learning
Kyle Domico
Jean-Charles Noirot Ferrand
Ryan Sheatsley
Eric Pauley
Josiah Hanna
Patrick McDaniel
AAML
34
1
0
03 Mar 2025
Transformer Meets Twicing: Harnessing Unattended Residual Information
Laziz U. Abdullaev
Tan M. Nguyen
41
2
0
02 Mar 2025
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions
TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions
Wang YuHang
Junkang Guo
Aolei Liu
Kaihao Wang
Zaitong Wu
Zhenyu Liu
Wenfei Yin
Jian Liu
AAML
50
0
0
02 Mar 2025
AMUN: Adversarial Machine UNlearning
AMUN: Adversarial Machine UNlearning
A. Boroojeny
Hari Sundaram
Varun Chandrasekaran
MU
AAML
48
0
0
02 Mar 2025
A Survey of Adversarial Defenses in Vision-based Systems: Categorization, Methods and Challenges
Nandish Chattopadhyay
Abdul Basit
B. Ouni
Muhammad Shafique
AAML
31
0
0
01 Mar 2025
Continuous Adversarial Text Representation Learning for Affective Recognition
Continuous Adversarial Text Representation Learning for Affective Recognition
Seungah Son
Andrez Saurez
Dongsoo Har
40
0
0
28 Feb 2025
Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Fast Adversarial Training against Sparse Attacks Requires Loss Smoothing
Xuyang Zhong
Yixiao Huang
Chen Liu
AAML
46
0
0
28 Feb 2025
Concealed Adversarial attacks on neural networks for sequential data
Concealed Adversarial attacks on neural networks for sequential data
P. Sokerin
Dmitry Anikin
Sofia Krehova
Alexey Zaytsev
AAML
AI4TS
49
0
0
28 Feb 2025
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior
Data-free Universal Adversarial Perturbation with Pseudo-semantic Prior
Chanhui Lee
Yeonghwan Song
Jeany Son
AAML
180
0
0
28 Feb 2025
Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness
Exploring the Impact of Temperature Scaling in Softmax for Classification and Adversarial Robustness
Hao Xuan
Bokai Yang
Xingyu Li
AAML
49
2
0
28 Feb 2025
QFAL: Quantum Federated Adversarial Learning
QFAL: Quantum Federated Adversarial Learning
Walid El Maouaki
Nouhaila Innan
Alberto Marchisio
Taoufik Said
Mohamed Bennai
Muhammad Shafique
FedML
58
4
0
28 Feb 2025
UDora: A Unified Red Teaming Framework against LLM Agents by Dynamically Hijacking Their Own Reasoning
J.N. Zhang
Shuang Yang
B. Li
AAML
LLMAG
58
0
0
28 Feb 2025
Decoder Gradient Shield: Provable and High-Fidelity Prevention of Gradient-Based Box-Free Watermark Removal
Decoder Gradient Shield: Provable and High-Fidelity Prevention of Gradient-Based Box-Free Watermark Removal
Haonan An
Guang Hua
Zhengru Fang
Guowen Xu
Susanto Rahardja
Yuguang Fang
AAML
53
0
0
28 Feb 2025
SafeText: Safe Text-to-image Models via Aligning the Text Encoder
SafeText: Safe Text-to-image Models via Aligning the Text Encoder
Yuepeng Hu
Zhengyuan Jiang
Neil Zhenqiang Gong
69
1
0
28 Feb 2025
Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack
Improving Adversarial Transferability in MLLMs via Dynamic Vision-Language Alignment Attack
Chenhe Gu
Jindong Gu
Andong Hua
Yao Qin
AAML
47
0
0
27 Feb 2025
LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks
LISArD: Learning Image Similarity to Defend Against Gray-box Adversarial Attacks
Joana Cabral Costa
Tiago Roxo
Hugo Manuel Proença
Pedro R. M. Inácio
AAML
57
0
0
27 Feb 2025
Snowball Adversarial Attack on Traffic Sign Classification
Snowball Adversarial Attack on Traffic Sign Classification
Anthony Etim
Jakub Szefer
AAML
56
0
0
27 Feb 2025
HALO: Robust Out-of-Distribution Detection via Joint Optimisation
HALO: Robust Out-of-Distribution Detection via Joint Optimisation
Hugo Lyons Keenan
S. Erfani
Christopher Leckie
OODD
212
0
0
27 Feb 2025
Neural Antidote: Class-Wise Prompt Tuning for Purifying Backdoors in Pre-trained Vision-Language Models
Neural Antidote: Class-Wise Prompt Tuning for Purifying Backdoors in Pre-trained Vision-Language Models
Jiawei Kong
Hao Fang
Sihang Guo
Chenxi Qing
Bin Chen
Bin Wang
Shu-Tao Xia
AAML
VLM
90
0
0
26 Feb 2025
The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields
The NeRF Signature: Codebook-Aided Watermarking for Neural Radiance Fields
Ziyuan Luo
Anderson de Rezende Rocha
Boxin Shi
Qing Guo
Haoliang Li
Renjie Wan
49
0
0
26 Feb 2025
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification
Mingkun Zhang
Keping Bi
Wei Chen
J. Guo
Xueqi Cheng
BDL
VLM
52
1
0
25 Feb 2025
Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation
Model-Free Adversarial Purification via Coarse-To-Fine Tensor Network Representation
Guang Lin
D. Nguyen
Zerui Tao
Konstantinos Slavakis
Toshihisa Tanaka
Qibin Zhao
AAML
64
0
0
25 Feb 2025
MACPruning: Dynamic Operation Pruning to Mitigate Side-Channel DNN Model Extraction
MACPruning: Dynamic Operation Pruning to Mitigate Side-Channel DNN Model Extraction
Ruyi Ding
Cheng Gongye
Davis Ranney
A. A. Ding
Yunsi Fei
AAML
68
0
0
24 Feb 2025
Improved Diffusion-based Generative Model with Better Adversarial Robustness
Improved Diffusion-based Generative Model with Better Adversarial Robustness
Zekun Wang
Mingyang Yi
Shuchen Xue
Ziyu Li
Ming Liu
Bing Qin
Zhi-Ming Ma
DiffM
42
0
0
24 Feb 2025
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation
Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation
Wenyuan Wu
Zheng Liu
Yong Chen
Chao Su
Dezhong Peng
Xu Wang
AAML
39
0
0
24 Feb 2025
EigenShield: Causal Subspace Filtering via Random Matrix Theory for Adversarially Robust Vision-Language Models
EigenShield: Causal Subspace Filtering via Random Matrix Theory for Adversarially Robust Vision-Language Models
Nastaran Darabi
Devashri Naik
Sina Tayebati
Dinithi Jayasuriya
Ranganath Krishnan
A. R. Trivedi
AAML
52
0
0
24 Feb 2025
Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability
Interpreting Adversarial Attacks and Defences using Architectures with Enhanced Interpretability
Akshay G Rao
Chandrashekhar Lakshminarayanan
Arun Rajkumar
AI4CE
AAML
39
0
0
24 Feb 2025
SMTFL: Secure Model Training to Untrusted Participants in Federated Learning
SMTFL: Secure Model Training to Untrusted Participants in Federated Learning
Zhihui Zhao
Xiaorong Dong
Yimo Ren
Jianhua Wang
Dan Yu
Hongsong Zhu
Yongle Chen
86
0
0
24 Feb 2025
A stochastic smoothing framework for nonconvex-nonconcave min-sum-max problems with applications to Wasserstein distributionally robust optimization
A stochastic smoothing framework for nonconvex-nonconcave min-sum-max problems with applications to Wasserstein distributionally robust optimization
Wei Liu
Muhammad Khan
Gabriel Mancino-Ball
Yangyang Xu
44
0
0
24 Feb 2025
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images
Yubo Wang
Jianting Tang
Chaohu Liu
Linli Xu
AAML
63
1
0
23 Feb 2025
Class-Conditional Neural Polarizer: A Lightweight and Effective Backdoor Defense by Purifying Poisoned Features
Class-Conditional Neural Polarizer: A Lightweight and Effective Backdoor Defense by Purifying Poisoned Features
Mingli Zhu
Shaokui Wei
Hongyuan Zha
Baoyuan Wu
AAML
44
0
0
23 Feb 2025
Can Indirect Prompt Injection Attacks Be Detected and Removed?
Can Indirect Prompt Injection Attacks Be Detected and Removed?
Yulin Chen
Haoran Li
Yuan Sui
Yufei He
Yue Liu
Yangqiu Song
Bryan Hooi
AAML
44
3
0
23 Feb 2025
Unified Prompt Attack Against Text-to-Image Generation Models
Unified Prompt Attack Against Text-to-Image Generation Models
Duo Peng
Qiuhong Ke
Mark He Huang
Ping Hu
Jun Liu
50
0
0
23 Feb 2025
A generative approach to LLM harmfulness detection with special red flag tokens
A generative approach to LLM harmfulness detection with special red flag tokens
Sophie Xhonneux
David Dobre
Mehrnaz Mohfakhami
Leo Schwinn
Gauthier Gidel
55
1
0
22 Feb 2025
SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks
SEA: Shareable and Explainable Attribution for Query-based Black-box Attacks
Yue Gao
Ilia Shumailov
Kassem Fawaz
AAML
148
0
0
21 Feb 2025
Nearshore Underwater Target Detection Meets UAV-borne Hyperspectral Remote Sensing: A Novel Hybrid-level Contrastive Learning Framework and Benchmark Dataset
Nearshore Underwater Target Detection Meets UAV-borne Hyperspectral Remote Sensing: A Novel Hybrid-level Contrastive Learning Framework and Benchmark Dataset
Jiahao Qi
Chuanhong Zhou
Xingyue Liu
Chen Chen
Dehui Zhu
Kangcheng Bin
Ping Zhong
74
0
0
21 Feb 2025
Tight Clusters Make Specialized Experts
Tight Clusters Make Specialized Experts
Stefan K. Nielsen
R. Teo
Laziz U. Abdullaev
Tan M. Nguyen
MoE
66
2
0
21 Feb 2025
CyberSentinel: An Emergent Threat Detection System for AI Security
CyberSentinel: An Emergent Threat Detection System for AI Security
Krti Tallam
44
2
0
20 Feb 2025
Previous
123456...129130131
Next