1412.6572
Cited By
Explaining and Harnessing Adversarial Examples
20 December 2014
Ian Goodfellow
Jonathon Shlens
Christian Szegedy
AAML
GAN
Papers citing
"Explaining and Harnessing Adversarial Examples"
50 / 3,760 papers shown
Title
MIMIR: Masked Image Modeling for Mutual Information-based Adversarial Robustness
Xiaoyun Xu
Shujian Yu
Jingzheng Wu
S. Picek
AAML
40
0
0
08 Dec 2023
SoK: Unintended Interactions among Machine Learning Defenses and Risks
Vasisht Duddu
S. Szyller
Nadarajah Asokan
AAML
58
2
0
07 Dec 2023
Defense Against Adversarial Attacks using Convolutional Auto-Encoders
Shreyasi Mandal
AAML
31
1
0
06 Dec 2023
Indirect Gradient Matching for Adversarial Robust Distillation
Hongsin Lee
Seungju Cho
Changick Kim
AAML
FedML
53
2
0
06 Dec 2023
ScAR: Scaling Adversarial Robustness for LiDAR Object Detection
Xiaohu Lu
H. Radha
AAML
3DPC
44
0
0
05 Dec 2023
Scaling Laws for Adversarial Attacks on Language Model Activations
Stanislav Fort
26
15
0
05 Dec 2023
InstructTA: Instruction-Tuned Targeted Attack for Large Vision-Language Models
Xunguang Wang
Zhenlan Ji
Pingchuan Ma
Zongjie Li
Shuai Wang
MLLM
48
12
0
04 Dec 2023
Rethinking Adversarial Training with Neural Tangent Kernel
Guanlin Li
Han Qiu
Shangwei Guo
Jiwei Li
Tianwei Zhang
AAML
34
0
0
04 Dec 2023
IMMA: Immunizing text-to-image Models against Malicious Adaptation
Yijia Zheng
Raymond A. Yeh
64
8
0
30 Nov 2023
Improving Feature Stability during Upsampling -- Spectral Artifacts and the Importance of Spatial Context
Shashank Agnihotri
Julia Grabinski
Margret Keuper
35
6
0
29 Nov 2023
NeRFTAP: Enhancing Transferability of Adversarial Patches on Face Recognition using Neural Radiance Fields
Xiaoliang Liu
Shen Furao
Feng Han
Jian Zhao
Changhai Nie
AAML
33
0
0
29 Nov 2023
Vulnerability Analysis of Transformer-based Optical Character Recognition to Adversarial Attacks
Lucas Beerens
D. Higham
41
1
0
28 Nov 2023
Unsupervised Multimodal Deepfake Detection Using Intra- and Cross-Modal Inconsistencies
Mulin Tian
Mahyar Khayatkhoei
Joe Mathai
Wael AbdAlmageed
47
6
0
28 Nov 2023
CLAP: Isolating Content from Style through Contrastive Learning with Augmented Prompts
Yichao Cai
Yuhang Liu
Zhen Zhang
Javen Qinfeng Shi
CLIP
VLM
39
8
0
28 Nov 2023
Concept Distillation: Leveraging Human-Centered Explanations for Model Improvement
Avani Gupta
Saurabh Saini
P. J. Narayanan
38
7
0
26 Nov 2023
Mixing Classifiers to Alleviate the Accuracy-Robustness Trade-Off
Yatong Bai
Brendon G. Anderson
Somayeh Sojoudi
AAML
37
2
0
26 Nov 2023
Trainwreck: A damaging adversarial attack on image classifiers
Jan Zahálka
41
1
0
24 Nov 2023
When Side-Channel Attacks Break the Black-Box Property of Embedded Artificial Intelligence
Benoît Coqueret
Mathieu Carbone
Olivier Sentieys
Gabriel Zaid
63
2
0
23 Nov 2023
Efficient Trigger Word Insertion
Yueqi Zeng
Ziqiang Li
Pengfei Xia
Lei Liu
Bin Li
AAML
30
5
0
23 Nov 2023
Transfer Attacks and Defenses for Large Language Models on Coding Tasks
Chi Zhang
Zifan Wang
Ravi Mangal
Matt Fredrikson
Limin Jia
Corina S. Pasareanu
AAML
SILM
34
1
0
22 Nov 2023
A Survey of Adversarial CAPTCHAs on its History, Classification and Generation
Zisheng Xu
Qiao Yan
Fei Yu
Victor C.M. Leung
AAML
29
1
0
22 Nov 2023
Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection
Renu Sharma
Redwan Sony
Arun Ross
AAML
21
3
0
21 Nov 2023
Token-Level Adversarial Prompt Detection Based on Perplexity Measures and Contextual Information
Zhengmian Hu
Gang Wu
Saayan Mitra
Ruiyi Zhang
Tong Sun
Heng-Chiao Huang
Vishy Swaminathan
37
24
0
20 Nov 2023
Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Guangjing Wang
Ce Zhou
Yuanda Wang
Bocheng Chen
Hanqing Guo
Qiben Yan
AAML
SILM
68
3
0
20 Nov 2023
PACOL: Poisoning Attacks Against Continual Learners
Huayu Li
G. Ditzler
AAML
30
2
0
18 Nov 2023
Formal Verification of Long Short-Term Memory based Audio Classifiers: A Star based Approach
Neelanjana Pal
Taylor T. Johnson
32
0
0
16 Nov 2023
Extending Neural Network Verification to a Larger Family of Piece-wise Linear Activation Functions
László Antal
Hana Masara
Erika Ábrahám
41
0
0
16 Nov 2023
Beyond Detection: Unveiling Fairness Vulnerabilities in Abusive Language Models
Yueqing Liang
Lu Cheng
Ali Payani
Kai Shu
28
3
0
15 Nov 2023
Fast Certification of Vision-Language Models Using Incremental Randomized Smoothing
Ashutosh Nirala
Ameya Joshi
Chinmay Hegde
S. Sarkar
VLM
38
0
0
15 Nov 2023
On The Relationship Between Universal Adversarial Attacks And Sparse Representations
Dana Weitzner
Raja Giryes
AAML
37
0
0
14 Nov 2023
1-Lipschitz Neural Networks are more expressive with N-Activations
Bernd Prach
Christoph H. Lampert
AAML
FAtt
33
0
0
10 Nov 2023
Embedding Space Interpolation Beyond Mini-Batch, Beyond Pairs and Beyond Examples
Shashanka Venkataramanan
Ewa Kijak
Laurent Amsaleg
Yannis Avrithis
36
4
0
09 Nov 2023
SCAAT: Improving Neural Network Interpretability via Saliency Constrained Adaptive Adversarial Training
Rui Xu
Wenkang Qin
Peixiang Huang
Hao Wang
Lin Luo
FAtt
AAML
43
2
0
09 Nov 2023
Deep anytime-valid hypothesis testing
T. Pandeva
Patrick Forré
Aaditya Ramdas
S. Shekhar
40
4
0
30 Oct 2023
Balance, Imbalance, and Rebalance: Understanding Robust Overfitting from a Minimax Game Perspective
Yifei Wang
Liangchen Li
Jiansheng Yang
Zhouchen Lin
Yisen Wang
36
12
0
30 Oct 2023
RAIFLE: Reconstruction Attacks on Interaction-based Federated Learning with Adversarial Data Manipulation
Dzung Pham
Shreyas Kulkarni
Amir Houmansadr
38
0
0
29 Oct 2023
Purify++: Improving Diffusion-Purification with Advanced Diffusion Models and Control of Randomness
Boya Zhang
Weijian Luo
Zhihua Zhang
39
10
0
28 Oct 2023
LipSim: A Provably Robust Perceptual Similarity Metric
Sara Ghazanfari
Alexandre Araujo
Prashanth Krishnamurthy
Farshad Khorrami
Siddharth Garg
48
6
0
27 Oct 2023
Artifact-Robust Graph-Based Learning in Digital Pathology
Saba Heidari Gheshlaghi
Milan Aryal
Nasim Yahyasoltani
Masoud Ganji
OOD
32
0
0
27 Oct 2023
PubDef: Defending Against Transfer Attacks From Public Models
Chawin Sitawarin
Jaewon Chang
David Huang
Wesson Altoyan
David Wagner
AAML
46
6
0
26 Oct 2023
Uncertainty-weighted Loss Functions for Improved Adversarial Attacks on Semantic Segmentation
Kira Maag
Asja Fischer
AAML
SSeg
51
3
0
26 Oct 2023
Codebook Features: Sparse and Discrete Interpretability for Neural Networks
Alex Tamkin
Mohammad Taufeeque
Noah D. Goodman
45
27
0
26 Oct 2023
Segue: Side-information Guided Generative Unlearnable Examples for Facial Privacy Protection in Real World
Zhiling Zhang
Jie Zhang
Kui Zhang
Wenbo Zhou
Weiming Zhang
Neng H. Yu
32
1
0
24 Oct 2023
Theoretically Grounded Loss Functions and Algorithms for Score-Based Multi-Class Abstention
Anqi Mao
M. Mohri
Yutao Zhong
34
23
0
23 Oct 2023
Semantic-Aware Adversarial Training for Reliable Deep Hashing Retrieval
Xu Yuan
Zheng Zhang
Xunguang Wang
Lin Wu
AAML
42
11
0
23 Oct 2023
Diffusion-Based Adversarial Purification for Speaker Verification
Yibo Bai
Ju Liu
Xuelong Li
DiffM
47
2
0
22 Oct 2023
Finite-context Indexing of Restricted Output Space for NLP Models Facing Noisy Input
Minh Nguyen
Nancy F. Chen
30
0
0
21 Oct 2023
Toward Stronger Textual Attack Detectors
Pierre Colombo
Marine Picot
Nathan Noiry
Guillaume Staerman
Pablo Piantanida
69
5
0
21 Oct 2023
Adversarial Image Generation by Spatial Transformation in Perceptual Colorspaces
A. Aydin
A. Temizel
43
4
0
21 Oct 2023
Training Image Derivatives: Increased Accuracy and Universal Robustness
V. Avrutskiy
51
0
0
21 Oct 2023