ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 1805.02917
  4. Cited By
Interpretable Adversarial Perturbation in Input Embedding Space for Text

Interpretable Adversarial Perturbation in Input Embedding Space for Text

8 May 2018
Motoki Sato
Jun Suzuki
Hiroyuki Shindo
Yuji Matsumoto
ArXivPDFHTML

Papers citing "Interpretable Adversarial Perturbation in Input Embedding Space for Text"

37 / 37 papers shown
Title
LLMScan: Causal Scan for LLM Misbehavior Detection
LLMScan: Causal Scan for LLM Misbehavior Detection
Mengdi Zhang
Kai Kiat Goh
Peixin Zhang
Jun Sun
Rose Lin Xin
Hongyu Zhang
28
0
0
22 Oct 2024
Obfuscating IoT Device Scanning Activity via Adversarial Example
  Generation
Obfuscating IoT Device Scanning Activity via Adversarial Example Generation
Haocong Li
Yaxin Zhang
Long Cheng
Wenjia Niu
Haining Wang
Qiang Li
AAML
41
0
0
17 Jun 2024
SATO: Stable Text-to-Motion Framework
SATO: Stable Text-to-Motion Framework
Wenshuo Chen
Hongru Xiao
Erhang Zhang
Lijie Hu
Lei Wang
Mengyuan Liu
Chong Chen
47
5
0
02 May 2024
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods
Semantic Stealth: Adversarial Text Attacks on NLP Using Several Methods
Roopkatha Dey
Aivy Debnath
Sayak Kumar Dutta
Kaustav Ghosh
Arijit Mitra
Arghya Roy Chowdhury
Jaydip Sen
AAML
SILM
29
1
0
08 Apr 2024
Round Trip Translation Defence against Large Language Model Jailbreaking Attacks
Round Trip Translation Defence against Large Language Model Jailbreaking Attacks
Canaan Yung
H. M. Dolatabadi
S. Erfani
Christopher Leckie
AAML
64
5
0
21 Feb 2024
Deep Learning-based Sentiment Classification: A Comparative Survey
Deep Learning-based Sentiment Classification: A Comparative Survey
Mohammed Kayed
R. Redondo
Alhassan Mabrouk
17
42
0
12 Dec 2023
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Hyperpolyglot LLMs: Cross-Lingual Interpretability in Token Embeddings
Andrea W Wen-Yi
David Mimno
36
15
0
29 Nov 2023
Modeling Adversarial Attack on Pre-trained Language Models as Sequential
  Decision Making
Modeling Adversarial Attack on Pre-trained Language Models as Sequential Decision Making
Xuanjie Fang
Sijie Cheng
Yang Liu
Wen Wang
AAML
42
9
0
27 May 2023
ESimCSE Unsupervised Contrastive Learning Jointly with UDA
  Semi-Supervised Learning for Large Label System Text Classification Mode
ESimCSE Unsupervised Contrastive Learning Jointly with UDA Semi-Supervised Learning for Large Label System Text Classification Mode
Ruan Lu
Zhou Hangcheng
Ran Meng
Zhao Jin
Qin JiaoYu
Wei Feng
Wang ChenZi
40
0
0
19 Apr 2023
Learning video embedding space with Natural Language Supervision
Learning video embedding space with Natural Language Supervision
P. Uppala
Abhishek Bamotra
S. Priya
Vaidehi Joshi
CLIP
23
1
0
25 Mar 2023
SEAT: Stable and Explainable Attention
SEAT: Stable and Explainable Attention
Lijie Hu
Yixin Liu
Ninghao Liu
Mengdi Huai
Lichao Sun
Di Wang
OOD
32
18
0
23 Nov 2022
Character-level White-Box Adversarial Attacks against Transformers via
  Attachable Subwords Substitution
Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution
Aiwei Liu
Honghai Yu
Xuming Hu
Shuang Li
Li Lin
Fukun Ma
Yawen Yang
Lijie Wen
41
33
0
31 Oct 2022
Software Testing for Machine Learning
Software Testing for Machine Learning
D. Marijan
A. Gotlieb
AAML
22
27
0
30 Apr 2022
Interpretation of Black Box NLP Models: A Survey
Interpretation of Black Box NLP Models: A Survey
Shivani Choudhary
N. Chatterjee
S. K. Saha
FAtt
34
10
0
31 Mar 2022
Multilingual Text Classification for Dravidian Languages
Multilingual Text Classification for Dravidian Languages
Xiaotian Lin
Nankai Lin
Kanoksak Wattanachote
Shengyi Jiang
Lianxi Wang
69
3
0
03 Dec 2021
Effective and Imperceptible Adversarial Textual Attack via
  Multi-objectivization
Effective and Imperceptible Adversarial Textual Attack via Multi-objectivization
Shengcai Liu
Ning Lu
W. Hong
Chao Qian
Ke Tang
AAML
22
15
0
02 Nov 2021
Adversarial Attacks and Defenses for Social Network Text Processing
  Applications: Techniques, Challenges and Future Research Directions
Adversarial Attacks and Defenses for Social Network Text Processing Applications: Techniques, Challenges and Future Research Directions
I. Alsmadi
Kashif Ahmad
Mahmoud Nazzal
Firoj Alam
Ala I. Al-Fuqaha
Abdallah Khreishah
A. Algosaibi
AAML
37
16
0
26 Oct 2021
Interpreting Deep Learning Models in Natural Language Processing: A
  Review
Interpreting Deep Learning Models in Natural Language Processing: A Review
Xiaofei Sun
Diyi Yang
Xiaoya Li
Tianwei Zhang
Yuxian Meng
Han Qiu
Guoyin Wang
Eduard H. Hovy
Jiwei Li
19
45
0
20 Oct 2021
TREATED:Towards Universal Defense against Textual Adversarial Attacks
TREATED:Towards Universal Defense against Textual Adversarial Attacks
Bin Zhu
Zhaoquan Gu
Le Wang
Zhihong Tian
AAML
36
8
0
13 Sep 2021
Towards Robustness Against Natural Language Word Substitutions
Towards Robustness Against Natural Language Word Substitutions
Xinshuai Dong
A. Luu
Rongrong Ji
Hong Liu
SILM
AAML
38
113
0
28 Jul 2021
Defending Against Backdoor Attacks in Natural Language Generation
Defending Against Backdoor Attacks in Natural Language Generation
Xiaofei Sun
Xiaoya Li
Yuxian Meng
Xiang Ao
Fei Wu
Jiwei Li
Tianwei Zhang
AAML
SILM
31
47
0
03 Jun 2021
Making Attention Mechanisms More Robust and Interpretable with Virtual
  Adversarial Training
Making Attention Mechanisms More Robust and Interpretable with Virtual Adversarial Training
Shunsuke Kitada
Hitoshi Iyatomi
AAML
28
8
0
18 Apr 2021
Developing Future Human-Centered Smart Cities: Critical Analysis of
  Smart City Security, Interpretability, and Ethical Challenges
Developing Future Human-Centered Smart Cities: Critical Analysis of Smart City Security, Interpretability, and Ethical Challenges
Kashif Ahmad
Majdi Maabreh
M. Ghaly
Khalil Khan
Junaid Qadir
Ala I. Al-Fuqaha
27
142
0
14 Dec 2020
Adversarial Black-Box Attacks On Text Classifiers Using Multi-Objective
  Genetic Optimization Guided By Deep Networks
Adversarial Black-Box Attacks On Text Classifiers Using Multi-Objective Genetic Optimization Guided By Deep Networks
Alex Mathai
Shreya Khare
Srikanth G. Tamilselvam
Senthil Mani
AAML
36
6
0
08 Nov 2020
Adversarial Attack and Defense of Structured Prediction Models
Adversarial Attack and Defense of Structured Prediction Models
Wenjuan Han
Liwen Zhang
Yong-jia Jiang
Kewei Tu
AAML
12
37
0
04 Oct 2020
Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood
  Ensemble
Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble
Yi Zhou
Xiaoqing Zheng
Cho-Jui Hsieh
Kai-Wei Chang
Xuanjing Huang
SILM
39
48
0
20 Jun 2020
Differentiable Language Model Adversarial Attacks on Categorical
  Sequence Classifiers
Differentiable Language Model Adversarial Attacks on Categorical Sequence Classifiers
I. Fursov
A. Zaytsev
Nikita Klyuchnikov
A. Kravchenko
E. Burnaev
AAML
SILM
31
5
0
19 Jun 2020
Adversarial Attacks and Defenses: An Interpretation Perspective
Adversarial Attacks and Defenses: An Interpretation Perspective
Ninghao Liu
Mengnan Du
Ruocheng Guo
Huan Liu
Xia Hu
AAML
31
8
0
23 Apr 2020
Generating Natural Language Adversarial Examples on a Large Scale with
  Generative Models
Generating Natural Language Adversarial Examples on a Large Scale with Generative Models
Yankun Ren
J. Lin
Siliang Tang
Jun Zhou
Shuang Yang
Yuan Qi
Xiang Ren
GAN
AAML
SILM
32
21
0
10 Mar 2020
Adv-BERT: BERT is not robust on misspellings! Generating nature
  adversarial samples on BERT
Adv-BERT: BERT is not robust on misspellings! Generating nature adversarial samples on BERT
Lichao Sun
Kazuma Hashimoto
Wenpeng Yin
Akari Asai
Jia Li
Philip Yu
Caiming Xiong
SILM
AAML
12
101
0
27 Feb 2020
Adversarial Robustness for Code
Adversarial Robustness for Code
Pavol Bielik
Martin Vechev
AAML
22
89
0
11 Feb 2020
Improving Machine Reading Comprehension via Adversarial Training
Improving Machine Reading Comprehension via Adversarial Training
Ziqing Yang
Yiming Cui
Wanxiang Che
Ting Liu
Shijin Wang
Guoping Hu
27
17
0
09 Nov 2019
Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction
  to Concepts and Methods
Aleatoric and Epistemic Uncertainty in Machine Learning: An Introduction to Concepts and Methods
Eyke Hüllermeier
Willem Waegeman
PER
UD
87
1,355
0
21 Oct 2019
Say What I Want: Towards the Dark Side of Neural Dialogue Models
Say What I Want: Towards the Dark Side of Neural Dialogue Models
Haochen Liu
Tyler Derr
Zitao Liu
Jiliang Tang
31
16
0
13 Sep 2019
Securing Connected & Autonomous Vehicles: Challenges Posed by
  Adversarial Machine Learning and The Way Forward
Securing Connected & Autonomous Vehicles: Challenges Posed by Adversarial Machine Learning and The Way Forward
A. Qayyum
Muhammad Usama
Junaid Qadir
Ala I. Al-Fuqaha
AAML
24
187
0
29 May 2019
Low Resource Text Classification with ULMFit and Backtranslation
Low Resource Text Classification with ULMFit and Backtranslation
Sam Shleifer
VLM
19
57
0
21 Mar 2019
Adversarial Attacks on Deep Learning Models in Natural Language
  Processing: A Survey
Adversarial Attacks on Deep Learning Models in Natural Language Processing: A Survey
W. Zhang
Quan Z. Sheng
A. Alhazmi
Chenliang Li
AAML
24
57
0
21 Jan 2019
1