Imitation Attacks and Defenses for Black-box Machine Translation Systems

30 April 2020

Papers citing "Imitation Attacks and Defenses for Black-box Machine Translation Systems"

30 / 30 papers shown

Title
Attack and defense techniques in large language models: A survey and new perspectives Zhiyu Liao Kang Chen Yuanguo Lin Kangkang Li Yunxuan Liu Hefeng Chen Xingwang Huang Yuanhui Yu AAML 59 0 0 02 May 2025
StyleRec: A Benchmark Dataset for Prompt Recovery in Writing Style Transformation Shenyang Liu Yang Gao Shaoyan Zhai Liqiang Wang 37 0 0 06 Apr 2025
Revisiting character-level adversarial attacks Elias Abad Rocamora Yongtao Wu Fanghui Liu Grigorios G. Chrysos V. Cevher AAML 39 3 0 07 May 2024
ModelShield: Adaptive and Robust Watermark against Model Extraction Attack Kaiyi Pang Tao Qi Chuhan Wu Minhao Bai Minghu Jiang Yongfeng Huang AAML WaLM 72 2 0 03 May 2024
Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation Aditya Desu Xuanli He Qiongkai Xu Wei Lu WIGM 32 1 0 23 Feb 2024
Watermarking Makes Language Models Radioactive Tom Sander Pierre Fernandez Alain Durmus Matthijs Douze Teddy Furon WaLM 41 11 0 22 Feb 2024
Stolen Subwords: Importance of Vocabularies for Machine Translation Model Stealing Vilém Zouhar AAML 40 0 0 29 Jan 2024
Anchor Points: Benchmarking Models with Much Fewer Examples Rajan Vivek Kawin Ethayarajh Diyi Yang Douwe Kiela ALM 29 22 0 14 Sep 2023
A Classification-Guided Approach for Adversarial Attacks against Neural Machine Translation Sahar Sadrizadeh Ljiljana Dolamic P. Frossard AAML SILM 44 2 0 29 Aug 2023
Isolation and Induction: Training Robust Deep Neural Networks against Model Stealing Attacks Jun Guo Aishan Liu Xingyu Zheng Siyuan Liang Yisong Xiao Yichao Wu Xianglong Liu AAML 38 12 0 02 Aug 2023
Make Text Unlearnable: Exploiting Effective Patterns to Protect Personal Data Xinzhe Li Ming Liu Shang Gao MU 37 8 0 02 Jul 2023
The False Promise of Imitating Proprietary LLMs Arnav Gudibande Eric Wallace Charles Burton Snell Xinyang Geng Hao Liu Pieter Abbeel Sergey Levine Dawn Song ALM 44 198 0 25 May 2023
Iterative Adversarial Attack on Image-guided Story Ending Generation Youze Wang Wenbo Hu Richang Hong 36 3 0 16 May 2023
GrOVe: Ownership Verification of Graph Neural Networks using Embeddings Asim Waheed Vasisht Duddu Nadarajah Asokan 35 9 0 17 Apr 2023
TransFool: An Adversarial Attack against Neural Machine Translation Models Sahar Sadrizadeh Ljiljana Dolamic P. Frossard SILM AAML 39 12 0 02 Feb 2023
SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control Xiaochuang Han Sachin Kumar Yulia Tsvetkov 45 79 0 31 Oct 2022
CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks Xuanli He Qiongkai Xu Yi Zeng Lingjuan Lyu Fangzhao Wu Jiwei Li R. Jia WaLM 188 72 0 19 Sep 2022
Gradient-Based Constrained Sampling from Language Models Sachin Kumar Biswajit Paria Yulia Tsvetkov BDL 32 53 0 25 May 2022
Mix and Match: Learning-free Controllable Text Generation using Energy Language Models Fatemehsadat Mireshghallah Kartik Goyal Taylor Berg-Kirkpatrick 36 78 0 24 Mar 2022
Distinguishing Non-natural from Natural Adversarial Samples for More Robust Pre-trained Language Model Jiayi Wang Rongzhou Bao Zhuosheng Zhang Hai Zhao AAML 29 4 0 19 Mar 2022
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark Xuanli He Qiongkai Xu Lingjuan Lyu Fangzhao Wu Chenguang Wang WaLM 177 95 0 05 Dec 2021
Demystifying the Transferability of Adversarial Attacks in Computer Networks Ehsan Nowroozi Yassine Mekdad Mohammad Hajian Berenjestanaki Mauro Conti Abdeslam El Fergougui AAML 40 32 0 09 Oct 2021
Student Surpasses Teacher: Imitation Attack for Black-Box NLP APIs Qiongkai Xu Xuanli He Lingjuan Lyu Lizhen Qu Gholamreza Haffari MLAU 40 22 0 29 Aug 2021
Controlled Text Generation as Continuous Optimization with Multiple Constraints Sachin Kumar Eric Malmi Aliaksei Severyn Yulia Tsvetkov BDL AI4CE 43 76 0 04 Aug 2021
Hidden Backdoors in Human-Centric Language Models Shaofeng Li Hui Liu Tian Dong Benjamin Zi Hao Zhao Minhui Xue Haojin Zhu Jialiang Lu SILM 35 147 0 01 May 2021
Domain Adaptation and Multi-Domain Adaptation for Neural Machine Translation: A Survey Danielle Saunders AI4CE 25 85 0 14 Apr 2021
FUDGE: Controlled Text Generation With Future Discriminators Kevin Kaichuang Yang Dan Klein 39 313 0 12 Apr 2021
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable! Xuanli He Lingjuan Lyu Qiongkai Xu Lichao Sun MIACV SILM 33 90 0 18 Mar 2021
Certified Robustness to Adversarial Word Substitutions Robin Jia Aditi Raghunathan Kerem Göksel Percy Liang AAML 188 291 0 03 Sep 2019
A causal framework for explaining the predictions of black-box sequence-to-sequence models David Alvarez-Melis Tommi Jaakkola CML 232 200 0 06 Jul 2017