2301.10226
A Watermark for Large Language Models
24 January 2023
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Jonathan Katz
Ian Miers
Tom Goldstein
VLM
WaLM
Papers citing
"A Watermark for Large Language Models"
50 / 319 papers shown
Title
DeepEclipse: How to Break White-Box DNN-Watermarking Schemes
Alessandro Pegoraro
Carlotta Segna
Kavita Kumari
Ahmad-Reza Sadeghi
AAML
37
0
0
06 Mar 2024
Watermark Stealing in Large Language Models
Nikola Jovanović
Robin Staab
Martin Vechev
WaLM
AAML
40
32
0
29 Feb 2024
Token-Specific Watermarking with Enhanced Detectability and Semantic Coherence for Large Language Models
Mingjia Huo
Sai Ashish Somayajula
Youwei Liang
Ruisi Zhang
F. Koushanfar
Pengtao Xie
WaLM
33
15
0
28 Feb 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
56
17
0
28 Feb 2024
On the Societal Impact of Open Foundation Models
Sayash Kapoor
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Ashwin Ramaswami
...
Victor Storchan
Daniel Zhang
Daniel E. Ho
Percy Liang
Arvind Narayanan
26
54
0
27 Feb 2024
Multi-Bit Distortion-Free Watermarking for Large Language Models
Massieh Kordi Boroujeny
Ya Jiang
Kai Zeng
Brian L. Mark
WaLM
VLM
43
4
0
26 Feb 2024
Data-free Weight Compress and Denoise for Large Language Models
Runyu Peng
Yunhua Zhou
Qipeng Guo
Yang Gao
Hang Yan
Xipeng Qiu
Dahua Lin
39
1
0
26 Feb 2024
No Free Lunch in LLM Watermarking: Trade-offs in Watermarking Design Choices
Qi Pang
Shengyuan Hu
Wenting Zheng
Virginia Smith
WaLM
49
11
0
25 Feb 2024
Detecting Machine-Generated Texts by Multi-Population Aware Optimization for Maximum Mean Discrepancy
Shuhai Zhang
Yiliao Song
Jiahao Yang
Yuanqing Li
Bo Han
Mingkui Tan
DeLMO
37
5
0
25 Feb 2024
Generative Models are Self-Watermarked: Declaring Model Authentication through Re-Generation
Aditya Desu
Xuanli He
Qiongkai Xu
Wei Lu
WIGM
24
1
0
23 Feb 2024
Watermarking Makes Language Models Radioactive
Tom Sander
Pierre Fernandez
Alain Durmus
Matthijs Douze
Teddy Furon
WaLM
41
11
0
22 Feb 2024
Double-I Watermark: Protecting Model Copyright for LLM Fine-tuning
Shen Li
Liuyi Yao
Jinyang Gao
Lan Zhang
Yaliang Li
49
11
0
22 Feb 2024
Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content
Federico Bianchi
James Zou
32
4
0
21 Feb 2024
Generative AI Security: Challenges and Countermeasures
Banghua Zhu
Norman Mu
Jiantao Jiao
David Wagner
AAML
SILM
61
8
0
20 Feb 2024
Copyleft for Alleviating AIGC Copyright Dilemma: What-if Analysis, Public Perception and Implications
Xinwei Guo
Yujun Li
Yafeng Peng
Xuetao Wei
30
2
0
19 Feb 2024
M4GT-Bench: Evaluation Benchmark for Black-Box Machine-Generated Text Detection
Yuxia Wang
Jonibek Mansurov
Petar Ivanov
Jinyan Su
Artem Shelmanov
...
Thomas Arnold
Alham Fikri Aji
Nizar Habash
Iryna Gurevych
Preslav Nakov
DeLMO
28
31
0
17 Feb 2024
Ten Words Only Still Help: Improving Black-Box AI-Generated Text Detection via Proxy-Guided Efficient Re-Sampling
Yuhui Shi
Qiang Sheng
Juan Cao
Hao Mi
Beizhe Hu
Danding Wang
29
13
0
14 Feb 2024
Resilient Watermarking for LLM-Generated Codes
Boquan Li
Mengdi Zhang
Peixin Zhang
Jun Sun
Xingmei Wang
Zijian Liu
Tianzi Zhang
WaLM
38
3
0
12 Feb 2024
Permute-and-Flip: An optimally stable and watermarkable decoder for LLMs
Xuandong Zhao
Lei Li
Yu-Xiang Wang
55
10
0
08 Feb 2024
Copyright Protection in Generative AI: A Technical Perspective
Jie Ren
Han Xu
Pengfei He
Yingqian Cui
Shenglai Zeng
...
Hongzhi Wen
Jiayuan Ding
Hui Liu
Yi Chang
Jiliang Tang
DeLMO
28
31
0
04 Feb 2024
Building Guardrails for Large Language Models
Yizhen Dong
Ronghui Mu
Gao Jin
Yi Qi
Jinwei Hu
Xingyu Zhao
Jie Meng
Wenjie Ruan
Xiaowei Huang
OffRL
61
27
0
02 Feb 2024
LLM-Detector: Improving AI-Generated Chinese Text Detection with Open-Source LLM Instruction Tuning
Rongsheng Wang
Hao Chen
Ruizhe Zhou
Han Ma
Yaofei Duan
Yanlan Kang
Songhua Yang
Baoyu Fan
Tao Tan
DeLMO
39
9
0
02 Feb 2024
Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
Xinlin Peng
Ying Zhou
Xianpei Han
Le Sun
Yingfei Sun
DeLMO
18
11
0
01 Feb 2024
Proactive Detection of Voice Cloning with Localized Watermarking
Robin San Roman
Pierre Fernandez
Alexandre Défossez
Teddy Furon
Tuan Tran
Hady ElSahar
53
41
0
30 Jan 2024
Provably Robust Multi-bit Watermarking for AI-generated Text
Wenjie Qu
Dong Yin
Zixin He
Wei Zou
Tianyang Tao
Jinyuan Jia
Jiaheng Zhang
WaLM
82
2
0
30 Jan 2024
Adaptive Text Watermark for Large Language Models
Yepeng Liu
Yuheng Bu
WaLM
20
18
0
25 Jan 2024
Raidar: geneRative AI Detection viA Rewriting
Chengzhi Mao
Carl Vondrick
Hao Wang
Junfeng Yang
DeLMO
31
23
0
23 Jan 2024
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Abhimanyu Hans
Avi Schwarzschild
Valeriia Cherepanova
Hamid Kazemi
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
DeLMO
44
84
0
22 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Wang
Xin Li
Luisa Verdoliva
Shu Hu
88
58
0
22 Jan 2024
Excuse me, sir? Your language model is leaking (information)
Or Zamir
WaLM
25
5
0
18 Jan 2024
Cross-Attention Watermarking of Large Language Models
Folco Bertini Baldassini
H. Nguyen
Ching-Chung Chang
Isao Echizen
WaLM
19
1
0
12 Jan 2024
Risk Taxonomy, Mitigation, and Assessment Benchmarks of Large Language Model Systems
Tianyu Cui
Yanling Wang
Chuanpu Fu
Yong Xiao
Sijia Li
...
Junwu Xiong
Xinyu Kong
Zujie Wen
Ke Xu
Qi Li
60
56
0
11 Jan 2024
Optimizing watermarks for large language models
Bram Wouters
WaLM
26
4
0
28 Dec 2023
Silent Guardian: Protecting Text from Malicious Exploitation by Large Language Models
Jiawei Zhao
Kejiang Chen
Xianjian Yuan
Yuang Qi
Weiming Zhang
Neng H. Yu
64
8
0
15 Dec 2023
Towards Optimal Statistical Watermarking
Baihe Huang
Hanlin Zhu
Banghua Zhu
Kannan Ramchandran
Michael I. Jordan
Jason D. Lee
Jiantao Jiao
WaLM
39
11
0
13 Dec 2023
AI Control: Improving Safety Despite Intentional Subversion
Ryan Greenblatt
Buck Shlegeris
Kshitij Sachan
Fabien Roger
31
40
0
12 Dec 2023
Performance-lossless Black-box Model Watermarking
Na Zhao
Kejiang Chen
Weiming Zhang
Neng H. Yu
44
1
0
11 Dec 2023
On the Learnability of Watermarks for Language Models
Chenchen Gu
Xiang Lisa Li
Percy Liang
Tatsunori Hashimoto
WaLM
69
32
0
07 Dec 2023
New Evaluation Metrics Capture Quality Degradation due to LLM Watermarking
Karanpartap Singh
James Zou
WaLM
110
9
0
04 Dec 2023
Mark My Words: Analyzing and Evaluating Language Model Watermarks
Julien Piet
Chawin Sitawarin
Vivian Fang
Norman Mu
David Wagner
WaLM
37
33
0
01 Dec 2023
Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Guangjing Wang
Ce Zhou
Yuanda Wang
Bocheng Chen
Hanqing Guo
Qiben Yan
AAML
SILM
68
3
0
20 Nov 2023
AuthentiGPT: Detecting Machine-Generated Text via Black-Box Language Models Denoising
Zhen Guo
Shangdi Yu
DeLMO
34
10
0
13 Nov 2023
Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
Hanlin Zhang
Benjamin L. Edelman
Danilo Francati
Daniele Venturi
G. Ateniese
Boaz Barak
WaLM
138
54
0
07 Nov 2023
Contextual Confidence and Generative AI
Shrey Jain
Zoe Hitzig
Pamela Mishkin
38
5
0
02 Nov 2023
Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions
Luca Longo
Mario Brcic
Federico Cabitza
Jaesik Choi
Roberto Confalonieri
...
Andrés Páez
Wojciech Samek
Johannes Schneider
Timo Speith
Simone Stumpf
32
192
0
30 Oct 2023
Preventing Language Models From Hiding Their Reasoning
Fabien Roger
Ryan Greenblatt
LRM
26
16
0
27 Oct 2023
Publicly-Detectable Watermarking for Language Models
Jaiden Fairoze
Sanjam Garg
Somesh Jha
Saeed Mahloujifar
Mohammad Mahmoody
Mingyuan Wang
WaLM
139
45
0
27 Oct 2023
Wide Flat Minimum Watermarking for Robust Ownership Verification of GANs
Jianwei Fei
Zhihua Xia
B. Tondi
Mauro Barni
AAML
23
4
0
25 Oct 2023
HANSEN: Human and AI Spoken Text Benchmark for Authorship Analysis
Nafis Irtiza Tripto
Adaku Uchendu
Thai V. Le
Mattia Setzu
F. Giannotti
Dongwon Lee
DeLMO
31
6
0
25 Oct 2023
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey
Soumya Suvra Ghosal
Souradip Chakraborty
Jonas Geiping
Furong Huang
Dinesh Manocha
Amrit Singh Bedi
DeLMO
38
33
0
23 Oct 2023