Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.10226
Cited By
v1
v2
v3
v4 (latest)
A Watermark for Large Language Models
24 January 2023
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Jonathan Katz
Ian Miers
Tom Goldstein
VLM
WaLM
Re-assign community
ArXiv (abs)
PDF
HTML
Papers citing
"A Watermark for Large Language Models"
50 / 120 papers shown
Title
Bridging Today and the Future of Humanity: AI Safety in 2024 and Beyond
Shanshan Han
173
1
0
09 Oct 2024
Non-Halting Queries: Exploiting Fixed Points in LLMs
Ghaith Hammouri
Kemal Derya
B. Sunar
72
0
0
08 Oct 2024
Image Watermarks are Removable Using Controllable Regeneration from Clean Noise
Yepeng Liu
Yiren Song
Hai Ci
Yu Zhang
Haofan Wang
Mike Zheng Shou
Yuheng Bu
WIGM
118
7
0
07 Oct 2024
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
Ximing Lu
Melanie Sclar
Skyler Hallinan
Niloofar Mireshghallah
Jiacheng Liu
...
Allyson Ettinger
Liwei Jiang
Khyathi Chandu
Nouha Dziri
Yejin Choi
DeLMO
91
16
0
05 Oct 2024
Can Watermarked LLMs be Identified by Users via Crafted Prompts?
Aiwei Liu
Sheng Guan
Yang Liu
Leyi Pan
Yifei Zhang
Liancheng Fang
Lijie Wen
Philip S. Yu
Xuming Hu
WaLM
388
5
0
04 Oct 2024
Ward: Provable RAG Dataset Inference via LLM Watermarks
Nikola Jovanović
Robin Staab
Maximilian Baader
Martin Vechev
466
5
0
04 Oct 2024
Efficiently Identifying Watermarked Segments in Mixed-Source Texts
Xuandong Zhao
Chenwen Liao
Yu-Xiang Wang
Lei Li
WaLM
100
1
0
04 Oct 2024
A Watermark for Black-Box Language Models
Dara Bahri
John Wieting
WaLM
151
6
0
02 Oct 2024
Membership Inference Attacks Cannot Prove that a Model Was Trained On Your Data
Jie Zhang
Debeshee Das
Gautam Kamath
Florian Tramèr
MIALM
MIACV
309
27
1
29 Sep 2024
Adaptive and Robust Watermark for Generative Tabular Data
Dung Daniel Ngo
Daniel Scott
Saheed O. Obitayo
Archan Ray
Akshay Seshadri
N. Kumar
Vamsi K. Potluru
Marco Pistoia
Manuela Veloso
AAML
96
1
0
23 Sep 2024
Measuring Human Contribution in AI-Assisted Content Generation
Yueqi Xie
Tao Qi
Jingwei Yi
Ryan Whalen
Junming Huang
Qian Ding
Yu Xie
Xing Xie
Fangzhao Wu
Fangzhao Wu
121
2
0
27 Aug 2024
Watermark Smoothing Attacks against Language Models
Hongyan Chang
Hamed Hassani
Reza Shokri
WaLM
141
3
0
19 Jul 2024
Hey, That's My Model! Introducing Chain & Hash, An LLM Fingerprinting Technique
M. Russinovich
Ahmed Salem
156
13
0
15 Jul 2024
Is Contrasting All You Need? Contrastive Learning for the Detection and Attribution of AI-generated Text
Lucio La Cava
Davide Costa
Andrea Tagarelli
DeLMO
110
5
0
12 Jul 2024
Waterfall: Framework for Robust and Scalable Text Watermarking
Gregory Kang Ruey Lau
Xinyuan Niu
Hieu Dao
Jiangwei Chen
Chuan-Sheng Foo
Bryan Kian Hsiang Low
WaLM
82
6
0
05 Jul 2024
Detecting AI-Generated Text: Factors Influencing Detectability with Current Methods
Kathleen C. Fraser
Hillary Dawkins
S. Kiritchenko
DeLMO
154
13
0
21 Jun 2024
CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models
Yuetai Li
Zhangchen Xu
Fengqing Jiang
Luyao Niu
D. Sahabandu
Bhaskar Ramasubramanian
Radha Poovendran
SILM
AAML
122
10
0
18 Jun 2024
Watermarking Language Models with Error Correcting Codes
Patrick Chao
Yan Sun
Edgar Dobriban
Hamed Hassani
WaLM
176
4
0
12 Jun 2024
Black-Box Detection of Language Model Watermarks
Thibaud Gloaguen
Nikola Jovanović
Robin Staab
Martin Vechev
80
7
0
28 May 2024
Securing the Future of GenAI: Policy and Technology
Mihai Christodorescu
Craven
Soheil Feizi
Neil Zhenqiang Gong
Mia Hoffmann
...
Jessica Newman
Emelia Probasco
Yanjun Qi
Khawaja Shams
Turek
SILM
99
6
0
21 May 2024
MarkLLM: An Open-Source Toolkit for LLM Watermarking
Leyi Pan
Aiwei Liu
Zhiwei He
Zitian Gao
Xuandong Zhao
...
Shuliang Liu
Xuming Hu
Lijie Wen
Irwin King
Philip S. Yu
138
37
0
16 May 2024
Stylometric Watermarks for Large Language Models
Georg Niess
Roman Kern
78
3
0
14 May 2024
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore
Junchao Wu
Runzhe Zhan
Derek F. Wong
Shu Yang
Xuebo Liu
Lidia S. Chao
Min Zhang
DeLMO
123
5
0
07 May 2024
ModelShield: Adaptive and Robust Watermark against Model Extraction Attack
Kaiyi Pang
Tao Qi
Chuhan Wu
Minhao Bai
Minghu Jiang
Yongfeng Huang
AAML
WaLM
166
5
0
03 May 2024
LLMs for Cyber Security: New Opportunities
D. Divakaran
Sai Teja Peddinti
86
11
0
17 Apr 2024
ProMark: Proactive Diffusion Watermarking for Causal Attribution
Vishal Asnani
John Collomosse
Tu Bui
Xiaoming Liu
S. Agarwal
WIGM
DiffM
148
15
0
14 Mar 2024
Learning to Watermark LLM-generated Text via Reinforcement Learning
Xiaojun Xu
Yuanshun Yao
Yang Liu
94
14
0
13 Mar 2024
WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off
Eva Giboulot
Furon Teddy
WaLM
79
24
0
06 Mar 2024
On the Challenges and Opportunities in Generative AI
Laura Manduchi
Kushagra Pandey
Robert Bamler
Ryan Cotterell
Sina Daubener
...
F. Wenzel
Frank Wood
Stephan Mandt
Vincent Fortuin
Vincent Fortuin
288
22
0
28 Feb 2024
On the Societal Impact of Open Foundation Models
Sayash Kapoor
Rishi Bommasani
Kevin Klyman
Shayne Longpre
Ashwin Ramaswami
...
Victor Storchan
Daniel Zhang
Daniel E. Ho
Percy Liang
Arvind Narayanan
79
60
0
27 Feb 2024
Data-free Weight Compress and Denoise for Large Language Models
Runyu Peng
Yunhua Zhou
Qipeng Guo
Yang Gao
Hang Yan
Xipeng Qiu
Dahua Lin
160
1
0
26 Feb 2024
Large Language Models are Vulnerable to Bait-and-Switch Attacks for Generating Harmful Content
Federico Bianchi
James Zou
74
5
0
21 Feb 2024
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
Abhimanyu Hans
Avi Schwarzschild
Valeriia Cherepanova
Hamid Kazemi
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
DeLMO
104
107
0
22 Jan 2024
Detecting Multimedia Generated by Large AI Models: A Survey
Li Lin
Neeraj Gupta
Yue Zhang
Hainan Ren
Chun-Hao Liu
Feng Ding
Xin Eric Wang
Xin Li
Luisa Verdoliva
Shu Hu
221
64
0
22 Jan 2024
Cross-Attention Watermarking of Large Language Models
Folco Bertini Baldassini
H. Nguyen
Ching-Chung Chang
Isao Echizen
WaLM
48
2
0
12 Jan 2024
Performance-lossless Black-box Model Watermarking
Na Zhao
Kejiang Chen
Weiming Zhang
Neng H. Yu
90
3
0
11 Dec 2023
Beyond Boundaries: A Comprehensive Survey of Transferable Attacks on AI Systems
Guangjing Wang
Ce Zhou
Yuanda Wang
Bocheng Chen
Hanqing Guo
Qiben Yan
AAML
SILM
137
3
0
20 Nov 2023
Watermarks in the Sand: Impossibility of Strong Watermarking for Generative Models
Hanlin Zhang
Benjamin L. Edelman
Danilo Francati
Daniele Venturi
G. Ateniese
Boaz Barak
WaLM
263
64
0
07 Nov 2023
Explainable Artificial Intelligence (XAI) 2.0: A Manifesto of Open Challenges and Interdisciplinary Research Directions
Luca Longo
Mario Brcic
Federico Cabitza
Jaesik Choi
Roberto Confalonieri
...
Andrés Páez
Wojciech Samek
Johannes Schneider
Timo Speith
Simone Stumpf
152
226
0
30 Oct 2023
Publicly-Detectable Watermarking for Language Models
Jaiden Fairoze
Sanjam Garg
Somesh Jha
Saeed Mahloujifar
Mohammad Mahmoody
Mingyuan Wang
WaLM
206
51
0
27 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
83
33
0
23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
103
33
0
23 Oct 2023
Necessary and Sufficient Watermark for Large Language Models
Yuki Takezawa
Ryoma Sato
Han Bao
Kenta Niwa
Makoto Yamada
WaLM
135
8
0
02 Oct 2023
From Text to Source: Results in Detecting Large Language Model-Generated Content
Wissam Antoun
Benoît Sagot
Djamé Seddah
DeLMO
85
13
0
23 Sep 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
83
4
0
07 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
58
0
0
02 Sep 2023
SAKSHI: Decentralized AI Platforms
S. Bhat
Canhui Chen
Zerui Cheng
Zhixuan Fang
Ashwin Hebbar
...
Ranvir Rana
Peiyao Sheng
Himanshu Tyagi
Pramod Viswanath
Xuechao Wang
30
4
0
31 Jul 2023
On the application of Large Language Models for language teaching and assessment technology
Andrew Caines
Luca Benedetto
Shiva Taslimipoor
Christopher Davis
Yuan Gao
...
Marek Rei
H. Yannakoudakis
Andrew Mullooly
D. Nicholls
P. Buttery
ELM
70
48
0
17 Jul 2023
SentimentGPT: Exploiting GPT for Advanced Sentiment Analysis and its Departure from Current Machine Learning
Kiana Kheiri
Hamid Karimi
100
78
0
16 Jul 2023
ChatGPT in the Age of Generative AI and Large Language Models: A Concise Survey
S. Mohamadi
Ghulam Mujtaba
Ngan Le
Gianfranco Doretto
Don Adjeroh
LM&MA
AI4MH
113
21
0
09 Jul 2023
Previous
1
2
3
Next