Papers
Communities
Events
Blog
Pricing
Search
Open menu
Home
Papers
2301.10226
Cited By
A Watermark for Large Language Models
24 January 2023
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Jonathan Katz
Ian Miers
Tom Goldstein
VLM
WaLM
Re-assign community
ArXiv
PDF
HTML
Papers citing
"A Watermark for Large Language Models"
19 / 319 papers shown
Title
Language Model Behavior: A Comprehensive Survey
Tyler A. Chang
Benjamin Bergen
VLM
LRM
LM&MA
27
103
0
20 Mar 2023
A Recipe for Watermarking Diffusion Models
Yunqing Zhao
Tianyu Pang
Chao Du
Xiao Yang
Ngai-man Cheung
Min-Bin Lin
WIGM
30
115
0
17 Mar 2023
A Pathway Towards Responsible AI Generated Content
Chen Chen
Jie Fu
Lingjuan Lyu
49
71
0
02 Mar 2023
On pitfalls (and advantages) of sophisticated large language models
A. Strasser
27
14
0
25 Feb 2023
How Generative AI models such as ChatGPT can be (Mis)Used in SPC Practice, Education, and Research? An Exploratory Study
F. Megahed
Ying-Ju Chen
Joshua A. Ferris
S. Knoth
L. A. Jones‐Farmer
47
117
0
17 Feb 2023
Auditing large language models: a three-layered approach
Jakob Mokander
Jonas Schuett
Hannah Rose Kirk
Luciano Floridi
AILaw
MLAU
48
196
0
16 Feb 2023
Raising the Cost of Malicious AI-Powered Image Editing
Hadi Salman
Alaa Khaddaj
Guillaume Leclerc
Andrew Ilyas
A. Madry
DiffM
28
109
0
13 Feb 2023
A Categorical Archive of ChatGPT Failures
Ali Borji
ELM
35
379
0
06 Feb 2023
The Gradient of Generative AI Release: Methods and Considerations
Irene Solaiman
33
98
0
05 Feb 2023
Regulating ChatGPT and other Large Generative AI Models
P. Hacker
A. Engel
M. Mauer
AILaw
29
328
0
05 Feb 2023
The Science of Detecting LLM-Generated Texts
Ruixiang Tang
Yu-Neng Chuang
Xia Hu
DeLMO
42
169
0
04 Feb 2023
Red teaming ChatGPT via Jailbreaking: Bias, Robustness, Reliability and Toxicity
Terry Yue Zhuo
Yujin Huang
Chunyang Chen
Zhenchang Xing
SILM
36
102
0
30 Jan 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
29
582
0
26 Jan 2023
Attribution and Obfuscation of Neural Text Authorship: A Data Mining Perspective
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
32
41
0
19 Oct 2022
CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks
Xuanli He
Qiongkai Xu
Yi Zeng
Lingjuan Lyu
Fangzhao Wu
Jiwei Li
R. Jia
WaLM
188
72
0
19 Sep 2022
Data Feedback Loops: Model-driven Amplification of Dataset Biases
Rohan Taori
Tatsunori B. Hashimoto
74
43
0
08 Sep 2022
Training language models to follow instructions with human feedback
Long Ouyang
Jeff Wu
Xu Jiang
Diogo Almeida
Carroll L. Wainwright
...
Amanda Askell
Peter Welinder
Paul Christiano
Jan Leike
Ryan J. Lowe
OSLM
ALM
339
12,003
0
04 Mar 2022
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark
Xuanli He
Qiongkai Xu
Lingjuan Lyu
Fangzhao Wu
Chenguang Wang
WaLM
177
95
0
05 Dec 2021
Deep Serial Number: Computational Watermarking for DNN Intellectual Property Protection
Ruixiang Tang
Mengnan Du
Xia Hu
38
3
0
17 Nov 2020
Previous
1
2
3
4
5
6
7