2301.10226
Cited By
A Watermark for Large Language Models
24 January 2023
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Jonathan Katz
Ian Miers
Tom Goldstein
VLM
WaLM
Papers citing "A Watermark for Large Language Models"
50 / 319 papers shown
Title
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
Sicheng Zhu
Ruiyi Zhang
Bang An
Gang Wu
Joe Barrow
Zichao Wang
Furong Huang
A. Nenkova
Tong Sun
SILM
AAML
30
40
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
28
30
0
23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
29
23
0
23 Oct 2023
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
Ruisi Zhang
Shehzeen Samarah Hussain
Paarth Neekhara
F. Koushanfar
31
27
0
18 Oct 2023
Watermarking LLMs with Weight Quantization
Linyang Li
Botian Jiang
Pengyu Wang
Ke Ren
Hang Yan
Xipeng Qiu
MQ
WaLM
13
11
0
17 Oct 2023
Data Contamination Through the Lens of Time
Manley Roberts
Himanshu Thakur
Christine Herlihy
Colin White
Samuel Dooley
84
31
0
16 Oct 2023
Embarrassingly Simple Text Watermarks
Ryoma Sato
Yuki Takezawa
Han Bao
Kenta Niwa
Makoto Yamada
WaLM
26
14
0
13 Oct 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang
Linyang Li
Ke Ren
Botian Jiang
Dong Zhang
Xipeng Qiu
DeLMO
23
50
0
13 Oct 2023
A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
Yihan Wu
Zhengmian Hu
Junfeng Guo
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
25
21
0
11 Oct 2023
A Semantic Invariant Robust Watermark for Large Language Models
Aiwei Liu
Leyi Pan
Xuming Hu
Shiao Meng
Lijie Wen
WaLM
42
55
0
10 Oct 2023
On the Zero-Shot Generalization of Machine-Generated Text Detectors
Xiao Pu
Jingyu Zhang
Xiaochuang Han
Yulia Tsvetkov
Tianxing He
DeLMO
36
14
0
08 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
26
130
0
08 Oct 2023
Zero-Shot Detection of Machine-Generated Codes
Xianjun Yang
Kexun Zhang
Haifeng Chen
Linda R. Petzold
William Yang Wang
Wei Cheng
DeLMO
26
11
0
08 Oct 2023
How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
Tharindu Kumarage
Paras Sheth
Raha Moraffah
Joshua Garland
Huan Liu
DeLMO
28
23
0
08 Oct 2023
AI Regulation in Europe: From the AI Act to Future Regulatory Challenges
Philipp Hacker
23
8
0
06 Oct 2023
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Abe Bohan Hou
Jingyu Zhang
Tianxing He
Yichen Wang
Yung-Sung Chuang
Hongwei Wang
Lingfeng Shen
Benjamin Van Durme
Daniel Khashabi
Yulia Tsvetkov
WaLM
34
0
0
06 Oct 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
On the Generalization of Training-based ChatGPT Detection Methods
Han Xu
Jie Ren
Pengfei He
Shenglai Zeng
Yingqian Cui
Amy Liu
Hui Liu
Jiliang Tang
DeLMO
29
13
0
02 Oct 2023
Mirror Diffusion Models for Constrained and Watermarked Generation
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
Molei Tao
DiffM
18
22
0
02 Oct 2023
Necessary and Sufficient Watermark for Large Language Models
Yuki Takezawa
Ryoma Sato
Han Bao
Kenta Niwa
Makoto Yamada
WaLM
50
7
0
02 Oct 2023
WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data
Jingtan Wang
Xinyang Lu
Zitong Zhao
Zhongxiang Dai
Chuan-Sheng Foo
See-Kiong Ng
K. H. Low
WaLM
57
14
0
01 Oct 2023
Can LLM-Generated Misinformation Be Detected?
Canyu Chen
Kai Shu
DeLMO
39
158
0
25 Sep 2023
From Text to Source: Results in Detecting Large Language Model-Generated Content
Wissam Antoun
Benoît Sagot
Djamé Seddah
DeLMO
33
11
0
23 Sep 2023
TOPFORMER: Topology-Aware Authorship Attribution of Deepfake Texts with Diverse Writing Styles
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
24
3
0
22 Sep 2023
Unbiased Watermark for Large Language Models
Zhengmian Hu
Lichang Chen
Xidong Wu
Yihan Wu
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
38
45
0
22 Sep 2023
A Statistical Turing Test for Generative Models
Hayden Helm
Carey E. Priebe
Weiwei Yang
DeLMO
24
7
0
16 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Arkadiy Saakyan
Smaranda Muresan
23
3
0
15 Sep 2023
Detecting ChatGPT: A Survey of the State of Detecting ChatGPT-Generated Text
Mahdi Dhaini
Wessel Poelman
Ege Erdogan
DeLMO
49
12
0
14 Sep 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
35
4
0
07 Sep 2023
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
Tharindu Kumarage
Amrita Bhattacharjee
Djordje Padejski
Kristy Roschke
Dan Gillmor
Scott W. Ruston
Huan Liu
Joshua Garland
DeLMO
21
9
0
06 Sep 2023
Do You Trust ChatGPT? -- Perceived Credibility of Human and AI-Generated Content
Martin Huschens
Martin Briesch
Dominik Sobania
Franz Rothlauf
13
12
0
05 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
21
0
0
02 Sep 2023
Towards Code Watermarking with Dual-Channel Transformations
Borui Yang
Wei Li
Liyao Xiang
Bo-wen Li
28
8
0
02 Sep 2023
Identifying and Mitigating the Security Risks of Generative AI
Clark W. Barrett
Bradley L Boyd
Elie Bursztein
Nicholas Carlini
Brad Chen
...
Zulfikar Ramzan
Khawaja Shams
D. Song
Ankur Taly
Diyi Yang
SILM
37
92
0
28 Aug 2023
AI Deception: A Survey of Examples, Risks, and Potential Solutions
Peter S. Park
Simon Goldstein
Aidan O'Gara
Michael Chen
Dan Hendrycks
30
141
0
28 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
39
78
0
24 Aug 2023
How to Protect Copyright Data in Optimization of Large Language Models?
T. Chu
Zhao-quan Song
Chiwun Yang
40
29
0
23 Aug 2023
A Cost Analysis of Generative Language Models and Influence Operations
Micah Musser
32
19
0
07 Aug 2023
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao
Jian Lou
Kui Ren
Zhan Qin
AAML
VLM
37
25
0
05 Aug 2023
Dual Governance: The intersection of centralized regulation and crowdsourced safety mechanisms for Generative AI
Avijit Ghosh
D. Lakshmi
30
3
0
02 Aug 2023
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?
Amrita Bhattacharjee
Huan Liu
DeLMO
22
56
0
02 Aug 2023
Advancing Beyond Identification: Multi-bit Watermark for Large Language Models
Kiyoon Yoo
Wonhyuk Ahn
Nojun Kwak
WaLM
30
17
0
01 Aug 2023
NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers?
Steffen Eger
Christoph Leiter
Jonas Belouadi
Ran Zhang
Aida Kostikova
Daniil Larionov
Yanran Chen
Vivian Fresen
AI4CE
29
4
0
31 Jul 2023
SAKSHI: Decentralized AI Platforms
S. Bhat
Canhui Chen
Zerui Cheng
Zhixuan Fang
Ashwin Hebbar
...
Ranvir Rana
Peiyao Sheng
Himanshu Tyagi
Pramod Viswanath
Xuechao Wang
13
4
0
31 Jul 2023
Anatomy of an AI-powered malicious social botnet
Kai-Cheng Yang
Filippo Menczer
DeLMO
43
67
0
30 Jul 2023
An Unforgeable Publicly Verifiable Watermark for Large Language Models
Aiwei Liu
Leyi Pan
Xuming Hu
Shuang Li
Lijie Wen
Irwin King
Philip S. Yu
WaLM
54
31
0
30 Jul 2023
Towards Codable Watermarking for Injecting Multi-bits Information to LLMs
Lean Wang
Wenkai Yang
Deli Chen
Hao Zhou
Yankai Lin
Fandong Meng
Jie Zhou
Xu Sun
WaLM
39
15
0
29 Jul 2023
Three Bricks to Consolidate Watermarks for Large Language Models
Pierre Fernandez
Antoine Chaffin
Karim Tit
Vivien Chappelier
Teddy Furon
WaLM
19
47
0
26 Jul 2023
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples
Ryuto Koike
Masahiro Kaneko
Naoaki Okazaki
DeLMO
40
74
0
21 Jul 2023
Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text
Lingyi Yang
Feng Jiang
Haizhou Li
DeLMO
42
23
0
21 Jul 2023