A Watermark for Large Language Models

24 January 2023
John Kirchenbauer
Jonas Geiping
Yuxin Wen
Jonathan Katz
Ian Miers
Tom Goldstein
    VLM
    WaLM

Papers citing "A Watermark for Large Language Models"

50 / 319 papers shown
Title
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models
Sicheng Zhu
Ruiyi Zhang
Bang An
Gang Wu
Joe Barrow
Zichao Wang
Furong Huang
A. Nenkova
Tong Sun
SILM
AAML
30
40
0
23 Oct 2023
Did the Neurons Read your Book? Document-level Membership Inference for Large Language Models
Matthieu Meeus
Shubham Jain
Marek Rei
Yves-Alexandre de Montjoye
MIALM
28
30
0
23 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Derek F. Wong
Lidia S. Chao
DeLMO
29
23
0
23 Oct 2023
REMARK-LLM: A Robust and Efficient Watermarking Framework for Generative Large Language Models
Ruisi Zhang
Shehzeen Samarah Hussain
Paarth Neekhara
F. Koushanfar
31
27
0
18 Oct 2023
Watermarking LLMs with Weight Quantization
Linyang Li
Botian Jiang
Pengyu Wang
Ke Ren
Hang Yan
Xipeng Qiu
MQ
WaLM
13
11
0
17 Oct 2023
Data Contamination Through the Lens of Time
Manley Roberts
Himanshu Thakur
Christine Herlihy
Colin White
Samuel Dooley
84
31
0
16 Oct 2023
Embarrassingly Simple Text Watermarks
Ryoma Sato
Yuki Takezawa
Han Bao
Kenta Niwa
Makoto Yamada
WaLM
26
14
0
13 Oct 2023
SeqXGPT: Sentence-Level AI-Generated Text Detection
Pengyu Wang
Linyang Li
Ke Ren
Botian Jiang
Dong Zhang
Xipeng Qiu
DeLMO
23
50
0
13 Oct 2023
A Resilient and Accessible Distribution-Preserving Watermark for Large Language Models
Yihan Wu
Zhengmian Hu
Junfeng Guo
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
25
21
0
11 Oct 2023
A Semantic Invariant Robust Watermark for Large Language Models
Aiwei Liu
Leyi Pan
Xuming Hu
Shiao Meng
Lijie Wen
WaLM
42
55
0
10 Oct 2023
On the Zero-Shot Generalization of Machine-Generated Text Detectors
Xiao Pu
Jingyu Zhang
Xiaochuang Han
Yulia Tsvetkov
Tianxing He
DeLMO
36
14
0
08 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
26
130
0
08 Oct 2023
Zero-Shot Detection of Machine-Generated Codes
Xianjun Yang
Kexun Zhang
Haifeng Chen
Linda R. Petzold
William Yang Wang
Wei Cheng
DeLMO
26
11
0
08 Oct 2023
How Reliable Are AI-Generated-Text Detectors? An Assessment Framework Using Evasive Soft Prompts
Tharindu Kumarage
Paras Sheth
Raha Moraffah
Joshua Garland
Huan Liu
DeLMO
28
23
0
08 Oct 2023
AI Regulation in Europe: From the AI Act to Future Regulatory Challenges
Philipp Hacker
23
8
0
06 Oct 2023
SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation
Abe Bohan Hou
Jingyu Zhang
Tianxing He
Yichen Wang
Yung-Sung Chuang
Hongwei Wang
Lingfeng Shen
Benjamin Van Durme
Daniel Khashabi
Yulia Tsvetkov
WaLM
34
0
0
06 Oct 2023
Hate Speech Detection in Limited Data Contexts using Synthetic Data Generation
Aman Khullar
Daniel K. Nkemelu
Cuong V. Nguyen
Michael L. Best
37
2
0
04 Oct 2023
On the Generalization of Training-based ChatGPT Detection Methods
Han Xu
Jie Ren
Pengfei He
Shenglai Zeng
Yingqian Cui
Amy Liu
Hui Liu
Jiliang Tang
DeLMO
29
13
0
02 Oct 2023
Mirror Diffusion Models for Constrained and Watermarked Generation
Guan-Horng Liu
T. Chen
Evangelos A. Theodorou
Molei Tao
DiffM
18
22
0
02 Oct 2023
Necessary and Sufficient Watermark for Large Language Models
Yuki Takezawa
Ryoma Sato
Han Bao
Kenta Niwa
Makoto Yamada
WaLM
50
7
0
02 Oct 2023
WASA: WAtermark-based Source Attribution for Large Language Model-Generated Data
Jingtan Wang
Xinyang Lu
Zitong Zhao
Zhongxiang Dai
Chuan-Sheng Foo
See-Kiong Ng
K. H. Low
WaLM
57
14
0
01 Oct 2023
Can LLM-Generated Misinformation Be Detected?
Canyu Chen
Kai Shu
DeLMO
39
158
0
25 Sep 2023
From Text to Source: Results in Detecting Large Language Model-Generated Content
Wissam Antoun
Benoît Sagot
Djamé Seddah
DeLMO
33
11
0
23 Sep 2023
TOPFORMER: Topology-Aware Authorship Attribution of Deepfake Texts with Diverse Writing Styles
Adaku Uchendu
Thai Le
Dongwon Lee
DeLMO
24
3
0
22 Sep 2023
Unbiased Watermark for Large Language Models
Zhengmian Hu
Lichang Chen
Xidong Wu
Yihan Wu
Hongyang R. Zhang
Heng-Chiao Huang
WaLM
38
45
0
22 Sep 2023
A Statistical Turing Test for Generative Models
Hayden Helm
Carey E. Priebe
Weiwei Yang
DeLMO
24
7
0
16 Sep 2023
ICLEF: In-Context Learning with Expert Feedback for Explainable Style Transfer
Arkadiy Saakyan
Smaranda Muresan
23
3
0
15 Sep 2023
Detecting ChatGPT: A Survey of the State of Detecting ChatGPT-Generated Text
Mahdi Dhaini
Wessel Poelman
Ege Erdogan
DeLMO
49
12
0
14 Sep 2023
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
Yuancheng Xu
Chenghao Deng
Yanchao Sun
Ruijie Zheng
Xiyao Wang
Jieyu Zhao
Furong Huang
35
4
0
07 Sep 2023
J-Guard: Journalism Guided Adversarially Robust Detection of AI-generated News
Tharindu Kumarage
Amrita Bhattacharjee
Djordje Padejski
Kristy Roschke
Dan Gillmor
Scott W. Ruston
Huan Liu
Joshua Garland
DeLMO
21
9
0
06 Sep 2023
Do You Trust ChatGPT? -- Perceived Credibility of Human and AI-Generated Content
Martin Huschens
Martin Briesch
Dominik Sobania
Franz Rothlauf
13
12
0
05 Sep 2023
Studying the impacts of pre-training using ChatGPT-generated text on downstream tasks
Sarthak Anand
21
0
0
02 Sep 2023
Towards Code Watermarking with Dual-Channel Transformations
Borui Yang
Wei Li
Liyao Xiang
Bo-wen Li
28
8
0
02 Sep 2023
Identifying and Mitigating the Security Risks of Generative AI
Clark W. Barrett
Bradley L Boyd
Elie Bursztein
Nicholas Carlini
Brad Chen
...
Zulfikar Ramzan
Khawaja Shams
D. Song
Ankur Taly
Diyi Yang
SILM
37
92
0
28 Aug 2023
AI Deception: A Survey of Examples, Risks, and Potential Solutions
Peter S. Park
Simon Goldstein
Aidan O'Gara
Michael Chen
Dan Hendrycks
30
141
0
28 Aug 2023
Use of LLMs for Illicit Purposes: Threats, Prevention Measures, and Vulnerabilities
Maximilian Mozes
Xuanli He
Bennett Kleinberg
Lewis D. Griffin
39
78
0
24 Aug 2023
How to Protect Copyright Data in Optimization of Large Language Models?
T. Chu
Zhao-quan Song
Chiwun Yang
40
29
0
23 Aug 2023
A Cost Analysis of Generative Language Models and Influence Operations
Micah Musser
32
19
0
07 Aug 2023
PromptCARE: Prompt Copyright Protection by Watermark Injection and Verification
Hongwei Yao
Jian Lou
Kui Ren
Zhan Qin
AAML
VLM
37
25
0
05 Aug 2023
Dual Governance: The intersection of centralized regulation and crowdsourced safety mechanisms for Generative AI
Avijit Ghosh
D. Lakshmi
30
3
0
02 Aug 2023
Fighting Fire with Fire: Can ChatGPT Detect AI-generated Text?
Amrita Bhattacharjee
Huan Liu
DeLMO
22
56
0
02 Aug 2023
Advancing Beyond Identification: Multi-bit Watermark for Large Language Models
Kiyoon Yoo
Wonhyuk Ahn
Nojun Kwak
WaLM
30
17
0
01 Aug 2023
NLLG Quarterly arXiv Report 06/23: What are the most influential current AI Papers?
Steffen Eger
Christoph Leiter
Jonas Belouadi
Ran Zhang
Aida Kostikova
Daniil Larionov
Yanran Chen
Vivian Fresen
AI4CE
29
4
0
31 Jul 2023
SAKSHI: Decentralized AI Platforms
S. Bhat
Canhui Chen
Zerui Cheng
Zhixuan Fang
Ashwin Hebbar
...
Ranvir Rana
Peiyao Sheng
Himanshu Tyagi
Pramod Viswanath
Xuechao Wang
13
4
0
31 Jul 2023
Anatomy of an AI-powered malicious social botnet
Kai-Cheng Yang
Filippo Menczer
DeLMO
43
67
0
30 Jul 2023
An Unforgeable Publicly Verifiable Watermark for Large Language Models
Aiwei Liu
Leyi Pan
Xuming Hu
Shuang Li
Lijie Wen
Irwin King
Philip S. Yu
WaLM
54
31
0
30 Jul 2023
Towards Codable Watermarking for Injecting Multi-bits Information to LLMs
Lean Wang
Wenkai Yang
Deli Chen
Hao Zhou
Yankai Lin
Fandong Meng
Jie Zhou
Xu Sun
WaLM
39
15
0
29 Jul 2023
Three Bricks to Consolidate Watermarks for Large Language Models
Pierre Fernandez
Antoine Chaffin
Karim Tit
Vivien Chappelier
Teddy Furon
WaLM
19
47
0
26 Jul 2023
OUTFOX: LLM-Generated Essay Detection Through In-Context Learning with Adversarially Generated Examples
Ryuto Koike
Masahiro Kaneko
Naoaki Okazaki
DeLMO
40
74
0
21 Jul 2023
Is ChatGPT Involved in Texts? Measure the Polish Ratio to Detect ChatGPT-Generated Text
Lingyi Yang
Feng Jiang
Haizhou Li
DeLMO
42
23
0
21 Jul 2023