2311.07138
WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models
13 November 2023
Shangqing Tu
Yuliang Sun
Yushi Bai
Jifan Yu
Lei Hou
Juanzi Li
WaLM
Papers citing
"WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models"
13 / 13 papers shown
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
Soheil Feizi
DeLMO
224
390
0
20 Jan 2025
An Entropy-based Text Watermarking Detection Method
Yijian Lu
Aiwei Liu
Dianzhi Yu
Jingjing Li
Irwin King
WaLM
68
38
0
20 Mar 2024
Towards Possibilities & Impossibilities of AI-generated Text Detection: A Survey
Soumya Suvra Ghosal
Souradip Chakraborty
Jonas Geiping
Furong Huang
Dinesh Manocha
Amrit Singh Bedi
DeLMO
76
37
0
23 Oct 2023
Robust Distortion-free Watermarks for Language Models
Rohith Kuditipudi
John Thickstun
Tatsunori Hashimoto
Percy Liang
WaLM
82
182
0
28 Jul 2023
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena
Lianmin Zheng
Wei-Lin Chiang
Ying Sheng
Siyuan Zhuang
Zhanghao Wu
...
Dacheng Li
Eric Xing
Haotong Zhang
Joseph E. Gonzalez
Ion Stoica
ALM
OSLM
ELM
408
4,422
0
09 Jun 2023
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback
Yann Dubois
Xuechen Li
Rohan Taori
Tianyi Zhang
Ishaan Gulrajani
Jimmy Ba
Carlos Guestrin
Percy Liang
Tatsunori B. Hashimoto
ALM
134
605
0
22 May 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
93
626
0
26 Jan 2023
Evaluating Large Language Models Trained on Code
Mark Chen
Jerry Tworek
Heewoo Jun
Qiming Yuan
Henrique Pondé
...
Bob McGrew
Dario Amodei
Sam McCandlish
Ilya Sutskever
Wojciech Zaremba
ELM
ALM
236
5,647
0
07 Jul 2021
Evaluating the Factual Consistency of Abstractive Text Summarization
Wojciech Kryściński
Bryan McCann
Caiming Xiong
R. Socher
HILM
115
746
0
28 Oct 2019
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Colin Raffel
Noam M. Shazeer
Adam Roberts
Katherine Lee
Sharan Narang
Michael Matena
Yanqi Zhou
Wei Li
Peter J. Liu
AIMat
462
20,317
0
23 Oct 2019
ELI5: Long Form Question Answering
Angela Fan
Yacine Jernite
Ethan Perez
David Grangier
Jason Weston
Michael Auli
AI4MH
ELM
103
624
0
22 Jul 2019
BERTScore: Evaluating Text Generation with BERT
Tianyi Zhang
Varsha Kishore
Felix Wu
Kilian Q. Weinberger
Yoav Artzi
349
5,860
0
21 Apr 2019
HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering
Zhilin Yang
Peng Qi
Saizheng Zhang
Yoshua Bengio
William W. Cohen
Ruslan Salakhutdinov
Christopher D. Manning
RALM
188
2,694
0
25 Sep 2018