Communities
Connect sessions
AI calendar
Organizations
Join Slack
Contact Sales
Search
Open menu
Home
Papers
2503.08708
Cited By
v1
v2 (latest)
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
10 March 2025
Jingyi Zheng
Junfeng Wang
Zhen Sun
Wenhan Dong
Yule Liu
Xinlei He
AAML
Re-assign community
ArXiv (abs)
PDF
HTML
Github
Papers citing
"TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors"
28 / 28 papers shown
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
Soheil Feizi
DeLMO
1.1K
540
0
20 Jan 2025
On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing
Yule Liu
Zhiyuan Zhong
Yifan Liao
Zhen Sun
Jingyi Zheng
...
Qingyuan Gong
Fenghua Tong
Yang Chen
Yang Zhang
Xinlei He
DeLMO
416
0
0
23 Dec 2024
RAFT: Realistic Attacks to Fool Text Detectors
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024
James Wang
Ran Li
Junfeng Yang
Chengzhi Mao
AAML
DeLMO
271
11
0
04 Oct 2024
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack
International Conference on Language Resources and Evaluation (LREC), 2024
Ying Zhou
Xianpei Han
Le Sun
DeLMO
AAML
326
32
0
02 Apr 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
317
28
0
18 Feb 2024
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text
International Conference on Machine Learning (ICML), 2024
Abhimanyu Hans
Avi Schwarzschild
Valeriia Cherepanova
Hamid Kazemi
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
DeLMO
410
242
0
22 Jan 2024
A Survey on Detection of LLMs-Generated Content
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xianjun Yang
Liangming Pan
Xuandong Zhao
Haifeng Chen
Linda R. Petzold
William Y. Wang
Wei Cheng
DeLMO
397
81
0
24 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Yang Li
Lidia S. Chao
DeLMO
558
120
0
23 Oct 2023
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Xilie Xu
Keyi Kong
Ning Liu
Li-zhen Cui
Haiyan Zhao
Jingfeng Zhang
Mohan Kankanhalli
AAML
SILM
296
142
0
20 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature
International Conference on Learning Representations (ICLR), 2023
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
412
292
0
08 Oct 2023
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
International Joint Conference on Natural Language Processing (IJCNLP), 2023
Amrita Bhattacharjee
Tharindu Kumarage
Raha Moraffah
Huan Liu
DeLMO
388
77
0
07 Sep 2023
RADAR: Robust AI-Text Detection via Adversarial Learning
Neural Information Processing Systems (NeurIPS), 2023
Xiaomeng Hu
Pin-Yu Chen
Tsung-Yi Ho
DeLMO
428
223
0
07 Jul 2023
Sources of Hallucination by Large Language Models on Inference Tasks
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nick McKenna
Tianyi Li
Liang Cheng
Mohammad Javad Hosseini
Mark Johnson
Mark Steedman
LRM
HILM
401
261
0
23 May 2023
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jinyan Su
Terry Yue Zhuo
Haiyan Zhao
Preslav Nakov
DeLMO
379
250
0
23 May 2023
Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns
IEEE Transactions on Computational Social Systems (IEEE TCSS), 2023
Ning Bian
Hongyu Lin
Peilin Liu
Yaojie Lu
Chunkang Zhang
Xianpei Han
Xianpei Han
Le Sun
291
30
0
08 May 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models
First Monday (FM), 2023
Emilio Ferrara
SILM
554
362
0
07 Apr 2023
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
Yi-Hsien Liu
Tianle Han
Siyuan Ma
Jia-Yu Zhang
Yuanyu Yang
...
Xiang Li
Ning Qiang
Dingang Shen
Tianming Liu
Bao Ge
ALM
ELM
AI4CE
LM&MA
LLMAG
405
717
0
04 Apr 2023
MGTBench: Benchmarking Machine-Generated Text Detection
Conference on Computer and Communications Security (CCS), 2023
Xinlei He
Xinyue Shen
Sihao Lin
Michael Backes
Yang Zhang
DeLMO
389
145
0
26 Mar 2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense
Neural Information Processing Systems (NeurIPS), 2023
Kalpesh Krishna
Yixiao Song
Marzena Karpinska
John Wieting
Mohit Iyyer
DeLMO
436
485
0
23 Mar 2023
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAG
MLLM
5.3K
23,506
0
15 Mar 2023
AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?
Colin G. West
177
74
0
02 Mar 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature
International Conference on Machine Learning (ICML), 2023
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
827
956
0
26 Jan 2023
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Biyang Guo
Xin Zhang
Ziyuan Wang
Minqi Jiang
Jinran Nie
Yuxuan Ding
Jianwei Yue
Yupeng Wu
DeLMO
ELM
381
815
0
18 Jan 2023
Automatic Detection of Generated Text is Easiest when Humans are Fooled
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Daphne Ippolito
Daniel Duckworth
Chris Callison-Burch
Douglas Eck
DeLMO
883
471
0
02 Nov 2019
Release Strategies and the Social Impacts of Language Models
Irene Solaiman
Miles Brundage
Jack Clark
Amanda Askell
Ariel Herbert-Voss
...
Miles McCain
Alex Newhouse
Jason Blazakis
Kris McGuffie
Jasmine Wang
573
811
0
24 Aug 2019
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment
AAAI Conference on Artificial Intelligence (AAAI), 2019
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILM
AAML
1.0K
1,314
0
27 Jul 2019
GLTR: Statistical Detection and Visualization of Generated Text
Annual Meeting of the Association for Computational Linguistics (ACL), 2019
Sebastian Gehrmann
Hendrik Strobelt
Alexander M. Rush
DeLMO
434
759
0
10 Jun 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLM
SSL
SSeg
3.1K
113,499
0
11 Oct 2018
1
Page 1 of 1