ResearchTrend.AI
  • Communities
  • Connect sessions
  • AI calendar
  • Organizations
  • Join Slack
  • Contact Sales
Papers
Communities
Social Events
Terms and Conditions
Pricing
Contact Sales
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2026 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2503.08708
  4. Cited By
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors
v1v2 (latest)

TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors

10 March 2025
Jingyi Zheng
Junfeng Wang
Zhen Sun
Wenhan Dong
Yule Liu
Xinlei He
    AAML
ArXiv (abs)PDFHTMLGithub

Papers citing "TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors"

28 / 28 papers shown
Can AI-Generated Text be Reliably Detected?
Can AI-Generated Text be Reliably Detected?
Vinu Sankar Sadasivan
Aounon Kumar
S. Balasubramanian
Wenxiao Wang
Soheil Feizi
DeLMO
1.1K
540
0
20 Jan 2025
On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing
On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing
Yule Liu
Zhiyuan Zhong
Yifan Liao
Zhen Sun
Jingyi Zheng
...
Qingyuan Gong
Fenghua Tong
Yang Chen
Yang Zhang
Xinlei He
DeLMO
416
0
0
23 Dec 2024
RAFT: Realistic Attacks to Fool Text Detectors
RAFT: Realistic Attacks to Fool Text DetectorsConference on Empirical Methods in Natural Language Processing (EMNLP), 2024
James Wang
Ran Li
Junfeng Yang
Chengzhi Mao
AAMLDeLMO
271
11
0
04 Oct 2024
Humanizing Machine-Generated Content: Evading AI-Text Detection through
  Adversarial Attack
Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial AttackInternational Conference on Language Resources and Evaluation (LREC), 2024
Ying Zhou
Xianpei Han
Le Sun
DeLMOAAML
326
32
0
02 Apr 2024
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated
  Text Detectors Under Attacks
Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks
Yichen Wang
Shangbin Feng
Abe Bohan Hou
Xiao Pu
Chao Shen
Xiaoming Liu
Yulia Tsvetkov
Tianxing He
DeLMO
317
28
0
18 Feb 2024
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated
  Text
Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated TextInternational Conference on Machine Learning (ICML), 2024
Abhimanyu Hans
Avi Schwarzschild
Valeriia Cherepanova
Hamid Kazemi
Aniruddha Saha
Micah Goldblum
Jonas Geiping
Tom Goldstein
DeLMO
410
242
0
22 Jan 2024
A Survey on Detection of LLMs-Generated Content
A Survey on Detection of LLMs-Generated ContentConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Xianjun Yang
Liangming Pan
Xuandong Zhao
Haifeng Chen
Linda R. Petzold
William Y. Wang
Wei Cheng
DeLMO
397
81
0
24 Oct 2023
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future
  Directions
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions
Junchao Wu
Shu Yang
Runzhe Zhan
Yulin Yuan
Yang Li
Lidia S. Chao
DeLMO
558
120
0
23 Oct 2023
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
An LLM can Fool Itself: A Prompt-Based Adversarial Attack
Xilie Xu
Keyi Kong
Ning Liu
Li-zhen Cui
Haiyan Zhao
Jingfeng Zhang
Mohan Kankanhalli
AAMLSILM
296
142
0
20 Oct 2023
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text
  via Conditional Probability Curvature
Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability CurvatureInternational Conference on Learning Representations (ICLR), 2023
Guangsheng Bao
Yanbin Zhao
Zhiyang Teng
Linyi Yang
Yue Zhang
412
292
0
08 Oct 2023
ConDA: Contrastive Domain Adaptation for AI-generated Text Detection
ConDA: Contrastive Domain Adaptation for AI-generated Text DetectionInternational Joint Conference on Natural Language Processing (IJCNLP), 2023
Amrita Bhattacharjee
Tharindu Kumarage
Raha Moraffah
Huan Liu
DeLMO
388
77
0
07 Sep 2023
RADAR: Robust AI-Text Detection via Adversarial Learning
RADAR: Robust AI-Text Detection via Adversarial LearningNeural Information Processing Systems (NeurIPS), 2023
Xiaomeng Hu
Pin-Yu Chen
Tsung-Yi Ho
DeLMO
428
223
0
07 Jul 2023
Sources of Hallucination by Large Language Models on Inference Tasks
Sources of Hallucination by Large Language Models on Inference TasksConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Nick McKenna
Tianyi Li
Liang Cheng
Mohammad Javad Hosseini
Mark Johnson
Mark Steedman
LRMHILM
401
261
0
23 May 2023
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of
  Machine-Generated Text
DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated TextConference on Empirical Methods in Natural Language Processing (EMNLP), 2023
Jinyan Su
Terry Yue Zhuo
Haiyan Zhao
Preslav Nakov
DeLMO
379
250
0
23 May 2023
Influence of External Information on Large Language Models Mirrors
  Social Cognitive Patterns
Influence of External Information on Large Language Models Mirrors Social Cognitive PatternsIEEE Transactions on Computational Social Systems (IEEE TCSS), 2023
Ning Bian
Hongyu Lin
Peilin Liu
Yaojie Lu
Chunkang Zhang
Xianpei Han
Xianpei Han
Le Sun
291
30
0
08 May 2023
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language
  Models
Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language ModelsFirst Monday (FM), 2023
Emilio Ferrara
SILM
554
362
0
07 Apr 2023
Summary of ChatGPT-Related Research and Perspective Towards the Future
  of Large Language Models
Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models
Yi-Hsien Liu
Tianle Han
Siyuan Ma
Jia-Yu Zhang
Yuanyu Yang
...
Xiang Li
Ning Qiang
Dingang Shen
Tianming Liu
Bao Ge
ALMELMAI4CELM&MALLMAG
405
717
0
04 Apr 2023
MGTBench: Benchmarking Machine-Generated Text Detection
MGTBench: Benchmarking Machine-Generated Text DetectionConference on Computer and Communications Security (CCS), 2023
Xinlei He
Xinyue Shen
Sihao Lin
Michael Backes
Yang Zhang
DeLMO
389
145
0
26 Mar 2023
Paraphrasing evades detectors of AI-generated text, but retrieval is an
  effective defense
Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defenseNeural Information Processing Systems (NeurIPS), 2023
Kalpesh Krishna
Yixiao Song
Marzena Karpinska
John Wieting
Mohit Iyyer
DeLMO
436
485
0
23 Mar 2023
GPT-4 Technical Report
GPT-4 Technical Report
OpenAI OpenAI
OpenAI Josh Achiam
Steven Adler
Sandhini Agarwal
Lama Ahmad
...
Shengjia Zhao
Tianhao Zheng
Juntang Zhuang
William Zhuk
Barret Zoph
LLMAGMLLM
5.3K
23,506
0
15 Mar 2023
AI and the FCI: Can ChatGPT Project an Understanding of Introductory
  Physics?
AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?
Colin G. West
177
74
0
02 Mar 2023
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability
  Curvature
DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability CurvatureInternational Conference on Machine Learning (ICML), 2023
E. Mitchell
Yoonho Lee
Alexander Khazatsky
Christopher D. Manning
Chelsea Finn
827
956
0
26 Jan 2023
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation,
  and Detection
How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection
Biyang Guo
Xin Zhang
Ziyuan Wang
Minqi Jiang
Jinran Nie
Yuxuan Ding
Jianwei Yue
Yupeng Wu
DeLMOELM
381
815
0
18 Jan 2023
Automatic Detection of Generated Text is Easiest when Humans are Fooled
Automatic Detection of Generated Text is Easiest when Humans are FooledAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Daphne Ippolito
Daniel Duckworth
Chris Callison-Burch
Douglas Eck
DeLMO
883
471
0
02 Nov 2019
Release Strategies and the Social Impacts of Language Models
Release Strategies and the Social Impacts of Language Models
Irene Solaiman
Miles Brundage
Jack Clark
Amanda Askell
Ariel Herbert-Voss
...
Miles McCain
Alex Newhouse
Jason Blazakis
Kris McGuffie
Jasmine Wang
573
811
0
24 Aug 2019
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on
  Text Classification and Entailment
Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and EntailmentAAAI Conference on Artificial Intelligence (AAAI), 2019
Di Jin
Zhijing Jin
Qiufeng Wang
Peter Szolovits
SILMAAML
1.0K
1,314
0
27 Jul 2019
GLTR: Statistical Detection and Visualization of Generated Text
GLTR: Statistical Detection and Visualization of Generated TextAnnual Meeting of the Association for Computational Linguistics (ACL), 2019
Sebastian Gehrmann
Hendrik Strobelt
Alexander M. Rush
DeLMO
434
759
0
10 Jun 2019
BERT: Pre-training of Deep Bidirectional Transformers for Language
  Understanding
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin
Ming-Wei Chang
Kenton Lee
Kristina Toutanova
VLMSSLSSeg
3.1K
113,499
0
11 Oct 2018
1
Page 1 of 1