
TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors

10 March 2025


Papers citing "TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors"


Can AI-Generated Text be Reliably Detected?

Vinu Sankar Sadasivan

1.1K

540

20 Jan 2025

On the Generalization and Adaptation Ability of Machine-Generated Text Detectors in Academic Writing

...

416

23 Dec 2024

RAFT: Realistic Attacks to Fool Text Detectors

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2024

271

04 Oct 2024

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack

International Conference on Language Resources and Evaluation (LREC), 2024

326

02 Apr 2024

Stumbling Blocks: Stress Testing the Robustness of Machine-Generated Text Detectors Under Attacks

Xiaoming Liu

Tianxing He

317

18 Feb 2024

Spotting LLMs With Binoculars: Zero-Shot Detection of Machine-Generated Text

International Conference on Machine Learning (ICML), 2024

410

242

22 Jan 2024

A Survey on Detection of LLMs-Generated Content

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

397

24 Oct 2023

A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future Directions

558

120

23 Oct 2023

An LLM can Fool Itself: A Prompt-Based Adversarial Attack

Ning Liu

296

142

20 Oct 2023

Fast-DetectGPT: Efficient Zero-Shot Detection of Machine-Generated Text via Conditional Probability Curvature

International Conference on Learning Representations (ICLR), 2023

Yue Zhang

412

292

08 Oct 2023

ConDA: Contrastive Domain Adaptation for AI-generated Text Detection

International Joint Conference on Natural Language Processing (IJCNLP), 2023

Huan Liu

388

07 Sep 2023

RADAR: Robust AI-Text Detection via Adversarial Learning

Neural Information Processing Systems (NeurIPS), 2023

Xiaomeng Hu

Pin-Yu Chen

Tsung-Yi Ho

DeLMO

428

223

07 Jul 2023

Sources of Hallucination by Large Language Models on Inference Tasks

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

Nick McKenna

Tianyi Li

Liang Cheng

Mohammad Javad Hosseini

Mark Johnson

Mark Steedman

LRM HILM

401

261

23 May 2023

DetectLLM: Leveraging Log Rank Information for Zero-Shot Detection of Machine-Generated Text

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023

379

250

23 May 2023

Influence of External Information on Large Language Models Mirrors Social Cognitive Patterns

IEEE Transactions on Computational Social Systems (IEEE TCSS), 2023

Yaojie Lu

291

08 May 2023

Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models

First Monday (FM), 2023

Emilio Ferrara

SILM

554

362

07 Apr 2023

Summary of ChatGPT-Related Research and Perspective Towards the Future of Large Language Models

...

Xiang Li

Ning Qiang

Dingang Shen

Tianming Liu

Bao Ge

ALM ELM AI4CE LM&MA LLMAG

405

717

04 Apr 2023

MGTBench: Benchmarking Machine-Generated Text Detection

Conference on Computer and Communications Security (CCS), 2023

Michael Backes

389

145

26 Mar 2023

Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense

Neural Information Processing Systems (NeurIPS), 2023

436

485

23 Mar 2023

GPT-4 Technical Report

...

5.3K

23,506

15 Mar 2023

AI and the FCI: Can ChatGPT Project an Understanding of Introductory Physics?

Colin G. West

177

02 Mar 2023

DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature

International Conference on Machine Learning (ICML), 2023

E. Mitchell

Yoonho Lee

Alexander Khazatsky

Christopher D. Manning

Chelsea Finn

827

956

26 Jan 2023

How Close is ChatGPT to Human Experts? Comparison Corpus, Evaluation, and Detection

381

815

18 Jan 2023

Automatic Detection of Generated Text is Easiest when Humans are Fooled

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

883

471

02 Nov 2019

Release Strategies and the Social Impacts of Language Models

...

573

811

24 Aug 2019

Is BERT Really Robust? A Strong Baseline for Natural Language Attack on Text Classification and Entailment

AAAI Conference on Artificial Intelligence (AAAI), 2019

1.0K

1,314

27 Jul 2019

GLTR: Statistical Detection and Visualization of Generated Text

Annual Meeting of the Association for Computational Linguistics (ACL), 2019

434

759

10 Jun 2019

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

3.1K

113,499

11 Oct 2018