PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection

18 June 2025

Wenhao Li

Selvakumar Manickam

Yung-wey Chong

Shankar Karuppayah

Author Contacts:

wenhaoli@ieee.org selva@usm.my chong@usm.my kshankar@usm.my

LLMAG

ArXiv (abs)PDF HTML

Main:6 Pages

10 Figures

Bibliography:2 Pages

Appendix:2 Pages

Abstract

Phishing websites continue to pose a significant cybersecurity threat, often leveraging deceptive structures, brand impersonation, and social engineering tactics to evade detection. While recent advances in large language models (LLMs) have enabled improved phishing detection through contextual understanding, most existing approaches rely on single-agent classification facing the risks of hallucination and lack interpretability or robustness. To address these limitations, we propose PhishDebate, a modular multi-agent LLM-based debate framework for phishing website detection. PhishDebate employs four specialized agents to independently analyze different textual aspects of a webpage--URL structure, HTML composition, semantic content, and brand impersonation--under the coordination of a Moderator and a final Judge. Through structured debate and divergent thinking, the framework delivers more accurate and interpretable decisions. Extensive evaluations on commercial LLMs demonstrate that PhishDebate achieves 98.2% recall and 98.2% True Positive Rate (TPR) on a real-world phishing dataset, and outperforms single-agent and Chain of Thought (CoT) baselines. Additionally, its modular design allows agent-level configurability, enabling adaptation to varying resource and application requirements.

View on arXiv

@article{li2025_2506.15656,
  title={ PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection },
  author={ Wenhao Li and Selvakumar Manickam and Yung-wey Chong and Shankar Karuppayah },
  journal={arXiv preprint arXiv:2506.15656},
  year={ 2025 }
}

Comments on this paper