7
0

PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection

Wenhao Li
Selvakumar Manickam
Yung-wey Chong
Shankar Karuppayah
Main:6 Pages
10 Figures
Bibliography:2 Pages
Appendix:2 Pages
Abstract

Phishing websites continue to pose a significant cybersecurity threat, often leveraging deceptive structures, brand impersonation, and social engineering tactics to evade detection. While recent advances in large language models (LLMs) have enabled improved phishing detection through contextual understanding, most existing approaches rely on single-agent classification facing the risks of hallucination and lack interpretability or robustness. To address these limitations, we propose PhishDebate, a modular multi-agent LLM-based debate framework for phishing website detection. PhishDebate employs four specialized agents to independently analyze different textual aspects of a webpage--URL structure, HTML composition, semantic content, and brand impersonation--under the coordination of a Moderator and a final Judge. Through structured debate and divergent thinking, the framework delivers more accurate and interpretable decisions. Extensive evaluations on commercial LLMs demonstrate that PhishDebate achieves 98.2% recall and 98.2% True Positive Rate (TPR) on a real-world phishing dataset, and outperforms single-agent and Chain of Thought (CoT) baselines. Additionally, its modular design allows agent-level configurability, enabling adaptation to varying resource and application requirements.

View on arXiv
@article{li2025_2506.15656,
  title={ PhishDebate: An LLM-Based Multi-Agent Framework for Phishing Website Detection },
  author={ Wenhao Li and Selvakumar Manickam and Yung-wey Chong and Shankar Karuppayah },
  journal={arXiv preprint arXiv:2506.15656},
  year={ 2025 }
}
Comments on this paper