ResearchTrend.AI
  • Papers
  • Communities
  • Events
  • Blog
  • Pricing
Papers
Communities
Social Events
Terms and Conditions
Pricing
Parameter LabParameter LabTwitterGitHubLinkedInBlueskyYoutube

© 2025 ResearchTrend.AI, All rights reserved.

  1. Home
  2. Papers
  3. 2506.06180
40
0

Detecting Voice Phishing with Precision: Fine-Tuning Small Language Models

6 June 2025
Ju Yong Sim
Seong Hwan Kim
ArXiv (abs)PDFHTML
Main:11 Pages
5 Figures
Bibliography:1 Pages
12 Tables
Appendix:3 Pages
Abstract

We develop a voice phishing (VP) detector by fine-tuning Llama3, a representative open-source, small language model (LM). In the prompt, we provide carefully-designed VP evaluation criteria and apply the Chain-of-Thought (CoT) technique. To evaluate the robustness of LMs and highlight differences in their performance, we construct an adversarial test dataset that places the models under challenging conditions. Moreover, to address the lack of VP transcripts, we create transcripts by referencing existing or new types of VP techniques. We compare cases where evaluation criteria are included, the CoT technique is applied, or both are used together. In the experiment, our results show that the Llama3-8B model, fine-tuned with a dataset that includes a prompt with VP evaluation criteria, yields the best performance among small LMs and is comparable to that of a GPT-4-based VP detector. These findings indicate that incorporating human expert knowledge into the prompt is more effective than using the CoT technique for small LMs in VP detection.

View on arXiv
@article{sim2025_2506.06180,
  title={ Detecting Voice Phishing with Precision: Fine-Tuning Small Language Models },
  author={ Ju Yong Sim and Seong Hwan Kim },
  journal={arXiv preprint arXiv:2506.06180},
  year={ 2025 }
}
Comments on this paper