PackHero: A Scalable Graph-based Approach for Efficient Packer Identification

31 May 2025

Main:18 Pages

6 Figures

Bibliography:3 Pages

6 Tables

Abstract

Anti-analysis techniques, particularly packing, challenge malware analysts, making packer identification fundamental. Existing packer identifiers have significant limitations: signature-based methods lack flexibility and struggle against dynamic evasion, while Machine Learning approaches require extensive training data, limiting scalability and adaptability. Consequently, achieving accurate and adaptable packer identification remains an open problem. This paper presents PackHero, a scalable and efficient methodology for identifying packers using a novel static approach. PackHero employs a Graph Matching Network and clustering to match and group Call Graphs from programs packed with known packers. We evaluate our approach on a public dataset of malware and benign samples packed with various packers, demonstrating its effectiveness and scalability across varying sample sizes. PackHero achieves a macro-average F1-score of 93.7% with just 10 samples per packer, improving to 98.3% with 100 samples. Notably, PackHero requires fewer samples to achieve stable performance compared to other Machine Learning-based tools. Overall, PackHero matches the performance of State-of-the-art signature-based tools, outperforming them in handling Virtualization-based packers such as Themida/Winlicense, with a recall of 100%.

View on arXiv

@article{gennaro2025_2506.00659,
  title={ PackHero: A Scalable Graph-based Approach for Efficient Packer Identification },
  author={ Marco Di Gennaro and Mario DÓnghia and Mario Polino and Stefano Zanero and Michele Carminati },
  journal={arXiv preprint arXiv:2506.00659},
  year={ 2025 }
}

Comments on this paper