PhishBENCH: A benchmark dataset for legal tasks related to phishing attacks

TitlePhishBENCH: A benchmark dataset for legal tasks related to phishing attacks
Publication TypeConference Proceedings
Year of Conference2025
AuthorsKatoikos, I, Kosmopoulos, D, Iglezakis, I, Kiortsi, P, Thomopoulos, G, Fidas, C
Conference NameInternational Conference on Artificial Intelligence Tools
Keywordsbenchmark, large language model, phishing law, retrieval augmented generation
Abstract

Phishing attacks constitute a significant cyber-threat, causing financial losses and undermining digital trust. Despite advancements in awareness campaigns and regulatory efforts, many individuals and organizations struggle to understand their legal rights and obligations when victimized by phishing. This paper presents a benchmark framework designed to support researchers in the analysis of legal texts and cases, with the goal of helping users navigate the complex legal landscape surrounding phishing attacks. We evaluate BERT-based models, large language models, and retrieval-augmented generation using a curated knowledge base of phishing laws, regulations, and common scenarios. We provide baseline evaluations across multiple task categories, including true-false questions, phishing attack cases, penal code analysis, legal cases with judicial decisions, and question-answering tasks, all within the Greek legal jurisdiction.