Leveraging Adversarial Samples for Enhanced Classification of Malicious and Evasive PDF Files

dc.contributor.authorTrad, Fouad
dc.contributor.authorHussein, Ali
dc.contributor.authorChehab, Ali
dc.contributor.departmentDepartment of Electrical and Computer Engineering
dc.contributor.facultyMaroun Semaan Faculty of Engineering and Architecture (MSFEA)
dc.contributor.institutionAmerican University of Beirut
dc.date.accessioned2025-01-24T11:31:08Z
dc.date.available2025-01-24T11:31:08Z
dc.date.issued2023
dc.description.abstractThe Portable Document Format (PDF) is considered one of the most popular formats due to its flexibility and portability across platforms. Although people have used machine learning techniques to detect malware in PDF files, the problem with these models is their weak resistance against evasion attacks, which constitutes a major security threat. The goal of this study is to introduce three machine learning-based systems that enhance malware detection in the presence of evasion attacks by substantially relying on evasive data to train malware and evasion detection models. To evaluate the robustness of the proposed systems, we used two testing datasets, a real dataset containing around 100,000 PDF samples and an evasive dataset containing 500,000 samples that we generated. We compared the results of the proposed systems to a baseline model that was not adversarially trained. When tested against the evasive dataset, the proposed systems provided an increase of around 80% in the f1-score compared to the baseline. This proves the value of the proposed approaches towards the ability to deal with evasive attacks. © 2023 by the authors.
dc.identifier.doihttps://doi.org/10.3390/app13063472
dc.identifier.eid2-s2.0-85152038546
dc.identifier.urihttp://hdl.handle.net/10938/27532
dc.language.isoen
dc.publisherMDPI
dc.relation.ispartofApplied Sciences (Switzerland)
dc.sourceScopus
dc.subjectAdversarial training
dc.subjectEvasion attacks
dc.subjectEvasion detection
dc.subjectEvasive data generation
dc.subjectMachine learning
dc.subjectMalicious documents
dc.subjectModel robustness
dc.subjectPdf malware detection
dc.titleLeveraging Adversarial Samples for Enhanced Classification of Malicious and Evasive PDF Files
dc.typeArticle

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
2023-810.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format