Decoding the Secrets of Machine Learning in Windows Malware Classification: A Deep Dive into Datasets, Features, and Model Performance

This repository hosts the feature importance scores of the following experiments:

Binary detection

Static

All samples
Packed only
No packed

Dynamic

All samples

Family classification

Static

All samples
Packed only
No packed

Dynamic

All samples

Citation

If you use any of the contents, please cite it as:

@misc{dambra2023decoding,
      title={Decoding the Secrets of Machine Learning in Malware Classification: A Deep Dive into Datasets, Feature Extraction, and Model Performance}, 
      author={Savino Dambra and Yufei Han and Simone Aonzo and Platon Kotzias and Antonino Vitale and Juan Caballero and Davide Balzarotti and Leyla Bilge},
      year={2023},
      eprint={2307.14657},
      archivePrefix={arXiv},
      primaryClass={cs.CR}
}

eurecom-s3 / DecodingMLSecretsOfWindowsMalwareClassification

Decoding the Secrets of Machine Learning in Windows Malware Classification: A Deep Dive into Datasets, Features, and Model Performance

Citation

About