Terminou / Obfuscated-Program-Dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Obfuscated Program Dataset

Why do we need a dataset with obfuscated and not obfuscated programs?

Software obfuscation is used to transform code and make it more challenging to reverse engineer. Since an obfuscated program does not lose its utility, malicious codes can be obfuscated in order to prevent the detection by the attacker’s target. Detecting obfuscated programs is crucial for some applications. With the help of this dataset, learning models can be created.

How can I contribute to dataset?

No matter what programming language (Python, Java, C, Javascript, C#...) you use and no matter what obfuscation tool/technique (Obfuscar, DashO, Dotfuscator etc.) you use. Just pull request your obfuscated and not obfuscated programs into repository. Remember, diversity is always better.

Current Dataset

Obfuscated Not Obfuscated
Number of programs 45 35

About


Languages

Language:Python 100.0%