malicious-URL-recognition-demo

A demo of malicious URL recognition based on logistic regression

Feature Extraction.py:

Extract features from the raw data. The features include: length of URL, length of primary domain, number of dots, contain IP, average word length, longest word length, the number of special character.

Logistic Regression.py:

Implement logistic regression with python. Using the stochastic gradient ascent method to make the data quickly converge to the actual values.

Logistic Regression-statsmodels.py:

Use statsmodels(https://www.statsmodels.org/) to implement logistic regression.

40.model.cs:

Results after using the logistic regression method and adjusting the parameters, based on Microsoft Azure Machine Learning Studio(https://studio.azureml.net/)

25.model.cs:

Results after using the decision tree, based on Microsoft Azure Machine Learning Studio

ZHAN-MQ / malicious-URL-recognition-demo