spam-random-forest
Spam Detection using Random Forest Machine Learning Algorithm
Before you start
- Install Python (google it for your OS... I'm using Linux Mint)
- Install Miniconda, it's lighter version of Anaconda, a library used for machine learning in python. Anaconda consists of packages like numpy, scipy, pandas and many others. But in miniconda we have to install packages which we need (installing packages in miniconda is simple af, I have described it below)
- Head over to package list and check out the name of package you want to install.
- Type
conda install ****
to install the package **** - For example if you want to install numpy, scipy and sklearn then type in your command line
$ conda install numpy
$ conda install scipy
$ conda install scikit-learn
one by one and you are good to go :)
Run this code using:-
$ python spamRF.py
english.txt and english_big.txt are spam dataset. Using these two datasets and spam.py, I've made a new dataset spam.csv which consists of 9 columns.
spamRF.py is used to apply machine learning on spam.csv.