This is a group project for university assingment where we create discord bot to detect spam
all data are in data folder
For training we used data from UCI's SMS Spam Collection Data Set which we have put in this repo here
for testing the model against real discord messages, we have created our own custom dataset here
the ML model is created from file ./RMCS_NLP_PROJECT.ipynb
which we modify to fit our need from here
the model is exported into ./nb_model.joblib
and ./vect.joblib
Clone this repo, go to the directory, and run
pip install -r ./requirements.txt
this website will tell you how to create your own discord bot account and get the token
python bot.py <YOUR DISCORD BOT TOKEN>
(default is naive bayes)
python bot.py <YOUR DISCORD BOT TOKEN> --model naiveBayes
python bot.py <YOUR DISCORD BOT TOKEN> --model randomForest