It uses Text Extraction Feature like TF-IDF Vectorizer and simple python code, to classify the messages as spam or ham (normal).
To minimise spam and phising attacks and any kind of digital frauds.
- Python3
- Pandas
- Scikit-Learn
-
Python3 can be installed from their official site https://www.python.org/ . Or you can use anaconda environment.
-
Pandas can be installed by
conda install -c anaconda pandas
or
pip3 install pandas
- Scikit-Learn can be installed by
conda install -c anaconda scikit-learn
or
pip3 install -U scikit-learn
- No known bugs.
- The dataset contains more ham messages as compared to spam, this can cause few False-Positives or Type1 error.
- This project will be improved in few days by using Neural Networks and other efficient algorithms.