QuantumBoy-729 / Big-Data-Project

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

5th Semester Big-Data-Project

Project Title :- Spam Detection Using Machine Learning and Spark Team members :

  1. Rushab Prakash
  2. Divyansh Raina
  3. Ootung
  4. Kushal

Running the files:

-> To run the streaming file use:python3 stream.py -f spam -b 100

-> To run the streamer file use: python3 streamer.py or /opt/spark/bin/spark-submit streamer.py

Models: the models we have used are

1)SGDclassifier

2)Logistic Regression

3)Random Forest classifier

4)K-Means Clustering

About


Languages

Language:Python 100.0%