A scalable K-Means clustering system with MapReduce paradigm and gRPC communication. This project showcases the seamless coordination between masters, mappers, and reducers to enable efficient data analysis, fostering cluster discovery and insights.
- Clone the repo
git clone https://github.com/aryanGupta-09/MapReduce.git
- Go to the project directory
cd MapReduce
- Generate the Python code for gRPC
python -m grpc_tools.protoc -I. --python_out=. --grpc_python_out=. kmeans.proto
-
Update Input/points.txt
-
Run the Python files