aryanGupta-09 / Kmeans-using-MapReduce

K-means clustering algorithm using MapReduce.

Kmeans-using-MapReduce

A scalable K-means clustering system built on the MapReduce paradigm, with gRPC for communication between processes. The project shows how master, mapper, and reducer processes coordinate to cluster a dataset: mappers assign points to centroids, and reducers recompute the centroids.
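To make the division of work concrete, here is a minimal single-process sketch of K-means written as map and reduce steps. It is not the repository's code: there is no gRPC, no partitioning across workers, and the function names (`map_points`, `reduce_clusters`, `kmeans`) and parameters (`max_iters`, `tol`) are hypothetical, chosen only for illustration.

```python
from collections import defaultdict
from math import dist  # Euclidean distance, Python 3.8+


def map_points(points, centroids):
    """Map step: emit (index of nearest centroid, point) for every point."""
    for p in points:
        idx = min(range(len(centroids)), key=lambda i: dist(p, centroids[i]))
        yield idx, p


def reduce_clusters(pairs, centroids):
    """Reduce step: average the points grouped under each centroid index.

    A centroid that received no points keeps its previous position.
    """
    groups = defaultdict(list)
    for idx, p in pairs:
        groups[idx].append(p)
    updated = list(centroids)
    for idx, pts in groups.items():
        updated[idx] = tuple(sum(coord) / len(pts) for coord in zip(*pts))
    return updated


def kmeans(points, centroids, max_iters=20, tol=1e-6):
    """Repeat map + reduce until no centroid moves more than tol (assumed stopping rule)."""
    for _ in range(max_iters):
        updated = reduce_clusters(map_points(points, centroids), centroids)
        if all(dist(a, b) <= tol for a, b in zip(updated, centroids)):
            return updated
        centroids = updated
    return centroids


if __name__ == "__main__":
    sample = [(0, 0), (0, 2), (10, 10), (10, 12)]
    print(kmeans(sample, [(0, 0), (10, 10)]))
```

In a distributed version, each mapper would presumably run the map step on its own slice of the input and each reducer would handle a subset of centroid indices, with the master driving the iterations. How this project actually splits that work is defined in its source and `kmeans.proto`.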

Tech Stack

Python, Protobuf, gRPC

Installation

  1. Clone the repo
     git clone https://github.com/aryanGupta-09/MapReduce.git
  2. Go to the project directory
     cd MapReduce
  3. Generate the Python code for gRPC (this writes kmeans_pb2.py and kmeans_pb2_grpc.py)
     python -m grpc_tools.protoc -I. --python_out=. --grpc_python_out=. kmeans.proto
  4. Update Input/points.txt with the points to cluster
  5. Run the Python files