Apache Beam K-means: `Big Data Class Project`

Introduction 🌟

Explore and implement K-means clustering in sequential, streaming, and distributed modes using Apache Beam.

Check the insights in ./notebooks. Problem statement in ./docs.

Access the notebook directly on Colab.

Architecture detailed using the Makefile.

Implementing K-means clustering in sequential, streaming, and distributed formats using Apache Beam.

Language:Jupyter Notebook 98.8%Language:Makefile 1.2%