Explore and implement K-means clustering in sequential, streaming, and distributed modes using Apache Beam.
Understanding K-means
Sequential K-Means in Python
: Crafting a Python-based model.Streaming K-means in Python
: Tailoring for dynamic data.Apache Beam for Scalability
: Large dataset processing.
Check the insights in ./notebooks
. Problem statement in ./docs
.
Access the notebook directly on Colab.
Architecture detailed using the Makefile
.
- π LinkedIn.