There are 0 repository under apache-spark-cluster topic.
A guide on how to set up Jupyter with Pyspark painlessly on AWS EC2 clusters, with S3 I/O support
This project has customization likes custom data sources, plugins written for the distributed systems like Apache Spark, Apache Ignite etc
This package contains the code for calculating external clustering validity indices in Spark. The package includes Chi Index among others.
Implementations of Markov Clustrer Algorithm (MCL) and Regularized Markov Cluster Algorithm (R-MCL) in Apache Spark
Analysis performed on data from the Steam platform using Apache Spark and Cloud services such as Amazon Web Services.
data enginerring project - visualize visa numbers by country, time issued from japan