There are 0 repository under virtual-clusters topic.
Sorting of large dataset files(80GB) using Hadoop(Mapreduce) techniques and Apache Spark in Java and scheduled job on the virtual cluster(using 4 nodes) using a SLURM scheduler with bash scripting
Recipes for setting up a local virtual cluster using specific scheduling engines.
Creates psuedo hdfs and spark cluster