bootstrap script for Alluxio on EMR
Inspire by https://github.com/kernel164/emr-bootstrap-alluxio but with Alluxio 1.8.1 on EMR 5.17
Blog post about how to setup Alluxio on EMR: Alluxio on EMR: fast storage access and sharing for Spark jobs
EMR cluster configurations
[
{
"Classification": "core-site",
"Properties": {
"fs.alluxio.impl": "alluxio.hadoop.FileSystem",
"fs.AbstractFileSystem.alluxio.impl": "alluxio.hadoop.AlluxioFileSystem"
}
},
{
"Classification": "spark-defaults",
"Properties": {
"spark.driver.extraClassPath": ":/usr/lib/hadoop-lzo/lib/*:/usr/lib/hadoop/hadoop-aws.jar:/usr/share/aws/aws-java-sdk/*:/usr/share/aws/emr/emrfs/conf:/usr/share/aws/emr/emrfs/lib/*:/usr/share/aws/emr/emrfs/auxlib/*:/usr/share/aws/emr/security/conf:/usr/share/aws/emr/security/lib/*:/opt/alluxio-1.8.1-hadoop-2.8/client/alluxio-1.8.1-client.jar",
"spark.executor.extraClassPath": ":/opt/alluxio-1.8.1-hadoop-2.8/client/alluxio-1.8.1-client.jar"
}
}
]