/
Configuring the Spark Runtime via Helm

Configuring the Spark Runtime via Helm

Place spark/hadoop config files in moc/ subdirectory

After expanding the chart, find the moc directory, and inside it, create a new subdirectory. This directory can be called anything.

cd moc mkdir my-hadoop-conf

Copy into this directory all your spark and hadoop configuration files.

Create a configmap based on the contents of this directory

Update your helm override file to include the following:

moc: configMaps: hadoop-config: my-hadoop-conf/*

When helm install or helm upgrade is run, a configmap named hadoop-config will be created that contains the contents of the my-hadoop-conf directory. Note that the supplied directory name is relative to the moc/ directory, and helm cannot create configmaps based on directories outside the moc/ directory.

Configure the spark runtime to use these files

Update your helm override file to include the following:

moc: sparkRuntimes: "alpha": env: HADOOP_CONF_DIR: /modelop/hadoop/conf configMaps: hadoop-config: mountPath: /modelop/hadoop/conf

The above will configure spark-runtime-alpha with the files from the moc/my-hadoop-conf directory, via the hadoop-config configmap. The files will be mounted inside the pod at /modelop/hadoop/conf, and the HADOOP_CONF_DIR env variable will direct the spark runtime to look there for the files.

Configure SPARK_HOME

By default, the SPARK_HOME env variable is set to /modelop/bin/spark-2.4.4-bin-hadoop2.6. If a different version of spark/hadoop is installed onto the image, be sure to set the SPARK_HOME env variable in your helm override file: