hallowed-camera-82098
12/13/2023, 9:08 PM
- key: cloud.google.com/gke-spot
  operator: DoesNotExist
- key: cloud.google.com/gke-spot
  operator: DoesNotExist
- key: cloud.google.com/gke-spot
  operator: In
  values:
  - "true"
hallowed-camera-82098
12/14/2023, 1:08 AM
tall-lock-23197
full-ram-17934
12/14/2023, 6:48 PM
full-ram-17934
12/14/2023, 6:49 PM
tall-lock-23197
full-ram-17934
12/15/2023, 5:50 PM
tall-lock-23197
average-finland-92144
12/18/2023, 12:32 PM
spark-config-default block you used in the Helm chart?
hallowed-camera-82098
12/18/2023, 2:24 PM
spark-config-default:
- spark.kubernetes.authenticate.driver.serviceAccountName: "flyte-worker"
- spark.hadoop.fs.gs.project: "{{ .Values.userSettings.googleProjectId }}"
- spark.hadoop.fs.gs.impl: "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFileSystem"
- spark.hadoop.fs.AbstractFileSystem.gs.impl: "com.google.cloud.hadoop.fs.gcs.GoogleHadoopFS"
- spark.hadoop.mapreduce.fileoutputcommitter.algorithm.version: "2"
- spark.kubernetes.allocation.batch.size: "50"
- spark.excludeOnFailure.enabled: "true"
- spark.excludeOnFailure.timeout: "5m"
- spark.task.maxFailures: "4"
hallowed-mouse-14616
12/18/2023, 2:33 PM
interruptible configuration.
hallowed-mouse-14616
12/18/2023, 2:36 PM
hallowed-camera-82098
12/18/2023, 2:42 PM
flat-area-42876
12/20/2023, 9:13 PM
hallowed-camera-82098
12/21/2023, 6:11 PM
full-ram-17934
12/21/2023, 7:48 PM
flat-area-42876
12/21/2023, 9:53 PM
flat-area-42876
12/21/2023, 10:23 PM
Affinity: config.GetK8sPluginConfig().DefaultAffinity,
should be
Affinity: config.GetK8sPluginConfig().DefaultAffinity.DeepCopy()
Without the copy, the pointer to the DefaultAffinity value gets set in the Spark driverSpec. When the resource spec later updates its node selector requirements, it writes through that pointer and alters the configured DefaultAffinity until propeller restarts.
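To illustrate the aliasing, here is a minimal sketch with made-up stand-in types. `Affinity`, its `DeepCopy`, and `buildDriverAffinity` are hypothetical simplifications, not the real Kubernetes affinity type or the actual plugin code. It only shows why handing out the shared pointer lets a later update leak into the default, and why copying first avoids that.

```go
package main

import "fmt"

// Affinity is a hypothetical, simplified stand-in for a pod affinity:
// just a list of node selector requirements written as strings.
type Affinity struct {
	Requirements []string
}

// DeepCopy returns an independent copy of the affinity, similar in spirit
// to the generated DeepCopy methods on Kubernetes API types.
func (a *Affinity) DeepCopy() *Affinity {
	if a == nil {
		return nil
	}
	out := &Affinity{Requirements: make([]string, len(a.Requirements))}
	copy(out.Requirements, a.Requirements)
	return out
}

// buildDriverAffinity is a hypothetical helper that mimics building a driver
// spec from the configured default and then adding a requirement to it.
// With copyFirst == false it reuses the shared pointer (the buggy pattern).
func buildDriverAffinity(def *Affinity, copyFirst bool) *Affinity {
	a := def
	if copyFirst {
		a = def.DeepCopy()
	}
	a.Requirements = append(a.Requirements, "cloud.google.com/gke-spot In true")
	return a
}

func main() {
	def := &Affinity{Requirements: []string{"cloud.google.com/gke-spot DoesNotExist"}}

	// Copying first: the default keeps its single requirement.
	buildDriverAffinity(def, true)
	fmt.Println(len(def.Requirements))

	// Sharing the pointer: the default itself picks up the extra requirement,
	// and every later task would inherit it until the process restarts.
	buildDriverAffinity(def, false)
	fmt.Println(len(def.Requirements))
}
```

In this sketch the first call leaves the default untouched, while the second call grows the default's own requirement list, which is the same shape of problem as the shared DefaultAffinity described above.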
The issue was handled with this PR.
hallowed-mouse-14616
12/22/2023, 2:34 PM