echoing-carpenter-92090
12/15/2023, 3:04 PM
echoing-carpenter-92090
12/15/2023, 3:06 PM
plugins:
  k8s:
    resource-tolerations:
      - nvidia.com/gpu:
          - key: "key1"
            operator: "Equal"
            value: "value1"
            effect: "NoSchedule"
echoing-carpenter-92090
12/15/2023, 3:55 PM
plugins:
  k8s:
    resource-tolerations:
      - nvidia.com/gpu:
          - key: "gpu"
            operator: "Equal"
            value: "true"
            effect: "NoSchedule"
    inject-finalizer: true
My nodegroup taints:
nvidia.com/gpu true NoSchedule
gpu true NoSchedule
Error in console:
12/15/2023 3:53:22 PM UTC task submitted to K8s
12/15/2023 3:53:22 PM UTC Unschedulable:0/5 nodes are available: 1 Insufficient memory, 1 node(s) had untolerated taint {dedicated: flyte}, 2 Insufficient cpu, 4 Insufficient nvidia.com/gpu. preemption: 0/5 nodes are available: 1 Preemption is not helpful for scheduling, 4 No preemption victims found for incoming pod..
tall-lock-23197
average-finland-92144
12/18/2023, 12:59 PM
For the key, you can use "nvidia.com/gpu", which, BTW, I think is the resource type exposed by the NVIDIA driver, so the key is restrictive here.
Also, is that new node also tainted with dedicated:flyte?
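For reference, a rough sketch of what that suggestion might look like in the config. It assumes the GPU node group really carries both taints listed earlier in the thread (nvidia.com/gpu=true:NoSchedule and gpu=true:NoSchedule) and that the nesting matches the snippets posted above; the original indentation was lost, so treat the layout as an assumption and not a verified config.

```yaml
plugins:
  k8s:
    resource-tolerations:
      - nvidia.com/gpu:
          # Assumed to match the nvidia.com/gpu=true:NoSchedule taint on the node group.
          - key: "nvidia.com/gpu"
            operator: "Equal"
            value: "true"
            effect: "NoSchedule"
          # Assumed to match the gpu=true:NoSchedule taint (same entry as in the config posted above).
          - key: "gpu"
            operator: "Equal"
            value: "true"
            effect: "NoSchedule"
```

A pod has to tolerate every NoSchedule taint on a node to land there, so if the GPU nodes are also tainted with dedicated:flyte, that taint would need its own toleration too.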