elegant-toddler-67101
07/26/2023, 1:37 PM
flyte-core helm chart on GKE, we have a node pool with taints.
What are we missing?
@task(
    container_image="{{.image.indeed.fqn}}:{{.image.indeed.version}}",
    requests=Resources(cpu="1", mem="2Gi", gpu="1"),
    limits=Resources(cpu="1", mem="3Gi")
)
and this configuration in the flyte-core chart values:
k8s:
  plugins:
    k8s:
      gpu-resource-name: nvidia.com/gpu
      resource-tolerations:
        - nvidia.com/gpu:
          - key: "nvidia.com/gpu"
            operator: "Equal"
            value: "present"
            effect: "NoSchedule"
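For reference, a minimal sketch of how Kubernetes decides whether a toleration like the one above covers a node taint. It assumes the node pool carries the taint `nvidia.com/gpu=present:NoSchedule`, which GKE normally adds to GPU node pools; the actual taints on this cluster are not shown in the thread. The `Taint` and `Toleration` classes and the `tolerates` helper are hypothetical names for illustration, not part of Flyte or any Kubernetes client.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Taint:
    key: str
    value: str
    effect: str


@dataclass(frozen=True)
class Toleration:
    key: str = ""
    operator: str = "Equal"
    value: str = ""
    effect: str = ""


def tolerates(toleration: Toleration, taint: Taint) -> bool:
    """Return True if the toleration matches the taint (Kubernetes matching rules)."""
    # An empty effect on the toleration matches every taint effect.
    if toleration.effect and toleration.effect != taint.effect:
        return False
    # An empty key on the toleration matches every taint key.
    if toleration.key and toleration.key != taint.key:
        return False
    # "Exists" ignores the value; "Equal" (the default) compares it.
    if toleration.operator == "Exists":
        return True
    if toleration.operator in ("", "Equal"):
        return toleration.value == taint.value
    return False


# Assumed taint on the GPU node pool (not confirmed in the thread).
gpu_taint = Taint(key="nvidia.com/gpu", value="present", effect="NoSchedule")

# Toleration taken from the chart values above.
configured = Toleration(
    key="nvidia.com/gpu", operator="Equal", value="present", effect="NoSchedule"
)

print(tolerates(configured, gpu_taint))
```

If the node pool has additional custom taints, each one needs its own matching toleration; comparing the output of `kubectl describe node` against the tolerations on the task pod shows which one is missing.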
freezing-boots-56761
elegant-toddler-67101
07/26/2023, 1:54 PM
freezing-boots-56761
freezing-boots-56761
apiVersion: v1
kind: Pod
metadata:
  name: nvidia-smi
  namespace: default
spec:
  restartPolicy: OnFailure
  containers:
    - name: nvidia-smi
      image: nvidia/cuda:12.2.0-runtime-ubuntu20.04
      args:
        - "nvidia-smi"
      resources:
        limits:
          nvidia.com/gpu: 1
elegant-toddler-67101
07/31/2023, 7:27 AM
freezing-boots-56761
elegant-toddler-67101
08/07/2023, 2:46 PM