Is there a way I can modify default task tolerations in the Flyte #flytekit

Is there a way I can modify default task toleratio...

late-pencil-3873

02/07/2022, 7:05 PM

Is there a way I can modify default task tolerations in the helm charts values file? https://github.com/flyteorg/flyte/blob/master/charts/flyte/values.yaml

thankful-minister-83577

02/07/2022, 7:44 PM

what’s the use-case? gpus?

thankful-minister-83577

02/07/2022, 7:52 PM

more specifically, are you looking to add tolerations to everything? or only to certain task types?

thankful-minister-83577

02/07/2022, 7:58 PM

for everything: add this https://github.com/flyteorg/flyteplugins/blob/6b72fd6b8d0f172bd64d967f13ec4cade51dba7e/go/tasks/pluginmachinery/flytek8s/config/config.go#L88

thankful-minister-83577

02/07/2022, 7:58 PM

by resource add this https://github.com/flyteorg/flyteplugins/blob/6b72fd6b8d0f172bd64d967f13ec4cade51dba7e/go/tasks/pluginmachinery/flytek8s/config/config.go#L117

thankful-minister-83577

02/07/2022, 7:58 PM

https://github.com/flyteorg/flyteplugins/blob/6b72fd6b8d0f172bd64d967f13ec4cade51dba7e/go/tasks/pluginmachinery/flytek8s/config/config.go#L133 may also be relevant

freezing-airport-6809

02/07/2022, 10:17 PM

Also for every container you can add default tolerations

freezing-airport-6809

02/07/2022, 10:17 PM

We are working on adding a podTemplate (eventually per namespace), that will be used as the default template for all container tasks

late-pencil-3873

02/08/2022, 10:37 AM

My use case: we share a cluster with some other workloads (an mlflow tracking server instance, a webapp that uses the deployed models, …) and I don’t want these pods to use the expensive machines when no training is occurring, thus the taint. It is ok that all flyte pods by default have the toleration. With the provided links I was able to configure a default-toleation. Thanks for the guidance and really nice that this can be customized in such a fine-grained way :)

late-pencil-3873

02/08/2022, 3:38 PM

Is there a way to specify tolerations also for Spark tasks?

thankful-minister-83577

02/08/2022, 10:34 PM

@gentle-kite-33075 do you know this by chance? tolerations for spark tasks?

freezing-airport-6809

02/08/2022, 11:17 PM

@late-pencil-3873 there is a way to add tolerations for spark - https://github.com/GoogleCloudPlatform/spark-on-k8s-operator/blob/master/pkg/apis/sparkoperator.k8s.io/v1beta2/types.go#L440

freezing-airport-6809

02/08/2022, 11:18 PM

but the spark plugin does not support that today. I think the spark plugin needs an upgrade. IMO it should be something like this

Copy code

@task(
config=Spark(), pod_template=Pod(),....)
def foo():
  ...

late-pencil-3873

02/10/2022, 11:29 AM

Actually setting this up in the spark helm chart itself is absolutely valid I think. Deploying the spark helm chart with a tolerations value fails atm but I will debug why.

late-pencil-3873

02/10/2022, 11:29 AM

Thanks for the hints!

late-pencil-3873

03/14/2022, 4:07 PM

@worried-farmer-76794

worried-farmer-76794

03/14/2022, 4:18 PM

Thanks for the update. I think the

tolerations

value of spark-operator helm chart doesn't work out of box. I will try to configure that one more time tomorrow.

late-pencil-3873

04/10/2022, 2:41 PM

@acceptable-policeman-57188

late-pencil-3873

04/10/2022, 2:41 PM

(Thread I mentioned in my dm.)

👀 1

183 Views

Open in Slack

Previous Next