Has anyone else had issues with errors about pods ...
# flyte-support
c
Has anyone else had issues with errors about pods being deleted externally?
Copy code
[3/3] currentAttempt done. Last Error: SYSTEM::object [nerf-test-production/f3hmfc16u3esce-n0-0-n0-n0-0-n1-3] terminated in the background, manually
Its a bit difficult to know exactly what is going on but I don't think these pods are being deleted. Personally I think they are never actually getting created. This problem seems to happen a lot more when our k8s cluster is under high load. Tighter rate limiting in
kube-client-config
seems to help but I'm reluctant to reduce it further. Does anyone have any suggestions?
a
@calm-pilot-2010 are you using
inject-finalizer
?
c
I don't see it in our config. Is it enabled by default?
a
Is it enabled by default?
I don't think so. You can set it under
configmap.k8s.plugins.k8s
inject-finalizer: true
c
Thanks. Would you mind explaining what it actually does?
a
It should inject a K8s finalizer at startTask. The finalizer will wait until the Task is complete to delete the Pod. It seems this is only a need when the K8s API server is under high load
c
Thanks I will give it a try.
a
sure, it's not a silver bullet but let's try 🙂
c
I think it solved the problem. Thanks for the help 🙂
gratitude thank you 1
🙇🏽 1