@FlyteCoreTeam Having another issue when running o...
# hacktoberfest-2022
r
@FlyteCoreTeam Having another issue when running one of the tasks for the nlp PR. https://github.com/flyteorg/flytesnacks/pull/911 on demo cluster with custom built image I am getting this error for some of the downstream tasks which take the serialised flytefile (
.model
) as input:
Pod failed. No message received from kubernetes.
[fb8f63b27ec284bbfbc8-n3-0] terminated with exit code (137). Reason [Error]. Message:
`tar: Removing leading
/' from member names
See screenshot below. The error seems like its trying to extract something from tar archive but Im not doing this anywhere in my task logic ? Also, think someone else has reported this error as well https://flyte-org.slack.com/archives/CP2HDHKE1/p1664441568396559?thread_ts=1664433734.963769&cid=CP2HDHKE1
e
Error 137 usually means OOM. Can you bump the resources in that task?
r
@Eduardo Apolinario (eapolinario) thanks a lot, that fixed it ! i bumped the resource mem from 200 to 600Mi 👍
n
I wonder if we can collect these kinds of errors, would be great if we can figure out how to make these k8s-level errors a little more actionable, like suggesting to bump resources of the task.
155 Views