handsome-airline-36833
01/23/2025, 9:14 AMFlyteEntityNotExistException: USER:EntityNotExist: error=None,
cause=<_InactiveRpcError of RPC that terminated with:
status = StatusCode.NOT_FOUND
details = "missing entity of type execution with identifier
project:"wf-test" domain:"production"
name:"b5e545e91107""
debug_error_string = "UNKNOWN:Error received from peer
{created_time:"2025-01-23T00:04:03.128079309+00:00", grpc_status:5,
grpc_message:"missing entity of type execution with identifier
project:\"wf-test\" domain:\"production\"
name:\"b5e545e91107\""}"
when the entity definitely exists in the UI, and retrying fixes the issue. Any idea what might cause this flakiness?handsome-airline-36833
01/23/2025, 9:15 AMaverage-finland-92144
01/23/2025, 3:30 PMhandsome-airline-36833
01/23/2025, 3:42 PMhandsome-airline-36833
01/23/2025, 3:43 PMhandsome-airline-36833
01/23/2025, 3:46 PMmsg: "Failed to update workflow. Error [Put "<https://10.3.80.1:443/apis/flyte.lyft.com/v1alpha1/namespaces/production/flyteworkflows/b5e545e91107?timeout=30s>": unexpected EOF]"
average-finland-92144
01/23/2025, 4:31 PMkube-apiserver
because seems to be some latency there as propeller eventually fails to update the CRD.
The timeout mentioned in the error is controlled with this setting: https://docs.flyte.org/en/latest/deployment/configuration/generated/flytepropeller_config.html#kube-client-config-config-kubeclientconfig
You could increase timeout but again, understanding better what's happening in the API server could help.handsome-airline-36833
01/23/2025, 4:33 PMaverage-finland-92144
01/23/2025, 4:34 PMworkqueue_depth
average-finland-92144
01/23/2025, 4:34 PMhandsome-airline-36833
01/23/2025, 4:35 PM