Frank Shen
02/22/2023, 7:14 PMDavid Espejo (he/him)
02/22/2023, 7:39 PMFrank Shen
02/22/2023, 8:04 PMDavid Espejo (he/him)
02/22/2023, 9:00 PMWill clicking the rerun button resume the workflow or just rerun the failed task?cc @Eduardo Apolinario (eapolinario) / @Dan Rammer (hamersaw) will this require the use of Caching?
Dan Rammer (hamersaw)
02/22/2023, 9:24 PMcache_version
and input values will use the same cache.
The "rerun" button the task should only execute the individual task, and not resume the workflow.
I should make sure you're aware of the "recover" functionality as well, although it may not directly apply here (ie. task updates). Recovery means that Flyte relaunches the workflow, but all of the nodes that were previously completed will be re-used. So if you have a workflow that had 3 tasks, where the first and second succeeded and the third failed due to pods being OOM killed or something. You could recover the workflow and Flyte would recover task executions 1 and 2 before reattempting the 3rd task execution. So this is basically what you want, but it will use the version of the task associated with the initial workflow rather than the updated one.Frank Shen
02/22/2023, 9:45 PMDavid Espejo (he/him)
02/22/2023, 9:51 PMFrank Shen
02/22/2023, 10:01 PMDavid Espejo (he/him)
02/22/2023, 10:15 PMFelix Ruess
02/23/2023, 8:10 AMSamhita Alla
This is a bit confusing in the UI...Would you let us know why it isn't clear?
If we could also manually abort specific tasks and then recover the whole workflow, that would be exactly what my colleagues are asking for.You can terminate a workflow, not the tasks within it. @Yee / @Eduardo Apolinario (eapolinario), is this something we want to support?
Felix Ruess
02/27/2023, 10:21 AM