Frank Shen02/22/2023, 7:14 PM
David Espejo (he/him)02/22/2023, 7:39 PM
Frank Shen02/22/2023, 8:04 PM
David Espejo (he/him)02/22/2023, 9:00 PM
Will clicking the rerun button resume the workflow or just rerun the failed task?cc @Eduardo Apolinario (eapolinario) / @Dan Rammer (hamersaw) will this require the use of Caching?
Dan Rammer (hamersaw)02/22/2023, 9:24 PM
and input values will use the same cache. The "rerun" button the task should only execute the individual task, and not resume the workflow. I should make sure you're aware of the "recover" functionality as well, although it may not directly apply here (ie. task updates). Recovery means that Flyte relaunches the workflow, but all of the nodes that were previously completed will be re-used. So if you have a workflow that had 3 tasks, where the first and second succeeded and the third failed due to pods being OOM killed or something. You could recover the workflow and Flyte would recover task executions 1 and 2 before reattempting the 3rd task execution. So this is basically what you want, but it will use the version of the task associated with the initial workflow rather than the updated one.
Frank Shen02/22/2023, 9:45 PM
David Espejo (he/him)02/22/2023, 9:51 PM
Frank Shen02/22/2023, 10:01 PM
David Espejo (he/him)02/22/2023, 10:15 PM
Felix Ruess02/23/2023, 8:10 AM
This is a bit confusing in the UI...Would you let us know why it isn't clear?
If we could also manually abort specific tasks and then recover the whole workflow, that would be exactly what my colleagues are asking for.You can terminate a workflow, not the tasks within it. @Yee / @Eduardo Apolinario (eapolinario), is this something we want to support?
Felix Ruess02/27/2023, 10:21 AM