victorious-jelly-28729
03/18/2025, 3:49 AMjolly-florist-66728
03/18/2025, 3:49 AMvictorious-jelly-28729
03/18/2025, 3:51 AMjolly-florist-66728
03/18/2025, 3:51 AMuseOffloadedWorkflowClosure
flag in your Flyte configuration. This feature offloads the workflow's static information from the Custom Resource Definition (CRD) to a blob store, reducing the size of the data stored in etcd.
configmap:
adminServer:
flyteadmin:
useOffloadedWorkflowClosure: true
This configuration should be applied to new workflows, as it does not affect already running workflows. You can find more details in the Flyte documentation.
2. Optimize Workflow Design: Consider restructuring your workflow to reduce its size. This can involve breaking down large workflows into smaller sub-workflows or using sub-launch plans. By doing so, each sub-workflow or launch plan is treated as a separate entity, which can help manage the overall size of the workflow state.
3. Monitor Workflow Size: Regularly monitor the size of your workflows and the data they produce. This can help you identify potential issues before they cause failures. You can use Flyte's monitoring tools and metrics to keep track of workflow sizes and execution states.
4. Handle Large Outputs: If your tasks produce large outputs, consider storing these outputs in an external storage system and only keeping references to them in the workflow state. This can significantly reduce the size of the data stored in etcd.
5. Increase etcd Limits: As a last resort, you can consider increasing the etcd size limits, but this is generally not recommended due to potential performance impacts. It's better to optimize the workflow design and use offloading features.
By implementing these strategies, you can manage large workflows more effectively and prevent them from getting stuck in a running state due to size limitations.
Sources:
- Flyte issue on large workflows 👍 0 button 👎 0 button 💡 Hint: Mention @runllm in the thread for followups.average-finland-92144
03/18/2025, 3:34 PMvictorious-jelly-28729
03/19/2025, 2:16 AM