Yes, there are practical limits on the amount of data a task or workflow can process in Flyte. These limits are primarily due to constraints in the underlying Kubernetes infrastructure and Flyte's use of etcd for state storage.
1.
etcd Storage Limit: Flyte uses etcd to store workflow execution state, which has a limit of 1.5MB for object sizes. This can restrict the size of workflows and the amount of metadata that can be stored (
source).
2.
FlyteCopilot Download Limit: There is a limit on the size of data that FlyteCopilot can download, which was previously set to 2MB but can be configured to be unlimited (
source).
Would you like more detail on any specific aspect?
Sources:
-
https://discuss.flyte.org/t/16094488/lets-say-you-have-a-task-that-you-need-to-run-100-000-times-/
-
https://github.com/flyteorg/flyte/issues/1251 👍 0 button 👎 0 button 💡
Hint: Mention @runllm in the thread for followups.