# ask-the-community
l
I have been pretty happy with this pattern around `map_task` using `dynamic`. Sharing here to see if anyone has any “gotchas” or additional ideas. I wrap `map_task` with a `dynamic` task for two reasons:
1. We can parameterize the concurrency arg, which is helpful for us when we have a big variety in the number of inputs between executions.
2. If caching at the task level, we can also cache at the dynamic level (I just recently discovered `dynamic` also supports caching). From what I can tell, this avoids the need to check the cache on every task in the `map_task` and instead checks the cache just once for the dynamic task. This can save a few minutes in some cases!
```python
from flytekit import map_task, task, workflow, dynamic


@task(cache=True, cache_version="0.0.1")
def build_inputs() -> list[int]:
    return list(range(10))


@task(cache=True, cache_version="0.0.1")
def example_task(x: int) -> int:
    return x


# Wrapping map_task in a dynamic task lets us (1) pass concurrency in as a
# runtime input and (2) cache at the dynamic level, so a re-run does one
# cache lookup instead of one per mapped task.
@dynamic(cache=True, cache_version="0.0.1")
def dynamic_map_task(xs: list[int], concurrency: int) -> list[int]:
    return map_task(example_task, concurrency=concurrency)(x=xs)


@workflow
def example_workflow(concurrency: int = 16) -> list[int]:
    inputs = build_inputs()
    return dynamic_map_task(xs=inputs, concurrency=concurrency)
```
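For anyone who wants to poke at this, it also runs locally by calling the workflow like a regular function (locally, I believe the map just executes in-process, so this is only useful as a sanity check):
```python
# Quick local sanity check of the pattern above; no cluster needed.
if __name__ == "__main__":
    print(example_workflow(concurrency=4))  # prints [0, 1, ..., 9]
```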
k
Dynamic does support caching
Your approach is indeed supported
Can you write a blog to educate others 😍
l
Where could I get started, and is there anything else worth adding?
k
No, just write up your experience and use case - where do you work? You can write it there or on Flyte.org, but preferably on your company’s blog. Use cases and education together are better.
l
I am at Pachama. I am not sure writing a blog like this is a priority for us, so I would probably do it outside of working hours. Maybe I’ll write a short Medium article and make sure to add use case examples and differences in cache-hit times.
k
Also happy to peer review / suggest edits. Cc @David Espejo (he/him) / @Sage Elliott
l
Sounds good! No promises, but I’ll put it on the docket!
h
Hi @Len Strnad, we tried your approach of wrapping `map_task` with `dynamic` in one of our long-running pipelines. It actually helped us save time by avoiding the need to read cached results from each task within a `map_task`. Thanks for sharing this! 🙏
k
Why were you reading cached results in a map task? Would love to understand the problem.
l
I think they mean they saved time in their workflow by checking one cache (the dynamic task) instead of checking each task in the map task individually. I could be wrong. That is why I often do it. I’ll do the same thing when using stubs from workflows in other projects that don’t have the most performant caching in sub-tasks.
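A minimal sketch of what I mean by the stub case; all of the project/domain/name/version values are placeholders for a task registered out of another project:
```python
from flytekit import dynamic, reference_task


# Placeholder coordinates for a task that lives in another project.
@reference_task(
    project="other-project",
    domain="production",
    name="pipelines.tasks.featurize",
    version="abc123",
)
def featurize(x: int) -> int:
    ...


# Caching the dynamic wrapper means one cache check for the whole fan-out
# instead of one check per underlying sub-task.
@dynamic(cache=True, cache_version="0.0.1")
def cached_featurize_all(xs: list[int]) -> list[int]:
    return [featurize(x=x) for x in xs]
```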
h
@Ketan (kumare3) We have a cron schedule associated with the launch plan, and we prefer not to rerun the map tasks on every run, hence we have enabled caching on the input task. We had a different problem with the map_tasks too: on re-runs they would get stuck initializing for a very long time, 1h+, and we couldn’t figure out why, as no pods were getting spawned and no logs were present. So an immediate solution for us was to avoid the attempt to rerun the map task and wrap it with dynamic, so that it only checks the cache at the dynamic level. @Len Strnad’s solution saved the time for our workflows.
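Roughly, our scheduling setup looks like this (names, schedule, and default inputs are illustrative placeholders, reusing `example_workflow` from Len’s snippet above):
```python
from flytekit import CronSchedule, LaunchPlan

# Hypothetical launch plan; our real one wraps the training pipeline.
nightly_lp = LaunchPlan.get_or_create(
    workflow=example_workflow,       # from the earlier snippet
    name="example_workflow_nightly",
    schedule=CronSchedule(schedule="0 2 * * *"),  # every day at 02:00
    default_inputs={"concurrency": 16},
)
```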
l
@Hema Jayachandran How many tasks, and what types of objects are being cached? Just curious.
h
Hey @Len Strnad, those are tensor tasks used by our model training. The number of tasks varies between the different entities that we run: the minimum is 40 and the maximum we have is 1600.
k
We can support caching at the map level itself.
l
Gotcha. For data like this we usually write out to Zarr, TFRecords, etc., and return the paths to the artifacts. We end up writing the files with a plugin that lets us scale up within the task. My most recent preference has been to use DataFrames to manage the list of training files and return that, since Flyte handles DataFrames nicely.
Retrieval of cached outputs is then much more lightweight, since it is basically a list of URLs. Something like the sketch below.
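Here `write_shard` and the bucket paths are made-up stand-ins for our real writer plugin:
```python
import pandas as pd
from flytekit import task


def write_shard(uri: str, value: int) -> None:
    # Stand-in for our real writer; the actual version writes tensors out
    # to object storage (Zarr, TFRecords, etc.) via a plugin that scales
    # up inside the task.
    ...


@task(cache=True, cache_version="0.0.1")
def build_training_files(xs: list[int]) -> pd.DataFrame:
    paths = []
    for i, x in enumerate(xs):
        uri = f"s3://my-bucket/training/shard-{i}.tfrecord"  # hypothetical
        write_shard(uri, x)
        paths.append(uri)
    # Returning a DataFrame of URIs keeps the cached output tiny: a cache
    # hit only fetches this small table, not the tensors themselves.
    return pd.DataFrame({"path": paths})
```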