# ask-the-community
s
Hello :), what is the best way to develop a producer-consumer pipeline using Flyte? Scenario: a producer reads words from a large file and passes each word to a "word queue". Workers listening on that queue pick up a word, perform an operation (say, computing the word's length), and forward the length to a "word length" queue. A worker listening on the "word length" queue plots a histogram of word lengths in real time. Once the file is exhausted, the workers are shut down and the final histogram is persisted. The above is a toy example; we want to implement the same architecture for a workload involving protein sequences with a computationally heavy operation. Currently we have built a workflow that chunks the large file and passes the chunks to N workers via `map_task()`; however, due to the DAG nature of a Flyte workflow, the workers have to wait until the chunking finishes. What is a good pattern for doing this in Flyte? Thanks for your help!
k
And you cannot collapse the `word length` and `word length histogram` steps into one Python function, or you prefer not to?
At the moment, sadly, all map tasks have to finish before downstream tasks can start.
We are indeed working on mapping over workflows/launchplans, and we will be delivering this later. But we would love to understand whether you can work with this limitation today.
cc @John Votta / @Haytham Abuelfutuh - map task over launchplans.
s
@Ketan (kumare3), we are currently using `map_task()` to distribute the work across workers; however, that is not the ask here. The ask is to spin up N consumers and M producers that communicate over a queue.
k
At Union we are working on reusable containers; it's in early alpha right now. You won't have to think about queues.