We have a need to spin up virtual machines in the cloud that Flyte #flyte-support

We have a need to spin up virtual machines in the ...

calm-zoo-68637

03/22/2024, 6:13 PM

We have a need to spin up virtual machines in the cloud that run a simulation. One option to support this use is to write a Flyte agent that spins up the job. Another option is to write a Flyte task that spins up the job, polls for completion, and then downloads the outputs. What do we get from using a flyte agent (option 1) that we couldn't get from a long-running task (option 2)?

tall-lock-23197

03/25/2024, 5:34 AM

cc @glamorous-carpet-83516

thankful-minister-83577

03/25/2024, 6:49 PM

a long running task will take up space/resources. an agent runs as part of pod that can run multiple agents, like a service. it’ll be up all the time (so if you have no other agents, technically it may take up more resources). the one agent will also always be up and and can run multiple task instances, whereas in the long poll case, it sounds like you’d be running one polling task for every real task.

👍 1

glamorous-carpet-83516

03/25/2024, 6:54 PM

yes, if your task is not running in the k8s, it’s better to create an agent to submit your job to external system.

glamorous-carpet-83516

03/25/2024, 6:59 PM

that we couldn’t get from a long-running task (option 2)?

what does this long-running task do? if it just send a http request to the cloud, it will waste many resources because propeller launch a pod for each task.

calm-zoo-68637

03/26/2024, 4:39 AM

How does flyte decide when it needs another replica of the pod hosting an agent? Is there some kind of pod autoscaler deployed by default? Otherwise I worry the agent will run out of resources under heavy loud (thousands of concurrent tasks)

thankful-minister-83577

03/26/2024, 5:28 AM

not that I know of. but thousands of concurrent tasks is still pretty light I feel. I know kevin did some load testing a while back.

👍 1

calm-zoo-68637

03/28/2024, 2:08 AM

Will try and report back!

high-accountant-32689

03/28/2024, 3:43 PM

The agents pods are running behind a k8s

Deployment

and even though we don't ship a pod autoscaler in our helm charts, it'd be pretty simple to add one.

👍 1

7 Views

Open in Slack

Previous Next