thread I have read the documentation about Flyte agents and Flyte #flyte-support

:thread: I have read the documentation about Flyte...

clean-glass-36808

05/30/2024, 6:51 PM

🧵 I have read the documentation about Flyte agents and I feel like I understand how they simplify development of integrations with external services but I'm curious what the tradeoffs are with agents. At what point would I need to develop a plugin instead of going the agent route?

freezing-airport-6809

05/30/2024, 9:25 PM

@clean-glass-36808 great question. the reason is performance

freezing-airport-6809

05/30/2024, 9:26 PM

So Agents, are checked only in a loop. If jobs complete sooner, today they will take atleast 30 seconds to be marked as complete

freezing-airport-6809

05/30/2024, 9:26 PM

the goal is to make this as part of our reactive discovery, but since the status is polled it is slower

clean-glass-36808

05/30/2024, 9:28 PM

Thank you

freezing-airport-6809

05/30/2024, 9:54 PM

@clean-glass-36808 we would love to learn how the experience has been with agents?

clean-glass-36808

05/31/2024, 2:24 AM

Its been okay, but I'll say that the general open source support has been good (especially compared to my experience working with the Armada folks). We have an Armada agent running that we're still evaluating and trying to understand what the limitations are (ie. dynamic tasks, map_task, etc). I think there are still some subtle gotchas in the docs depending on what the agent does. For example.. 1. If your agent is scheduling compute on some other system, you'll need to create a custom image (with pyflyte as a base) that includes your custom task packages. Otherwise the python code run by pyflyte-execute will fail. Default images will never work for these use cases. 2. On top of that, the custom image must not have the your custom agent packages. Otherwise when pyflyte-execute runs your task I believe it will try to run the agent logic locally on that binary as well. This was pretty confusing. Would also be good to understand best practices around logging on the agent and maybe error handling so things bubble up into the flyte UI nicely.

👀 1

freezing-airport-6809

05/31/2024, 2:26 AM

Ohh the 2 seems odd - and a bug. I think it should never load agents - cc @glamorous-carpet-83516 Runtime should bypass agents and go to the executor

freezing-airport-6809

05/31/2024, 2:26 AM

Also 1 - images should be built just the way any flytekit function is

freezing-airport-6809

05/31/2024, 2:26 AM

I also want to understand what are you trying to solve with armada

clean-glass-36808

05/31/2024, 4:07 AM

I think the tl;dr is that we don't want to rely on k8s to queue workloads into a pending state think (think tens of thousands of tasks/jobs kicked off at once). We can't autoscale the cluster since we're not on public cloud. Additionally, we have some use cases for gang scheduling that I don't think the k8s scheduler is suitable for. Happy to invite my colleagues to share some lower level details

freezing-airport-6809

05/31/2024, 4:09 AM

No makes sense - but I think if you use quota Flyte will back off till pods are available- makes sense

freezing-airport-6809

05/31/2024, 4:09 AM

We would infact love a backend plugin for this

freezing-airport-6809

05/31/2024, 4:09 AM

It’s more k8s native

clean-glass-36808

05/31/2024, 4:14 AM

re: 1, It makes sense to me now but following along the flyte agent docs it wasn't obvious to me why I would need to make a custom image to run vanilla python logic in a task (ie. print hello world). It felt like the task decorator was more of a control plane level detail but yeah it makes sense now.

freezing-airport-6809

05/31/2024, 4:17 AM

@clean-glass-36808 can you please help with doc update?

clean-glass-36808

05/31/2024, 4:19 AM

Yeah. I will validate #2 as well

clean-glass-36808

05/31/2024, 4:19 AM

An example skeleton of an agent on GitHub would probably go a long way tbh.

🎯 1

freezing-airport-6809

05/31/2024, 4:24 AM

Cc @glamorous-carpet-83516 / @damp-lion-88352 example skeleton to start different types of agents from would be awesome

freezing-airport-6809

05/31/2024, 4:25 AM

I infact that we have pyflyte init with cookie-cutter why cant we have

pyflyte init async-agent-starter template

something like that?

2 Views

Open in Slack

Previous Next