Hi team I am a DS from gojek currently working on using Flyt Flyte #announcements

Hi team, I am a DS from gojek, currently working o...

average-telephone-1006

01/24/2022, 2:57 AM

Hi team, I am a DS from gojek, currently working on using Flyte for model pipeline. One question I came across is that the resource requested (cpu, memory) is hard coded in the decorator for each task for the current implementation, so it’s the same for different environments. What will be a better way to do overrides for each task in the workflow for different environments? Will it be possible to define it in a config file instead? cc @most-gold-65483 @late-fish-63527

freezing-airport-6809

01/24/2022, 3:19 AM

Aah interesting question- you basically want to change it based on domain. May I ask a few clarifying questions So to understand you want to specify different resources for the same task across different domains? Or do you want a common resource for all task la and change it per domain

freezing-airport-6809

01/24/2022, 3:19 AM

Also what would be the ideal UX

freezing-airport-6809

01/24/2022, 3:30 AM

Like the use experience in the code and how would you want to manage that

freezing-airport-6809

01/24/2022, 3:30 AM

Cc @thankful-minister-83577 / @high-accountant-32689

average-telephone-1006

01/24/2022, 3:55 AM

It is more of the former, specifying different resource requirements for the same task across different domains.

average-telephone-1006

01/24/2022, 3:57 AM

For my implementation, different domains (e.g. staging or production) will have its own config files which contains different config setting for the model (e.g. model hyperparameters). It will be gd if the resource specification can be implemented in the same way.

freezing-airport-6809

01/24/2022, 4:00 AM

Ya so since registration for each domain happens separately - and there is a variable that is exposed at this time, it is possible to use different values

freezing-airport-6809

01/24/2022, 4:00 AM

But would you like config or just python variables would work?

freezing-airport-6809

01/24/2022, 4:03 AM

Will share an example later

freezing-airport-6809

01/24/2022, 4:03 AM

Sorry not near a computer

average-telephone-1006

01/24/2022, 4:04 AM

for domain, do you refer to project domain?

freezing-airport-6809

01/24/2022, 4:14 AM

freezing-airport-6809

01/24/2022, 4:14 AM

I thought that's what you want?

freezing-airport-6809

01/24/2022, 4:50 AM

@average-telephone-1006 ?

freezing-airport-6809

01/24/2022, 6:28 AM

so would something like this be preferrable

Copy code

RES = {
"development": {
   "requests": Resources(...),
   "limits": Resources(...),
},
"production": {
    ...
 },
}

@task(requests=RES[settings.domain]["requests"], limits=RES[settngs.domain]["limits"],)
def foo():
   ...

freezing-airport-6809

01/24/2022, 6:28 AM

wdyt? ^ cc @high-accountant-32689 / @thankful-minister-83577 / @glamorous-carpet-83516

👍 1

thankful-minister-83577

01/24/2022, 5:17 PM

that will work.

thankful-minister-83577

01/24/2022, 5:18 PM

the default (if the task doesn’t specify anything) is already configurable via the matchable resources endpoint

thankful-minister-83577

01/24/2022, 5:18 PM

iirc

average-telephone-1006

01/25/2022, 2:24 AM

sorry for the late reply @freezing-airport-6809 I had a misunderstanding on the domain. Lemme rephrase my reply, the requirement will be specifying different resource requirements for the same task under the same project domain, the same workflow but different launchplans for the workflow. Example: there are 2 launchplans specified for workflow , one is for staging environment and another is for production. I would like to change the resource request based on the parameter

environment

here.

freezing-airport-6809

01/25/2022, 2:51 AM

Why do you need to create 2 launchplans. The same launchplan can be created in staging and production. I am assuming here environment refers to - Flyte domains

freezing-airport-6809

01/25/2022, 2:52 AM

Cc @most-gold-65483 ?

freezing-airport-6809

01/25/2022, 2:53 AM

Would you guys have a few minutes to chat over VC in 2 hours

most-gold-65483

01/25/2022, 3:05 AM

It’s essentially one launchplan but with different fixed input i.e.

Copy code

if os.getenv("WORKFLOW_DOMAIN") == "production":
    # Create scheduled execution in production environment

    # schedule for model training and deployment
    train_and_deploy_wf_schedule = LaunchPlan.get_or_create(
        name=f"{__name__}.train_and_deploy_wf_schedule",
        # name of workflow to be triggered
        workflow=train_and_deploy,
        # fixed_inputs is used to define the workflow input when triggered by scheduler
        # in this case we want to pass "evironment" = "production"
        fixed_inputs={
            "environment": "production"
        },
        schedule=CronSchedule(
            schedule="0 10 * * 1",
            # The execution time is passed to workflow as argument called "kickoff_time"
            kickoff_time_input_arg="kickoff_time",
        ),
    )
elif os.getenv("WORKFLOW_DOMAIN") == "staging":
    # Create scheduled execution in staging environment

    train_and_deploy_wf_schedule = LaunchPlan.get_or_create(
        name=f"{__name__}.train_and_deploy_wf_schedule",
        workflow=train_and_deploy,
        fixed_inputs={
            "environment": "staging"
        },
        schedule=CronSchedule(
            schedule="0 10 * * 1",
            # The execution time is passed to workflow as argument called "kickoff_time"
            kickoff_time_input_arg="kickoff_time",
        ),
    )

Within the workflow, we use

environment

to chose the correct configuration for given domain/environment

most-gold-65483

01/25/2022, 3:07 AM

I think something like this will work https://flyte-org.slack.com/archives/CNMKCU6FR/p1643005698115600?thread_ts=1642993031.095000&cid=CNMKCU6FR

most-gold-65483

01/25/2022, 3:15 AM

I am available for chat, just ping me when you are available

freezing-airport-6809

01/25/2022, 3:41 AM

Ok cool, something like that should be simple

freezing-airport-6809

01/25/2022, 3:41 AM

Let's talk

most-gold-65483

01/25/2022, 10:10 AM

@average-telephone-1006 I had a chat with @freezing-airport-6809 regarding the resource requests. There are 2 ways to do it. 1. Using env var to select the correct resource requests for the given task, for example:

Copy code

domain = os.getenv("WORKFLOW_DOMAIN", "staging")
training_resource_requests = {
    "staging" : Resources(cpu="2", mem="4Gi"),
    "production" : Resources(cpu="4", mem="16Gi"),
}
training_resource_limits = {
    "staging" : Resources(cpu="2", mem="4Gi"),
    "production" : Resources(cpu="4", mem="16Gi"),
}
@task(requests=training_resource_requests[domain], limits=training_resource_limits[domain])
def train_model(train_df: pd.DataFrame, config: DictConfig) -> FlyteFile[TypeVar("joblib.dat")]:

We have

WORKFLOW_DOMAIN

being set in CI during workflow registration which you can use to configure the workflow. 2. Using dynamics workflow (https://docs.flyte.org/projects/flytekit/en/latest/generated/flytekit.dynamic.html#flytekit.dynamic) This is more powerful but more complex. In this case you can use the

config

input variable to store the resource request and the resource request can be adjusted without having to resubmit the workflow. The option 1 above will only configure resource request once during workflow submission.

👍 1

most-gold-65483

01/25/2022, 10:10 AM

Any example for the 2nd option @freezing-airport-6809?

freezing-airport-6809

01/25/2022, 2:41 PM

Cc @thankful-minister-83577 / @high-accountant-32689 / @tall-lock-23197

175 Views

Open in Slack

Previous Next