i created a PythonInstanceTask with the name `onemodel get a Flyte #flytekit

i created a PythonInstanceTask with the name `onem...

famous-businessperson-24711

08/30/2022, 8:41 PM

i created a PythonInstanceTask with the name

onemodel.get_assumptions.base_prices

but it registered as `onemodel.models.annotations.wf.onemodel.models.annotations.onemodel.get_assumptions.base_prices`*.* Any ideas? i'm guessing something around tracked instance is injecting this?

thankful-minister-83577

08/30/2022, 8:45 PM

how did you assign the name?

thankful-minister-83577

08/30/2022, 8:47 PM

oh name constructor

famous-businessperson-24711

08/30/2022, 8:47 PM

https://gist.github.com/dylanwilder/4bf86ee1563fbec12a6497f5091eabf2#file-assumptions-py-L19

famous-businessperson-24711

08/30/2022, 8:48 PM

for context, this class is meant to maintain the correct schema'd type signature but still work as a general utility for fetching any schema type

famous-businessperson-24711

08/30/2022, 8:51 PM

(side note: lmk if this is insane to do)

thankful-minister-83577

08/30/2022, 9:19 PM

this gist is self-contained right?

thankful-minister-83577

08/30/2022, 9:19 PM

i can run it? it looks self contained

famous-businessperson-24711

08/30/2022, 9:19 PM

I think ao

thankful-minister-83577

08/30/2022, 9:19 PM

and where are you seeing the long name?

famous-businessperson-24711

08/30/2022, 9:20 PM

console

famous-businessperson-24711

08/30/2022, 9:22 PM

for the instance task

thankful-minister-83577

08/30/2022, 11:12 PM

so if you comment out the workflow, and you just write a unit test, I think you get what you want.

thankful-minister-83577

08/30/2022, 11:14 PM

there was a desire to support tasks that were declared within a workflow a while back. don’t remember the justification, maybe ketan does. the problem with that comes at execution time… how do you load the task?

famous-businessperson-24711

08/30/2022, 11:15 PM

This won’t execute?

thankful-minister-83577

08/30/2022, 11:15 PM

normally, a task is declared at the .py file layer.

famous-businessperson-24711

08/30/2022, 11:15 PM

because the task loader can’t find it yea

thankful-minister-83577

08/30/2022, 11:15 PM

if it’s embedded within a task, you have to know to look at the workflow class first, to get the task

thankful-minister-83577

08/30/2022, 11:16 PM

so our solution was to make the workflow object itself, a task resolver.

thankful-minister-83577

08/30/2022, 11:16 PM

how does a task know if it needs to use a workflow to resolve? if it’s created during the compilation step of a workflow

famous-businessperson-24711

08/30/2022, 11:16 PM

interesting. So it does work?

famous-businessperson-24711

08/30/2022, 11:17 PM

but has a weird resolution path?

thankful-minister-83577

08/30/2022, 11:17 PM

so what’s happening is that the name is getting overwritten by https://github.com/flyteorg/flytekit/blob/master/flytekit/core/python_auto_container.py#L95

thankful-minister-83577

08/30/2022, 11:17 PM

the normal case works yeah, that is tested i’m pretty sure, though we don’t advertise it much.

thankful-minister-83577

08/30/2022, 11:18 PM

i’m not sure how to make this case work

thankful-minister-83577

08/30/2022, 11:18 PM

there’s an extra layer of indirection here, the assumptions class

thankful-minister-83577

08/30/2022, 11:18 PM

would have to trace through the workflow task loading logic, etc.

famous-businessperson-24711

08/31/2022, 2:42 AM

Do you have an example of what does work?

famous-businessperson-24711

08/31/2022, 2:43 AM

maybe I can refactor this

famous-businessperson-24711

08/31/2022, 3:02 AM

actually. i just need to define a task resolver that can reconstitute this task right?

famous-businessperson-24711

08/31/2022, 3:32 AM

looks like it's being overriden in the init func

famous-businessperson-24711

08/31/2022, 4:11 PM

@freezing-airport-6809 I was told you might be the expert here. I was able to get the above to work by created a custom task resolver and manually overriding the global context like so

Copy code

ctx = FlyteContextManager.current_context()
state = ctx.compilation_state.with_params("", resolver=_resolver)
with FlyteContextManager.with_context(ctx.with_compilation_state(state)):
    task = cls._Task(assumption_name)

But i'm unclear what the intended behavior is ie why does the task contstructor ignore the passed resolver in favor of something in the global context

famous-businessperson-24711

08/31/2022, 4:12 PM

Specifically: what's this logic all about

freezing-airport-6809

08/31/2022, 4:13 PM

I guess it’s because the compilation context can provide a better resolveR

freezing-airport-6809

08/31/2022, 4:14 PM

I will Have to understand the usecase. May very well be defensive

famous-businessperson-24711

08/31/2022, 4:14 PM

probably easiest to chat quickly if you have a min

famous-businessperson-24711

08/31/2022, 4:15 PM

the above thread is a lot to parse 😄

thankful-minister-83577

08/31/2022, 4:16 PM

ketan’s out today, but this is the logic we were talking about. the task defined in a workflow.

thankful-minister-83577

08/31/2022, 4:16 PM

the passed in task resolver is always respected unless you’re in a workflow.

thankful-minister-83577

08/31/2022, 4:17 PM

which you are right?

famous-businessperson-24711

08/31/2022, 4:17 PM

i suppose yes, so the workflow compilation context overrides the task resolver?

thankful-minister-83577

08/31/2022, 4:17 PM

this was the confusing part about the code yesterday and why it took so long to hunt down.

famous-businessperson-24711

08/31/2022, 4:17 PM

i might still expect that if a user specifically passes a resolver they know what they're doing

famous-businessperson-24711

08/31/2022, 4:18 PM

ie this part is a little confusing

thankful-minister-83577

08/31/2022, 4:18 PM

yeah and i think we’re open to changing that logic… but i still think it’s confusing

thankful-minister-83577

08/31/2022, 4:19 PM

Copy code

def test_dwild():
    x = BasePricesSchema.name
    print(x)
    Assumptions.get(version_id="abc", assumption=x)


@task
def t2(pdf: BasePricesSchema) -> int:
    return len(pdf)


@workflow
def wf(version_id: str) -> int:
    bp = Assumptions.get(version_id=version_id, assumption=BasePricesSchema.name)
    return t2(pdf=bp)

this test produces the wrong name.

thankful-minister-83577

08/31/2022, 4:19 PM

however this test produces the right name

thankful-minister-83577

08/31/2022, 4:19 PM

Copy code

def test_dwild():
    x = BasePricesSchema.name
    print(x)
    Assumptions.get(version_id="abc", assumption=x)


# @task
# def t2(pdf: BasePricesSchema) -> int:
#     return len(pdf)
# 
# 
# @workflow
# def wf(version_id: str) -> int:
#     bp = Assumptions.get(version_id=version_id, assumption=BasePricesSchema.name)
#     return t2(pdf=bp)

famous-businessperson-24711

08/31/2022, 4:19 PM

flattered to have my own task named after me 🤩

thankful-minister-83577

08/31/2022, 4:19 PM

hahah

famous-businessperson-24711

08/31/2022, 4:20 PM

i don't care too much about the name, but need the custom resolve to resolve at runtime

famous-businessperson-24711

08/31/2022, 4:23 PM

working code

thankful-minister-83577

08/31/2022, 4:32 PM

you can always just set it after the instantiation too

thankful-minister-83577

08/31/2022, 4:32 PM

not that that looks any cleaner

famous-businessperson-24711

08/31/2022, 4:42 PM

i'm ok with this for now, but am wary of doing stuff that might break later 😄

160 Views

Open in Slack

Previous Next