making this a top level thread for visibility 🙂 - Flyte #flytekit

famous-businessperson-24711

05/02/2022, 3:35 PM

making this a top level thread for visibility 🙂 is it possible to inject env vars into workflows at launch time? or to extend flyte context with user info?

thankful-minister-83577

05/02/2022, 4:59 PM

where would they come from?

famous-businessperson-24711

05/02/2022, 5:01 PM

you mean where would we inject them?

famous-businessperson-24711

05/02/2022, 5:26 PM

for context we'd like to be able to pass environment to the tasks at runtime in order to distinguish between dev/staging/prod. since we can run staging tasks in prod we can't necessarily rely on flyte env from the other thread

thankful-minister-83577

05/02/2022, 5:27 PM

you just need to know the flyte admin domain for the execution?

thankful-minister-83577

05/02/2022, 5:27 PM

i think we can just use that flytekit config for now. it’s not a public api though, so we’ll add it to the context.

famous-businessperson-24711

05/02/2022, 6:25 PM

a better example that will make this clearer: i need to inject the url of a service endpoint which may be different depending on where the workflow was launched

famous-businessperson-24711

05/02/2022, 6:26 PM

since you can technically launch dev/staging/prod workflows from anywhere (i.e. the production flyte instance), i can't rely on that to know if i should be hitting production databases or not

thankful-minister-83577

05/02/2022, 6:49 PM

for the most part we try to avoid adding env vars at run time because that can violate reproducibility, but this is the main situation where we would want to.

thankful-minister-83577

05/02/2022, 6:50 PM

but i think just exposing the execution domain would be enough, right?

thankful-minister-83577

05/02/2022, 6:51 PM

things that were launched into any project’s production domain should hit the service’s production endpoint right?

famous-businessperson-24711

05/02/2022, 6:51 PM

well what is "execution domain" the domain of the top level workflow?

thankful-minister-83577

05/02/2022, 6:51 PM

yeah

thankful-minister-83577

05/02/2022, 6:52 PM

(also fyi we’ve impacted services at lyft because of large workflows hitting them before, but i think those were run on aws batch so thousands of machines.)

famous-businessperson-24711

05/02/2022, 6:56 PM

consider that we have a separate system, external to flyte, which has its own staging/prod tiers. we want our users to only use the prod tier, even if they're doing development, but we still need an internal staging tier for ourselves, so we need to determine which system we're running in, irrespective of the flyte domain
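
To make the request above concrete, here is a minimal sketch of what task code could do if a launch-time variable were available. The variable name `EXTERNAL_SERVICE_TIER` and the tier names are hypothetical; nothing in this thread says Flyte provides such a variable. The point is only that the tier is read independently of the Flyte domain and defaults to prod.

```python
import os
from typing import Mapping

# Hypothetical variable name, chosen for illustration; Flyte does not define it.
TIER_ENV_VAR = "EXTERNAL_SERVICE_TIER"
# Tier names assumed from the message above (prod for users, staging for internal use).
ALLOWED_TIERS = ("prod", "staging")


def resolve_service_tier(environ: Mapping[str, str] = os.environ) -> str:
    """Return the external system's tier without consulting the Flyte domain."""
    # Default to prod so ordinary users never reach the internal staging tier.
    tier = environ.get(TIER_ENV_VAR, "prod").strip().lower()
    if tier not in ALLOWED_TIERS:
        raise ValueError(f"{TIER_ENV_VAR}={tier!r} is not one of {ALLOWED_TIERS}")
    return tier
```

Passing the environment as a parameter keeps the function easy to test without mutating `os.environ`.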

famous-businessperson-24711

05/02/2022, 6:57 PM

but yes i hear you about breaking referential transparency, i think this is an exception

thankful-minister-83577

05/02/2022, 11:03 PM

so when would you want flyte to inject what values? what i mean is, you can already configure flyte to add default env vars

thankful-minister-83577

05/02/2022, 11:03 PM

https://github.com/flyteorg/flyte/blob/b7928c075225843d097a1c598ccd7f51b2e87069/deployment/eks/flyte_helm_generated.yaml#L509

thankful-minister-83577

05/02/2022, 11:06 PM

but obviously this is static per flyte installation

famous-businessperson-24711

05/02/2022, 11:55 PM

Launch time

thankful-minister-83577

05/03/2022, 2:30 AM

why not make it an input?

famous-businessperson-24711

05/03/2022, 12:36 PM

could, but that would result in a spaghetti code wherein every workflow/task needs it

thankful-minister-83577

05/03/2022, 4:27 PM

i still don’t understand though… if each flyte installation (i don’t know how many you have) always tells each task the same environment, you can just use the default env vars setting right?

thankful-minister-83577

05/03/2022, 4:27 PM

if any given flyte installation can tell tasks to use different service environments, where does it get that information from and how is that decision made?

thankful-minister-83577

05/03/2022, 4:28 PM

does that make sense? just trying to understand where the source of the information is coming from

brainy-ocean-79995

05/11/2022, 8:58 PM

We would love to be able to keep something like a configmap with external service urls per environment, which could be added to an envFrom on the execution pods in our flyte execution k8s cluster. We currently key off FLYTE_INTERNAL_EXECUTION_DOMAIN and hardcode external service urls in the workflow code for tasks which need to make calls out to other microservices. If any of those change we have to update the workflows.
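
For readers unfamiliar with the pattern described above, a rough sketch of keying off `FLYTE_INTERNAL_EXECUTION_DOMAIN` might look like the following. The domain names assume Flyte's usual development/staging/production setup, and the service name and URLs are placeholders, not the poster's real endpoints.

```python
import os
from typing import Mapping

# Placeholder service name and URLs, for illustration only.
SERVICE_URLS = {
    "development": {"example_service": "https://service.dev.example.com"},
    "staging": {"example_service": "https://service.staging.example.com"},
    "production": {"example_service": "https://service.example.com"},
}


def service_url(name: str, environ: Mapping[str, str] = os.environ) -> str:
    """Look up a service URL for the Flyte domain this task is executing in."""
    # The thread states this variable is available to keyed-off task code.
    domain = environ["FLYTE_INTERNAL_EXECUTION_DOMAIN"]
    try:
        return SERVICE_URLS[domain][name]
    except KeyError:
        raise KeyError(f"no URL configured for {name!r} in domain {domain!r}") from None
```

The drawback raised in the message is visible here: the map lives in workflow code, so any endpoint change means editing and re-registering workflows.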

brainy-ocean-79995

05/11/2022, 9:04 PM

It seems unwieldy to specify as inputs

brainy-ocean-79995

05/11/2022, 9:07 PM

For us, we generally can map Flyte domain to our external services env

brainy-ocean-79995

05/11/2022, 9:21 PM

Thanks for pointing out the default envs per propeller k8s plugin config. We could probably use that, and I was unaware of that feature

thankful-minister-83577

05/11/2022, 9:43 PM

https://github.com/flyteorg/flyte/issues/2447

thankful-minister-83577

05/11/2022, 9:44 PM

created that a couple days ago

thankful-minister-83577

05/11/2022, 9:44 PM

mind adding your thoughts on there @brainy-ocean-79995? sorry i should’ve posted it earlier.

thankful-minister-83577

05/11/2022, 9:45 PM

can you elaborate what you mean by “per environment” as well?

thankful-minister-83577

05/11/2022, 9:45 PM

you mean per flyte domain?

brainy-ocean-79995

05/11/2022, 9:53 PM

Happy to add thoughts to that discussion. Our use case is a bit different than Dylan's in that our configured domains match our external service ecosystem environments, so we generally want workflows in the "development" domain to use the "development" service ecosystem and matching service urls. However we have a bunch of commonly used external services within various projects and workflows. We would love a way to have something like a list of "development" service urls that could get injected for workflows in the development domain, for a bunch of projects/workflows. Currently every project and workflow keys off the internal env var to get the domain, and has a hard coded map of service urls for that domain, i.e. ZAUTH_UTL, ZREST_URL, etc. Mostly it's a lot of repeated boilerplate config that different project teams are maintaining. We could put it in a Python lib to DRY it up a bit, but it's the kind of thing we often put in configmaps in our other k8s ecosystems
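
If the service urls were injected as environment variables (for example from a configmap via envFrom, or via the static default env vars mentioned earlier in the thread), the per-project boilerplate could shrink to something like this sketch. How the variables reach the pod is an assumption here; the helper name `load_service_urls` is hypothetical and only reads whatever is already in the environment.

```python
import os
from typing import Dict, Iterable, Mapping


def load_service_urls(
    names: Iterable[str], environ: Mapping[str, str] = os.environ
) -> Dict[str, str]:
    """Read the named service URLs from the environment, failing loudly if any are unset."""
    names = list(names)
    # Treat empty strings as missing so a blank configmap entry is caught early.
    missing = [n for n in names if not environ.get(n)]
    if missing:
        raise KeyError("missing service URL env vars: " + ", ".join(missing))
    return {n: environ[n] for n in names}
```

A task would then call `load_service_urls(["ZREST_URL"])` once and no longer need a hard coded per-domain map.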
