# ask-the-community
k
Hi Community, I am busy doing a PoC to use Flyte with Apache Spark. 🙂 So far it's been great; however, I have one requirement I haven't been able to solve, and that is:
To be able to set Spark config at task runtime, just before the Spark session is created, instead of at Flyte task creation or registration time.
(The reason is we have a multi-tenanted platform where we need to control which metastore to talk to based on config passed to the Flyte task at runtime.) Does anybody have any suggestions or tips on how to solve this?
k
You can set it in the Spark context, right? If it is only metastore config.
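A minimal sketch of that suggestion, assuming flytekitplugins-spark is installed; the task name, conf values, and `events` table are illustrative. The caveat in the comments is the limitation the rest of the thread turns on:

```python
import flytekit
from flytekit import task
from flytekitplugins.spark import Spark

@task(task_config=Spark(spark_conf={"spark.executor.instances": "2"}))
def read_for_tenant(metastore_uri: str) -> int:
    # The Spark plugin creates the session before the task body runs.
    spark = flytekit.current_context().spark_session
    # Runtime-settable options can be changed on the live session:
    spark.conf.set("spark.sql.shuffle.partitions", "64")
    # But metastore settings are generally fixed once a Hive-enabled session
    # exists, so setting them here may simply have no effect; that is the
    # limitation being discussed in this thread.
    spark.conf.set("hive.metastore.uris", metastore_uri)
    return spark.sql("SELECT COUNT(*) FROM events").collect()[0][0]
```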
I would still store it at registration time and change it based on project
k
So we are looking for a way to set the metastore at Flyte job runtime, as the code that runs is generic: it takes in a tenantIdentifier and uses it to fetch the correct tenant's data from that tenant's metastore. What you are suggesting means we need the exact same job for each tenant, which doesn't scale from a maintainability standpoint; with 100 tenants, it's a lot of maintenance if every single job is the same code with a different runtime value hardcoded.
Before Flyte, we had a Spark-submitting service that would set this on each run of the job, either in the spark-submit call or, in other instances, in the REST payload to Apache Livy.
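For context, the pre-Flyte pattern described above looks roughly like this; the Livy endpoint, file path, and conf values are hypothetical, but the per-run `conf` map in the batch payload is the knob being discussed:

```python
import requests

# Hypothetical Livy endpoint and job location; the per-run "conf" map is how
# the old submitting service picked the tenant's metastore on each submission.
payload = {
    "file": "s3://jobs/tenant_etl.py",
    "args": ["--tenant", "tenant-a"],
    "conf": {"spark.hadoop.hive.metastore.uris": "thrift://metastore-a:9083"},
}
resp = requests.post("http://livy:8998/batches", json=payload)
resp.raise_for_status()
print(resp.json()["id"])  # Livy batch id, usable for status polling
```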
k
@Keith Leo-Smith registration in Flyte is lightweight; you don't need multiple code copies
Just parameterize the Spark conf
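One way to read "parameterize the Spark conf": resolve it once, at registration time, from something that differs per project or tenant. A sketch, assuming an environment variable (`TENANT_METASTORE_URI` is a made-up name) read when the task module is loaded for registration:

```python
import os

import flytekit
from flytekit import task
from flytekitplugins.spark import Spark

# Resolved when the task is registered, not when it runs; registering the
# same code once per project/tenant with a different value gives each
# registration its own metastore.
METASTORE_URI = os.environ.get(
    "TENANT_METASTORE_URI", "thrift://metastore-default:9083"
)

@task(
    task_config=Spark(
        spark_conf={"spark.hadoop.hive.metastore.uris": METASTORE_URI}
    )
)
def count_events(table: str) -> int:
    spark = flytekit.current_context().spark_session
    return spark.sql(f"SELECT COUNT(*) FROM {table}").collect()[0][0]
```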
k
So you're saying to manually register a new task per tenant with the Spark conf hardcoded? (We wanted to avoid this, as it makes knowing which tasks to run a problem further up.) But it's an option (a less desirable option) if there is no other clear way.
k
You can use dynamic workflows to modify the config
With overrides
And eventually we will support runtime overrides
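A sketch of the dynamic-workflow route. Dynamic workflows are compiled at execution time, so the override can be derived from the task's inputs; note that passing `task_config` to `with_overrides` is the runtime-override support alluded to above, and only newer flytekit releases accept it, so treat this as version-dependent. The tenant mapping is illustrative:

```python
import flytekit
from flytekit import dynamic, task, workflow
from flytekitplugins.spark import Spark

# Illustrative tenant -> metastore mapping.
TENANT_METASTORES = {
    "tenant-a": "thrift://metastore-a:9083",
    "tenant-b": "thrift://metastore-b:9083",
}

@task(task_config=Spark(spark_conf={}))
def count_events(table: str) -> int:
    spark = flytekit.current_context().spark_session
    return spark.sql(f"SELECT COUNT(*) FROM {table}").collect()[0][0]

@dynamic
def per_tenant(tenant_id: str, table: str) -> int:
    # The dynamic workflow body runs at execution time, so the override below
    # is chosen from the actual input rather than hardcoded at registration.
    conf = Spark(
        spark_conf={
            "spark.hadoop.hive.metastore.uris": TENANT_METASTORES[tenant_id]
        }
    )
    return count_events(table=table).with_overrides(task_config=conf)

@workflow
def wf(tenant_id: str, table: str) -> int:
    return per_tenant(tenant_id=tenant_id, table=table)
```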
k
Great, I'll look into this.
Thank you!
t
> And eventually we will support runtime overrides
+1 for configuring and creating the Spark session at runtime. I'm pretty sure that will be useful for us in the future 🙂 (mostly for configs that must be set before starting the Spark context).
k
Let's make an issue; please chime in on how the experience should be.