# flyte-deployment
r
In case anyone's encountered this particular management task: we'd like to switch the S3 bucket we use for Flyte to one in a new region, and it's a fairly straightforward config change. I wanted to clarify: will redeploying the Flyte backend services after modifying the storage bucket break in-progress runs? I was hoping they could continue to execute, given the IAM roles will still permit reading from the old bucket, but I wanted to know if there are any possibilities of breakage if there's a currently-executing dynamic workflow, for example.
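For reference, the change we have in mind is roughly the following in our helm values; the exact keys depend on the chart version, and the bucket/region names here are placeholders:
```yaml
# Rough sketch of the planned change (flyte-core-style values; verify keys for your chart version)
storage:
  type: s3
  bucketName: our-new-flyte-bucket   # placeholder: the new bucket in us-east-1
  s3:
    region: us-east-1                # moving from us-west-1
```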
s
@Ketan (kumare3), would you mind taking a look at this question?
r
From the channel's history it looks like this is a somewhat dangerous operation, but we don't really care if links for intermediate artifacts break (we use a separate bucket for scratch vs. final outputs). Mostly curious whether this will lead to workflow execution interruptions.
@Ketan (kumare3) / @Eduardo Apolinario (eapolinario) do you guys have any idea here?
k
ohh sorry just saw this
r
Np, totally know you guys are swamped 🙂
k
@Rahul Mehta what are you changing? are you changing the metadata bucket?
r
Yes
We want to move it from usw1 -> use1
k
uh oh, then that will break everything
if you change the data bucket (raw data prefix), then it will continue running
if you change the metadata bucket, then previous data won't be found
here is what I would do: drain propeller and then transition
that means: do not pick up new work, but complete all in-flight work
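For anyone following along, the two settings being distinguished live in different places; a hedged sketch of where they sit in flyte-core-style helm values (key paths may differ by chart version):
```yaml
# Metadata bucket vs. raw data prefix -- verify the key paths for your chart version
storage:
  bucketName: flyte-metadata                 # metadata bucket: changing this breaks in-progress runs
configmap:
  core:
    propeller:
      rawoutput-prefix: s3://flyte-raw-data/ # raw data prefix: running workflows tolerate a change here
```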
r
Ok - how would we "drain" propeller?
To be fair, we're treating Flyte as "stateless" from the POV of other systems; any important artifacts are saved in a separate bucket and we also track run state separately. We just don't want to impact in-progress runs.
k
stop sending CRDs to the k8s cluster
just create a new cluster that uses the new metadata bucket and send workloads there
ya, so if you do a 2-cluster migration, then the old cluster can be drained and deleted
in-progress work should not be impacted
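Concretely, draining would look something like this; a minimal sketch assuming executions run as workflow CRs in the cluster and that nothing new is being submitted:
```sh
# Pause whatever submits new executions (schedulers, CI, clients), then
# watch the in-flight FlyteWorkflow CRs until none remain before cutting over.
kubectl get flyteworkflows --all-namespaces
```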
r
I see - got it. Even if we sync over the s3 bucket it'll break?
we could take a small downtime window to sync things over and redeploy
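Something like the following, assuming the AWS CLI and that both buckets are reachable from wherever it runs; bucket names are placeholders:
```sh
# One-off copy of everything from the old-region bucket to the new one
aws s3 sync s3://old-usw1-flyte-bucket s3://new-use1-flyte-bucket
```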
k
no, it will not
if you can sync, then one option would be to make the s3 buckets global
but it relies on the previous nodes' data being available
it also uses tricks like new keys to maintain consistency, but s3 is now fully consistent
r
ok. but to confirm, changing the bucket name won't impact it? I wanted to confirm that the state in the db just stores keys and not the full s3://<bucket name>/... path, since if it stores the full path the sync strategy wouldn't work
d
Hi! Also following this thread. Similarly, I've been running into issues trying to change the metadata bucket (GCS); discussion here. After creating a new bucket, copying all data over from the old bucket, and pointing Flyte at the new bucket, all historical workflows show "Status code 500: Failed to fetch data" (full error message in thread). Is it not possible to change buckets and still be able to access historical workflow data in the console?
k
I do not think so, as the bucket name has changed. The db stores links to the bucket. You will have to run a migration on the db
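A quick way to check what the db actually stores; the table and column names below are hypothetical and must be verified against your flyteadmin schema before relying on this:
```sh
# HYPOTHETICAL table/column names -- inspect your flyteadmin schema first.
# If the stored URIs embed the bucket name (e.g. s3://old-bucket/...),
# syncing the objects alone won't fix the console links.
psql "$FLYTEADMIN_DB_URL" -c "SELECT inputs_uri FROM executions LIMIT 5;"
```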
d
I see, thanks Ketan. And it seems like even with the old bucket kept up, flyte still can't access it via the old links. Was thinking it might work by just keeping both buckets up
k
it should
you have to enable multi-container mode in the storage config
d
it worked! thanks for your help Ketan!
r
What config changes did you need to make @Derek Yu? We're looking to do the exact same thing
d
Add this block to your helm values (this setting):
```yaml
storage:
  enableMultiContainer: true
```
And update flyte to use your new bucket. I believe it will continue reading old workflows from the old bucket while using the new bucket for new workflows
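For completeness, the combined storage block would look roughly like this; the bucket name is a placeholder and the keys may vary by chart version:
```yaml
# Hedged sketch: multi-container reads plus the new metadata bucket
storage:
  enableMultiContainer: true      # allows reads from the old bucket for historical workflows
  bucketName: new-metadata-bucket # placeholder: new executions write here
```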
r
Awesome! Thank you 🙂
Going to give this a try today