the new gant chart features is super awesome thanks for this Flyte #flytekit

the new gant chart features is super awesome! tha...

famous-businessperson-24711

04/20/2022, 9:04 PM

the new gant chart features is super awesome! thanks for this release. it does however re-raise a question as to why there is so much runtime overhead launching a workflow

famous-businessperson-24711

04/20/2022, 9:04 PM

cc @lively-sundown-82704

famous-businessperson-24711

04/20/2022, 9:05 PM

we could make this a task, but there are functional differences between tasks/workflows so we'd rather not

famous-businessperson-24711

04/20/2022, 9:05 PM

also, another question: why does cache reading take 13s?

freezing-airport-6809

04/20/2022, 10:00 PM

that sounds wrong

freezing-airport-6809

04/20/2022, 10:00 PM

it seems the deployment needs to be optimized

freezing-airport-6809

04/20/2022, 10:02 PM

we can help

famous-businessperson-24711

04/20/2022, 11:03 PM

Something wrong with our propeller setup?

freezing-airport-6809

04/21/2022, 12:34 AM

Ya seems like it

famous-businessperson-24711

04/22/2022, 7:02 PM

update: flagging this for the admins

famous-businessperson-24711

04/22/2022, 7:02 PM

actually, they're here 🙂 @swift-animal-75798 thoughts on this?

swift-animal-75798

04/22/2022, 7:04 PM

can take a look next week 👍 thanks

swift-animal-75798

04/29/2022, 2:18 PM

is this cache latency affected by the

workflow-reeval-duration

config? Which we intentionally set to 30s (from default 10s) to reduce the amount of checks during peak hours? wdyt @freezing-airport-6809?

freezing-airport-6809

04/29/2022, 2:37 PM

I think the biggest thing would be - max-streaks

freezing-airport-6809

04/29/2022, 2:38 PM

We can safely make it 10 or even more

swift-animal-75798

05/02/2022, 2:39 PM

we set:

Copy code

max-streak-length: 10
workflow-reeval-duration: 10s

but both had 0 effect on these executions

swift-animal-75798

05/02/2022, 2:40 PM

Would it make sense to set a lower value at downstream eval? the current value is 30s, would 5s work or is there risk of side-effects ?

Copy code

downstream-eval-duration: 30s

👍 1

freezing-airport-6809

05/02/2022, 3:02 PM

@swift-animal-75798 that is interesting. Downstream eval is more important than workflow reval. But it seems There is some other problem. Are there too many old workflows hanging around? Cc @hallowed-mouse-14616

swift-animal-75798

05/02/2022, 3:05 PM

about 4000 flyte workflow resources in our gke cluster currently, which seems a bit elevated

freezing-airport-6809

05/02/2022, 3:06 PM

That is tiny

freezing-airport-6809

05/02/2022, 3:06 PM

Let me ping

swift-animal-75798

05/02/2022, 3:07 PM

we may expect around 2000+- at this time of the day i guess. (rough numbers)

freezing-airport-6809

05/02/2022, 3:07 PM

I have not seen a problem till 50k+

freezing-airport-6809

05/02/2022, 3:07 PM

But, mostly because of k8s throttling

freezing-airport-6809

05/02/2022, 3:08 PM

We can also use sharded propeller

famous-businessperson-24711

05/02/2022, 5:18 PM

sounds fancy 😉

163 Views

Open in Slack

Previous Next