# flyte-support
l
Hi team, I’m running into the same issue as what’s discussed here. It’s a dynamic task that is used to chunk up and trigger around 8000 map tasks. The input to the dynamic task is a list of inputs (dataclasses) for these map tasks, and each one is just 4 S3 URLs. To get around this I serialized the dataclasses into a single file and read that in the dynamic task to recreate the inputs to the map tasks, but I’m still running into this error:
TooLarge: Event message exceeds maximum gRPC size limit, caused by [rpc error: code = ResourceExhausted desc = grpc: received message larger than max (6192640 vs. 4194304)]
I currently trigger the map tasks in 2 chunks, 5k and 3k. Should I make smaller chunks?
Guessing it’s at the point of triggering the map tasks, in which case the first workaround is useless.
t
this isn’t in the running of the task itself, it looks like; it’s the sending of the event to admin, which contains the inputs. just too many S3 paths.
the short-term fix is to bump the limit; at 6192640 bytes the event is only about 1.5x the 4194304-byte (4 MiB) default.
or split up the map task more as you say.
longer term, offloading large literals (storing them in blob storage and passing references instead) should help.
cc @high-accountant-32689
f
@little-cricket-84530 in Union we have automatic offloading, so this problem will never happen.
l
@thankful-minister-83577 where do you propose bumping this limit up?
a
I think it is
```yaml
admin:
  server:
    grpc:
      maxMessageSizeBytes: <limit-in-bytes>
```
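If you’re deploying via the flyte-core Helm chart, the same setting would typically be nested under `configmap.adminServer`. A hedged sketch, since the exact layout depends on your chart version:
```yaml
# values.yaml (flyte-core Helm chart) -- a sketch; the nesting under
# configmap.adminServer is an assumption, check your chart's values.yaml.
configmap:
  adminServer:
    server:
      grpc:
        # 10485760 bytes = 10 MiB, comfortably above the ~6.2 MB event
        maxMessageSizeBytes: 10485760
```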
p
Running into the same issue, thanks for the solution. I think it’s quite common to have a long list of paths in dynamic workflows; would it make sense to bump the default limit to a higher safe value? 🙂
Also @little-cricket-84530 did this work for you?
l
I didn't bump the limit up; I just got around the problem by serializing and writing the dataclasses into a file and giving each map task a partition number to read.
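Roughly like this, in case it helps anyone. A minimal sketch, not my exact code: all names (write_manifest, process_partition, fan_out) are illustrative, and binding the manifest with functools.partial assumes a recent flytekit with ArrayNode map tasks.
```python
import json
from functools import partial
from typing import List

from flytekit import dynamic, map_task, task, workflow
from flytekit.types.file import FlyteFile


@task
def write_manifest(all_inputs: List[List[str]]) -> FlyteFile:
    # Serialize every map-task input (each a handful of S3 URLs) into a
    # single file, so downstream tasks pass around one blob reference
    # instead of ~8000 inlined literals.
    path = "/tmp/manifest.json"
    with open(path, "w") as f:
        json.dump(all_inputs, f)
    return FlyteFile(path)


@task
def process_partition(manifest: FlyteFile, partition: int) -> str:
    # Each mapped task reads the shared manifest and picks out its own
    # slice by partition number.
    with open(manifest.download()) as f:
        all_inputs = json.load(f)
    urls = all_inputs[partition]
    return f"partition {partition}: processed {len(urls)} urls"


@dynamic
def fan_out(manifest: FlyteFile, n_partitions: int) -> List[str]:
    # The mapped literal inputs are now just integers; the manifest is
    # bound as a fixed input, so the event sent to admin stays far below
    # the gRPC size limit.
    return map_task(partial(process_partition, manifest=manifest))(
        partition=list(range(n_partitions))
    )


@workflow
def wf(all_inputs: List[List[str]], n_partitions: int) -> List[str]:
    manifest = write_manifest(all_inputs=all_inputs)
    return fan_out(manifest=manifest, n_partitions=n_partitions)
```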
p
ah, that's not a bad solution either, thanks!