gentle-night-59824
06/17/2024, 11:48 PMpyflyte serialize workflows
is it possible to constrain its memory utilization? we're seeing this OOM quite frequentlyfreezing-airport-6809
freezing-airport-6809
glamorous-carpet-83516
06/18/2024, 8:36 AMthankful-journalist-40373
06/18/2024, 8:48 AMgentle-night-59824
06/18/2024, 2:29 PMworkflow_packages
in the config file
and we run this on a devbox ec2 instance which has about 32 Gis of memoryfreezing-airport-6809
gentle-night-59824
06/18/2024, 2:31 PMgentle-night-59824
06/18/2024, 2:33 PMserialization doesn't involve non-blocking mechanismfrom the logs it always gets killed at the same spot btw, these are the final logs
from pyarrow import HadoopFileSystem
278147
Killed
gentle-night-59824
06/18/2024, 2:34 PMfreezing-airport-6809
gentle-night-59824
06/18/2024, 2:35 PMfrom pyarrow import HadoopFileSystem
196688
17639897
===== experiment_0 =====
merchant_id
it prints out two numbers and then prints some tabulated experiment data which I assume is specific to our workflow π€gentle-night-59824
06/18/2024, 2:37 PMpapermill/papermill/papermill/iorw.py:50: FutureWarning: pyarrow.HadoopFileSystem is deprecated as of 2.0.0, please use pyarrow.fs.HadoopFileSystem instead.
looks like it comes from papermillgentle-night-59824
06/18/2024, 2:38 PMfreezing-airport-6809
gentle-night-59824
06/18/2024, 2:39 PMgentle-night-59824
06/18/2024, 2:41 PMfreezing-airport-6809
gentle-night-59824
06/18/2024, 5:58 PMgentle-night-59824
06/18/2024, 5:59 PMif __name__ == "__main__"
and that worked, thank you for the insights!