# flytekit
e
Just noticed some interesting behavior in GitHub Actions when executing a launch plan during CI which had objects that used PickleTransformer passed as args to the LaunchPlan 🧵
Is there a particular reason S3Persistence is implemented using subprocess calls to awscli vs using boto3 to directly interact with S3? It appears that the version of awscli in our GitHub Actions runner image expects Python 3.8, but Python 3.9 is the only available distribution in the environment. Seems like this could be sidestepped entirely by avoiding the subprocess calls?
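For context, the two approaches being compared can be sketched roughly as below. This is only an illustration and not flytekit's actual S3Persistence code: the function names, the bucket and key arguments, and the exact `aws s3 cp` arguments are all assumptions made for the example.

```python
import shutil
import subprocess
from typing import List


def build_awscli_cp_command(local_path: str, bucket: str, key: str) -> List[str]:
    """Build the argv for an `aws s3 cp` upload (hypothetical helper)."""
    return ["aws", "s3", "cp", local_path, f"s3://{bucket}/{key}"]


def upload_via_subprocess(local_path: str, bucket: str, key: str) -> None:
    """Upload by shelling out to awscli, which must be installed and runnable."""
    # This only checks that an `aws` executable is on PATH. A broken install
    # (for example, one whose expected interpreter is absent) still fails
    # when the command is actually run below.
    if shutil.which("aws") is None:
        raise RuntimeError("awscli not found on PATH")
    subprocess.run(build_awscli_cp_command(local_path, bucket, key), check=True)


def upload_via_boto3(local_path: str, bucket: str, key: str) -> None:
    """Upload in-process with boto3, using the interpreter that runs this code."""
    # Imported lazily so the module still loads where boto3 is not installed.
    import boto3

    boto3.client("s3").upload_file(local_path, bucket, key)
```

The subprocess variant depends on whatever awscli happens to be installed in the runner image, while the boto3 variant only depends on packages installed for the current interpreter.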
t
cc: @freezing-airport-6809
f
Yes, this is for legacy reasons. We used subprocess earlier, but we now actively recommend using fsspec: https://docs.flyte.org/projects/flytekit/en/latest/plugins/fsspec.html
You can install it using the data persistence plugins. We are thinking of making it the default, though doing that without breaking folks will be hard. cc @high-accountant-32689 / @thankful-minister-83577
thx 1
@tall-lock-23197 can you point to the docs on how to install flytekitplugins-data-fsspec[aws]?
e
Got it, that makes sense. It’d be nice if it was the default, but I understand not wanting to break workflows. Maybe something for the flytekit 2.x milestone?
f
Hmm, we are actually thinking we can sidestep that. Flyte 1.3?
Would you be open to helping?
Let’s bring this up in a town hall.
e
Sure, happy to chat more about this
f
We need help with benchmarking too
👍 1
We are actually thinking of writing a faster layer
t
flytekitplugins-data-fsspec[aws] requires the flytekitplugins-data-fsspec and s3fs>=2021.7.0 libraries.
👍 1
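With standard pip extras syntax, that would presumably be installed with `pip install "flytekitplugins-data-fsspec[aws]"`. As a rough way to confirm that both distributions ended up in the CI environment, a small check like the one below could be run. The helper name is hypothetical, and the sketch only checks presence, not the `>=2021.7.0` version bound.

```python
from importlib import metadata
from typing import Iterable, List


def missing_distributions(names: Iterable[str]) -> List[str]:
    """Return the distribution names that are not installed in this environment."""
    missing = []
    for name in names:
        try:
            metadata.version(name)  # raises if the distribution is absent
        except metadata.PackageNotFoundError:
            missing.append(name)
    return missing


if __name__ == "__main__":
    # Distribution names taken from the message above.
    required = ["flytekitplugins-data-fsspec", "s3fs"]
    print("missing:", missing_distributions(required))
```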
e
Thanks very much @tall-lock-23197, will give that a try
👍 1