https://flyte.org
Join Slack
Hi there! How to increase the allowed output size of the node? ```RuntimeExecutionError: failed du...
a

Alexey Kharlamov

over 2 years ago
Hi there! How to increase the allowed output size of the node?
RuntimeExecutionError: failed during plugin execution, caused by: failed to execute handle for plugin [k8s-array]: output file @[] is too large [255540150] bytes, max allowed [10485760] bytes
a
v
s
  • 3
  • 2
  • 280
Hi all, I am trying run a task within a container that I am building within a local deployment of fl...
j

John Lawlor

about 3 years ago
Hi all, I am trying run a task within a container that I am building within a local deployment of flyte-sandbox. This is what I run:
flytectl sandbox start --source . --imagePullPolicy IfNotPresent
flytectl sandbox exec -- docker build -t new_image .
When I run
flytectl sandbox exec -- docker images
I see my image inside of the flyte-sandbox container.
REPOSITORY                                     TAG                       IMAGE ID       CREATED          SIZE
new_image                                      latest                    044c6f6eb860   44 minutes ago   1.49GB
Within my
@task
, I reference
new_image
. When I go to run the workflow remotely, I receive the following error after about 30 seconds:
[1/1] currentAttempt done. Last Error: USER::containers with unready status: [ffdb7fdd18e544b99a24-n0-0]|Back-off pulling image "new_image:latest"
Any thoughts?
j
y
  • 2
  • 2
  • 280
Hi all, I'm getting an OOM error and concerned the limits I'm attempting to provide aren't being tak...
r

Ruksana Kabealo

over 3 years ago
Hi all, I'm getting an OOM error and concerned the limits I'm attempting to provide aren't being taken. When I kubectl describe the pod that failed due to OOMKilled - I see that it says the pod has a memory limit of 2G. However, in the flyte-deployment-config.yaml that I deployed using helm, I have allocated for a memory limit of 8G. See attached images for screenshots of the limit and the description. None of the @tasks have any resource limits imposed. Any help would be much appreciated!
r
n
  • 2
  • 2
  • 280
Hi flyte community, happy monday :laughing: So I am trying to get some clarification on some task re...
j

Jay Ganbat

over 3 years ago
Hi flyte community, happy monday 😆 So I am trying to get some clarification on some task related question: I have a
regular python task(T1)
that calls another
regular python task(T2)
without it being dynamic; T2 is a very simple function that runs quickly. I did not want to make T1 dynamic since i didnt wanna add overhead of launching pod for T2. Also I am using T2 in a workflow as a regular python task. My question is
is it a correct way to use it?
or is it fine to call regular task from another regular task. i could create a regular python function and call that from T1 but then i need to create another wrapper for T2 which just calls the regular function and i wanted to avoid that extra stuff. From this page https://docs.flyte.org/projects/cookbook/en/latest/auto/core/flyte_basics/task.html#sphx-glr-auto-core-flyte-basics-task-py i read that
You can execute a Flyte task as any normal function.
maybe i took this statement a bit too liberally
j
y
  • 2
  • 12
  • 279
Bad news <@U02SJRLAPKM>, I bounced to Flyte Console `v0.55.0` and I'm back to `Failed to fetch data:...
a

Alex Bain

over 3 years ago
Bad news @Nastya Rusina, I bounced to Flyte Console
v0.55.0
and I'm back to
Failed to fetch data: invalid wire type 4 at offset 1
for
/console
a
n
+2
  • 4
  • 5
  • 278
Issue: I have been following the docs <here>, but while running the code `unionml serve app:app -...
a

Akarsh Upadhyay

over 3 years ago
Issue: I have been following the docs here, but while running the code
unionml serve app:app --reload --model-path /tmp/model_object.pt
, I get this error on the browser
{"detail": "Method Not Allowed"}
and in the terminal, it is showing:
INFO: 127.0.0.1:64825 - "GET /predict HTTP/1.1" 405 Method Not Allowed
My Folder structure looks like this (i.e in same directory): -> app.py -> client.py -> main.py (same code as app.py) Thanks,
a
s
n
  • 3
  • 4
  • 277
Good morning! We’re trying out Dynamic tasks for distributed computing/forecasting and notice it tak...
s

Stefan Avesand

over 3 years ago
Good morning! We’re trying out Dynamic tasks for distributed computing/forecasting and notice it takes over 8 minutes for Flyte to kick off the first task. Is this normal? (@Kelsey Messer FYI)
s
k
+2
  • 4
  • 22
  • 277
been reading up on some of the new features and found out about this approve <https://github.com/fly...
l

Laura Lin

over 2 years ago
been reading up on some of the new features and found out about this approve https://github.com/flyteorg/flytesnacks/blob/master/cookbook/core/control_flow/waiting_for_external_inputs.py#L150-L158. When I tried doing fast registration after copying pasting, it errors out with the following.
RPC Failed, with Status: StatusCode.INTERNAL
	details: failed to compile workflow for [resource_type:WORKFLOW project:"flytetester" domain:"development" name:"flyte.example.reporting_with_approval_wf" version:"unique-version" ] with err failed to compile workflow with err Collected Errors: 2
	Error 0: Code: ParameterNotBound, Node Id: end-node, Description: Parameter not bound [o0].
	Error 1: Code: VariableNameNotFound, Node Id: n3, Description: Variable [o0] not found on node [n3].
l
k
  • 2
  • 5
  • 276
Hey all! anyone has given it a try to convert Kedro Pipelines to Flyte Workflows. We are starting to...
j

Javier Hernandez Montes

over 2 years ago
Hey all! anyone has given it a try to convert Kedro Pipelines to Flyte Workflows. We are starting to use flyte and it would be nice to have some kedro pipelines running there
j
k
s
  • 3
  • 6
  • 276
hi team, what’s the recommended way to pass transformers models and tokenizers between tasks? eg. i ...
m

Melody Lui

over 2 years ago
hi team, what’s the recommended way to pass transformers models and tokenizers between tasks? eg. i load using
model=transformers.AutoModelForCausalLM.from_pretrained(...)
, but i cannot directly return model and pass it to the next tasks because the model cannot be pickled
cannot pickle 'torch._C._distributed_c10d.ProcessGroup' object
. Do i need to save the model with transformers’
save_pretrained
to nfs and load it when i need to use it in the next task?
m
e
e
  • 3
  • 4
  • 275
Previous161718Next

Flyte

Flyte enables production-grade orchestration for machine learning workflows and data processing created to accelerate local workflows to production.

Powered by