Nicholas LoFaso

Hi, is there a preferred approach to deploying Flyte to GKE (Helm or Kustomize)? We're currently using an older version of the Flyte Helm chart. I tried to upgrade to the latest version, but I ran into two issues. Before I dig too deeply into fixing them, I thought I might switch to Kustomize if that is the preferred method.

Rezwan Abir

over 2 years ago

but getting an error

containers with unready status: [f7592492fe4da4f14b2f-n0-0]|Failed to apply default image tag "cr.flyte.org/flyteorg/flytekit:py3.10-1.3.2:image": couldn't parse image reference "cr.flyte.org/flyteorg/flytekit:py3.10-1.3.2:image": invalid reference format
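
The reference in the error above has two colons in its last path component (`flytekit:py3.10-1.3.2:image`), while an image reference normally carries at most one tag after the name. A minimal sketch of that rule, assuming a deliberately simplified format (no digest handling); `split_image_reference` is a hypothetical helper, not the parser Kubernetes or Flyte actually uses:

```python
from typing import Optional, Tuple


def split_image_reference(ref: str) -> Tuple[str, Optional[str]]:
    """Split 'registry/path:tag' into (name, tag) using simplified rules."""
    # The tag, if any, lives in the last path component; a colon earlier in
    # the string belongs to a registry port (e.g. localhost:5000/app).
    head, sep, last = ref.rpartition("/")
    parts = last.split(":")
    # More than one colon, or an empty name/tag, is rejected.
    if len(parts) > 2 or any(part == "" for part in parts):
        raise ValueError(f"invalid reference format: {ref!r}")
    name = head + sep + parts[0]
    tag = parts[1] if len(parts) == 2 else None
    return name, tag


print(split_image_reference("cr.flyte.org/flyteorg/flytekit:py3.10-1.3.2"))
try:
    split_image_reference("cr.flyte.org/flyteorg/flytekit:py3.10-1.3.2:image")
except ValueError as err:
    print(err)
```

If this reading is right, the trailing `:image` is the part to track down in the default image configuration.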

Quinn Romanek

almost 4 years ago

hey all - I'm having an issue launching a task execution on my bare metal k8s cluster - when flyteadmin attempts to create a new FlyteWorkflow resource, it fails because the default namespace ({{project}}-{{domain}}) doesn't exist. Have I misconfigured something? I'm wondering how it works in the sandbox
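
For checking which namespaces the `{{project}}-{{domain}}` template expects, a small sketch that expands it for every project/domain pair. `render_namespaces` is a hypothetical helper and the project and domain names are placeholders, not values taken from this cluster:

```python
from itertools import product
from typing import Iterable, List


def render_namespaces(template: str, projects: Iterable[str], domains: Iterable[str]) -> List[str]:
    """Expand a '{{project}}-{{domain}}' style template for every pair."""
    return [
        template.replace("{{project}}", project).replace("{{domain}}", domain)
        for project, domain in product(projects, domains)
    ]


# Hypothetical project and domain names, for illustration only.
namespaces = render_namespaces(
    "{{project}}-{{domain}}", ["myproject"], ["development", "production"]
)
for ns in namespaces:
    print(ns)
```

The printed names can then be compared against the output of `kubectl get namespaces`.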

Alexander Sarson

about 2 years ago

Hi, I am trying to set up flyte-core with MinIO inside the cluster, but I can't seem to get it to work. I get the following error when I try to run a workflow on the cluster: "Failed to establish a new connection: [Errno 11001] getaddrinfo failed'))". This is my connection setup. Can somebody spot what I am doing wrong?

storage:
  # -- Sets the storage type. Supported values are sandbox, s3, gcs and custom.
  type: custom
  custom:
    type: minio
    container: flyte
    stow:
      kind: s3
      config:
        access_key_id: XXXX
        auth_type: accesskey
        secret_key: XXXX
        disable_ssl: true
        endpoint: http://minio-hl.minio.svc.cluster.local:9000
        region: us-east-1
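
`getaddrinfo failed` is a name-resolution error, and Errno 11001 is the Windows "host not found" code, so one thing worth checking is whether the machine that raises the error can resolve the endpoint host at all (names ending in `.svc.cluster.local` usually resolve only from inside the cluster). A minimal sketch using only the standard library; `endpoint_host_port` and `can_resolve` are hypothetical helper names:

```python
import socket
from typing import Tuple
from urllib.parse import urlparse


def endpoint_host_port(endpoint: str) -> Tuple[str, int]:
    """Extract (host, port) from an endpoint URL such as http://host:9000."""
    parsed = urlparse(endpoint)
    if parsed.hostname is None:
        raise ValueError(f"endpoint has no host: {endpoint!r}")
    # Fall back to the scheme's default port when none is given.
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    return parsed.hostname, port


def can_resolve(host: str, port: int) -> bool:
    """Return True if this machine can resolve host, False on a lookup error."""
    try:
        socket.getaddrinfo(host, port)
        return True
    except socket.gaierror:
        return False


host, port = endpoint_host_port("http://minio-hl.minio.svc.cluster.local:9000")
print(host, port)
```

Calling `can_resolve(host, port)` on the machine that submits the workflow, and again from a pod inside the cluster, shows where the lookup fails.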

Albert Wibowo

over 2 years ago

Hi all, I seem to get the following error when running Flyte remotely. Has anyone encountered this before?

Matplotlib created a temporary config/cache directory at /tmp/matplotlib-dqrfqnc3 because the default path (/home/flytekit/.config/matplotlib) is not a writable directory; it is highly recommended to set the MPLCONFIGDIR environment variable to a writable directory, in particular to speed up the import of Matplotlib and to better support multiprocessing.

My Flyte workflow is quite simple and is as follows:

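import pandas as pd
from flytekit import task, workflow
from sklearn.datasets import load_breast_cancer


@task()
def get_data() -> pd.DataFrame:
    # Load the dataset as a pandas DataFrame.
    data_dict = load_breast_cancer(as_frame=True)
    print(data_dict["data"].head())
    return data_dict["data"]


@task()
def clean_data_pre_split(data: pd.DataFrame) -> pd.DataFrame:
    # Placeholder: returns the data unchanged.
    return data


@workflow()
def training_workflow():
    data = get_data()
    data = clean_data_pre_split(data=data)
    return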
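
The Matplotlib message quoted above asks for `MPLCONFIGDIR` to point at a writable directory. A minimal sketch of doing that from Python before Matplotlib is imported; the use of a fresh temporary directory is an assumption for illustration, and any writable path would do:

```python
import os
import tempfile

# Assumption: a throwaway temp directory is acceptable as the config/cache dir.
mpl_config_dir = tempfile.mkdtemp(prefix="mplconfig-")

# Must be set before the first `import matplotlib` in the process.
os.environ["MPLCONFIGDIR"] = mpl_config_dir

print(os.environ["MPLCONFIGDIR"])
```

The same variable could instead be set in the container image or task environment, so that it is in place before any task code runs.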

Cody Scandore

over 2 years ago

I'm following the single cluster deployment according to this link. Cloud is AWS, db is RDS, and I'm using EKS managed nodes (Amazon AMI). My pod almost immediately enters CrashLoopBackOff. I have confirmed node <> database connectivity by ssh'ing into the node and querying the RDS database. One area of suspicion is the serviceAccount annotation. I have set create: false and I am using the EKS cluster service role ARN.

serviceAccount:
  create: false
  annotations:
    eks.amazonaws.com/role-arn: "arn:aws:iam::<id-redacted>:role/eksctl-flyte-cluster-cluster-ServiceRole"

Ankit Goyal

over 2 years ago

Hello, I have a simple workflow with dataclass_json as the input:

from flytekit import workflow, LaunchPlan, task
from dataclasses import dataclass
from dataclasses_json import dataclass_json


@dataclass_json
@dataclass
class MyDataClass:
    # Declare fields as class-level annotations; with only a hand-written
    # __init__, the dataclass has no fields to serialize.
    a: int
    b: str


@task
def example_node(input: MyDataClass):
    print(f"{input.a}")


@workflow
def my_dataclass_workflow(input: MyDataClass):
    return example_node(input=input)

Should the UI be editable to provide input as json?
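
For reference, a sketch of the JSON shape that corresponds to such a dataclass, using only the standard library. It does not involve flytekit or the UI, and the field values are made up for illustration:

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class MyDataClass:
    a: int
    b: str


# Serialize an instance to the JSON text one would type as input.
payload = json.dumps(asdict(MyDataClass(a=1, b="hello")))
print(payload)  # {"a": 1, "b": "hello"}

# Round-trip back into the dataclass to confirm the shape matches.
restored = MyDataClass(**json.loads(payload))
```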

Gaurav Kumar

about 2 years ago

Hi, I’m using the Flyte sandbox and trying to run some tasks in Flyte. I have a task that needs to access the GPU on my host machine. Since the GPU is on my host machine, it needs to be passed through to the k8s pod that’s running inside the docker container (sandbox). However, I see that Kubernetes is not able to schedule the pod.

Events:
  Type     Reason            Age    From               Message
  ----     ------            ----   ----               -------
  Warning  FailedScheduling  11m    default-scheduler  0/1 nodes are available: 1 Insufficient nvidia.com/gpu, 1 node(s) didn't match Pod's node affinity/selector. preemption: 0/1 nodes are available: 1 Preemption is not helpful for scheduling.

I also checked kubectl describe node 31e2e2ba6f9f (31e2e2ba6f9f is the flyte-sandbox container in which Flyte is running in the sandbox environment). It looks like the node (a docker container in this case) itself doesn’t have a GPU to allocate to the pod.

>kubectl describe node 31e2e2ba6f9f
...
Capacity:
  cpu:                8
  ephemeral-storage:  944801904Ki
  hugepages-1Gi:      0
  hugepages-2Mi:      0
  memory:             65774996Ki
  pods:               110
Allocatable:
  cpu:                8
  ephemeral-storage:  919103291491
  hugepages-1Gi:      0
  hugepages-2Mi:      0
  memory:             65774996Ki
  pods:               110
...

I think this is happening because the flyte-sandbox docker container is not able to access GPUs. I logged into the container, and nvidia-smi, which needs to work for a container to access GPUs, is not available there.

❯ docker exec -it 31e2e2ba6f9f /bin/sh
/ # nvidia-smi
/bin/sh: nvidia-smi: not found
/ #

Note that outside of Flyte, I’m able to run the task image in a docker container on my host machine as below; --gpus all needs to be passed for the container to access GPUs.

docker run --gpus all <task_image_with_gpu_req>

I think if we can somehow pass --gpus all to the docker run command for the sandbox during the flytectl demo start command, it should work. Please help!

Fran Jurinec

over 2 years ago

Hey all - new to Flyte here, trying to set up a demo in an academic context, but our university cluster only provides an OKD Kubernetes instance without root privileges. So far I've tried setting up the sandbox and the Helm chart, with no success: the Helm chart breaks when attempting to set up RBAC, and the sandbox k3d/k3s does not run rootless. I'm considering abstracting a layer further and setting up docker-in-docker with k3d inside of that, but I'm not sure that's the smartest idea. Is there maybe a Helm chart without the RBAC elements, or some more manual way to set up the services?

Ahmed Laadraoui

over 2 years ago

Hi @channel, I hope you are all doing great. I am new to Flyte and I want to learn it, as I really see big potential in it. I just deployed Flyte in my AWS account using Opta. Now I am trying to create a project, push some workflows, and run them. Any documentation links, articles, or explanations that would help me establish a connection with my deployed Flyte instance and process tasks and workflows remotely would be very helpful. Many thanks, Ahmed
