enough-notebook-15673
03/20/2023, 5:30 AMOOMKilled
error, and I found many similar questions in this channel. According to the discussion, the reason is resource shortage, so I set @task(limits=Resources(storage="20Gi", ephemeral_storage="20Gi", mem="40Gi", cpu="6"))
which is enough for the error task. But I still got the same error. I do not use GPU. Any other reasons for this?
[1/1] currentAttempt done. Last Error: USER::Pod failed. No message received from kubernetes.
[f52215af773e248b0af3-n2-0] terminated with exit code (1). Reason [OOMKilled]. Message:
asctime": "2023-03-20 05:09:46,909", "name": "flytekit", "levelname": "WARNING", "message": "FlyteSchema is deprecated, use Structured Dataset instead."}
tar: Removing leading `/' from member names
{"asctime": "2023-03-20 05:09:48,309", "name": "flytekit", "levelname": "WARNING", "message": "FlyteSchema is deprecated, use Structured Dataset instead."}
...
tall-lock-23197
enough-notebook-15673
03/20/2023, 7:05 AMenough-notebook-15673
03/20/2023, 10:06 AMenough-notebook-15673
03/20/2023, 10:08 AMOOMKilled
quick-salesclerk-18019
03/20/2023, 11:04 AMenough-notebook-15673
03/20/2023, 11:05 AMenough-notebook-15673
03/20/2023, 11:05 AMquick-salesclerk-18019
03/20/2023, 11:09 AMenough-notebook-15673
03/20/2023, 11:10 AMminikube
. flyte use it?enough-notebook-15673
03/20/2023, 11:11 AMryo@ryo:~$ kubectl get po
NAME READY STATUS RESTARTS AGE
py39-cacher 0/1 Completed 0 25m
ryo@ryo:~$ kubectl get node
NAME STATUS ROLES AGE VERSION
344a31d3f672 Ready control-plane,master 26m v1.24.4+k3s1
enough-notebook-15673
03/20/2023, 11:13 AMryo@ryo:~$ kubectl get po -n flyte
NAME READY STATUS RESTARTS AGE
flyte-sandbox-docker-registry-b5c57c55-rtc4q 1/1 Running 0 29m
flyte-sandbox-kubernetes-dashboard-6757db879c-4q6cm 1/1 Running 0 29m
flyte-sandbox-proxy-d95874857-8t8h7 1/1 Running 0 29m
flyte-sandbox-postgresql-0 1/1 Running 0 29m
flyte-sandbox-75c5d88454-tpm9c 1/1 Running 0 29m
flyte-sandbox-minio-645c8ddf7c-9f9zg 1/1 Running 0 29m
enough-notebook-15673
03/20/2023, 11:14 AMryo@ryo:~$ kubectl get node -n flyte
NAME STATUS ROLES AGE VERSION
344a31d3f672 Ready control-plane,master 30m v1.24.4+k3s1
enough-notebook-15673
03/20/2023, 11:16 AMryo@ryo:~$ kubectl top node -n flyte
W0320 20:15:45.557393 176553 top_node.go:119] Using json format to get metrics. Next release will switch to protocol-buffers, switch early by passing --use-protocol-buffers flag
NAME CPU(cores) CPU% MEMORY(bytes) MEMORY%
344a31d3f672 185m 2% 1797Mi 2%
enough-notebook-15673
03/20/2023, 11:22 AMryo@ryo:~$ minikube delete
๐ "minikube" profile does not exist, trying anyways.
๐ Removed all traces of the "minikube" cluster.
quick-salesclerk-18019
03/20/2023, 11:25 AMquick-salesclerk-18019
03/20/2023, 11:27 AMflytectl demo start
flyte will set up a k8s environment for you... https://docs.flyte.org/projects/flytectl/en/latest/gen/flytectl_demo.htmlenough-notebook-15673
03/20/2023, 11:27 AMryo@ryo:~$ kubectl top node -n flyte
W0320 20:26:20.436047 179361 top_node.go:119] Using json format to get metrics. Next release will switch to protocol-buffers, switch early by passing --use-protocol-buffers flag
NAME CPU(cores) CPU% MEMORY(bytes) MEMORY%
344a31d3f672 167m 2% 1793Mi 2%
quick-salesclerk-18019
03/20/2023, 11:27 AMenough-notebook-15673
03/20/2023, 11:28 AMquick-salesclerk-18019
03/20/2023, 11:29 AMminikube profile list
?quick-salesclerk-18019
03/20/2023, 11:29 AMenough-notebook-15673
03/20/2023, 11:30 AM๐ maybe list the profiles withyes, I already got this suggestion and tried it.?minikube profile list
ryo@ryo:~$ minikube stop
๐คท Profile "minikube" not found. Run "minikube profile list" to view all profiles.
๐ To start a cluster, run: "minikube start"
ryo@ryo:~$ minikube profile list
๐คน Exiting due to MK_USAGE_NO_PROFILE: No minikube profile was found.
๐ก Suggestion:
You can create one using 'minikube start'.
enough-notebook-15673
03/20/2023, 11:31 AMfreezing-airport-6809
freezing-airport-6809
enough-notebook-15673
03/20/2023, 2:24 PMtall-lock-23197
increasing limits is futileMy bad. Haven't closely looked at what's being assigned.
tall-lock-23197
enough-notebook-15673
03/20/2023, 11:25 PMRequest rejected by the API, due to Invalid input.
when I set requests
instead of limits
.
$ pyflyte run --remote --image <http://gcr.io/myimage:mytag|gcr.io/myimage:mytag> src/train.py training_workflow
{"asctime": "2023-03-21 08:22:47,049", "name": "flytekit", "levelname": "WARNING", "message": "FlyteSchema is deprecated, use Structured Dataset instead."}
Request rejected by the API, due to Invalid input.
Reason:
Input Request: {
"id": {
...
enough-notebook-15673
03/20/2023, 11:36 PMenough-notebook-15673
03/20/2023, 11:39 PMenough-notebook-15673
03/20/2023, 11:49 PMenough-notebook-15673
03/20/2023, 11:51 PMenough-notebook-15673
03/21/2023, 12:04 AMenough-notebook-15673
03/21/2023, 12:22 AMminikube start --cpus 6 --memory 40G --disk-size 50G --kubernetes-version v1.21.14 --driver=kvm2
but nothing changed.tall-lock-23197
enough-notebook-15673
03/21/2023, 8:02 AM@task(requests=Resources(storage="20Gi", mem="50Gi", cpu="6"))
tall-lock-23197
tall-lock-23197
limits
instead of requests
?enough-notebook-15673
03/21/2023, 11:18 PMtall-lock-23197