bored-needle-72209
06/10/2024, 2:58 PMabundant-laptop-64153
06/10/2024, 3:32 PMabundant-laptop-64153
06/10/2024, 3:32 PMbored-needle-72209
06/10/2024, 3:42 PMabundant-laptop-64153
06/10/2024, 4:11 PMAZRebalance
in https://docs.aws.amazon.com/eks/latest/userguide/managed-node-groups.html
if the error is consistently reproducible with the same pods then its likely not a rebalance issue. perhaps the job is crashing the node? at one point we had crashes because of too little diskspace, fixed by increasing the diskspace. the point is, if the pod is completely gone then the node is likely gone, so you will need to debug reasons as to why the node was removed/crashed/etcbored-needle-72209
06/10/2024, 4:24 PM