# ray-integration
Hi, I am facing issues while performing Ray tuning in Flyte. I am using KubeRay operator v0.5.2. Using RayJobConfig, I am requesting a head node and 1 worker node.
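from flytekitplugins.ray import HeadNodeConfig, RayJobConfig, WorkerNodeConfig

# One head node plus a single worker group with one replica
ray_config = RayJobConfig(
    head_node_config=HeadNodeConfig(ray_start_params={"log-color": "True"}),
    worker_node_config=[WorkerNodeConfig(group_name="ray-group", replicas=1)],
    runtime_env={"pip": ["numpy", "pandas"]},
)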
For instance, the pod is assigned to a node with IP 172.22.1.123, and that node has secondary IPs 172.22.1.234 and 172.22.1.456. The trials of the tuning process run on the secondary IPs. Only the trials running on 172.22.1.234 complete and return proper results; the trials assigned to the other secondary IPs fail with an error. I have attached the error screenshot. Why are the trials being assigned to the secondary IPs, and why do only the trials on a single IP succeed while the trials assigned to the other IPs fail?
Hi, does anyone have any suggestions or a solution for this?