I assume this one?
@swift-scooter-38934 can you judge whether yunikorn gang scheduling will be usable also with other plugins than ray like pytorch, mpi, tensorflow, … with your PR? 🙏
cool-lifeguard-49380
09/25/2024, 6:33 AM
We use scheduler plugins co-scheduling but unfortunately observed some stability issues when there were many unschedulable pods due to resource constraints.
s
swift-scooter-38934
09/30/2024, 4:50 AM
@cool-lifeguard-49380 Yes.
Plugins in this PR add task group annotations to pods when a new job submit.
According to task group annotations, pods under the job are scheduled with gang scheduling by Yunikorn.
So, gang scheduling is available when pods with task group annotations.
f
freezing-airport-6809
09/30/2024, 6:05 AM
Its not just the annotation right Chen?
s
swift-scooter-38934
10/02/2024, 12:19 PM
Yes.
Application ID and queue name are also important.
when defining partition and queue configuration in Yunikorn to control resource, queue priority and preemption.