Fabio Grätz
04/13/2023, 6:06 PM
torch.distributed.init_process_group(), see here. Back then I just created a Kubeflow PyTorchJob to run it, which worked. The image needed nvidia-cuda-toolkit. To summarize, as of ~1.5 years ago I think it would already have been supported.
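For reference, a minimal sketch of what the training-script side of such a job can look like. It assumes the PyTorchJob pods have MASTER_ADDR, MASTER_PORT, WORLD_SIZE and RANK set in their environment, which is what the env:// rendezvous of torch.distributed reads. The helper names read_rendezvous_env and init_distributed are made up for illustration, and this is not the original setup referred to above.

```python
import os

# Environment variables read by the env:// rendezvous of torch.distributed.
REQUIRED_VARS = ("MASTER_ADDR", "MASTER_PORT", "WORLD_SIZE", "RANK")


def read_rendezvous_env(env=None):
    """Return the rendezvous settings, raising if any variable is missing.

    Hypothetical helper: it only validates and parses the environment.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if name not in env]
    if missing:
        raise RuntimeError(f"missing rendezvous variables: {missing}")
    return {
        "master_addr": env["MASTER_ADDR"],
        "master_port": int(env["MASTER_PORT"]),
        "world_size": int(env["WORLD_SIZE"]),
        "rank": int(env["RANK"]),
    }


def init_distributed(backend="nccl"):
    """Initialize the default process group from the environment.

    Assumes a GPU image; pass backend="gloo" for CPU-only runs.
    """
    # Imported lazily so read_rendezvous_env works without torch installed.
    import torch.distributed as dist

    settings = read_rendezvous_env()
    # With init_method="env://", torch reads the four variables itself.
    dist.init_process_group(backend=backend, init_method="env://")
    return settings
```

Checking the variables up front gives a readable error when the script is started outside the job, rather than a failure inside the rendezvous.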