# ask-the-community
f
Do we have to change the default USER for spark tasks like above?
s
It's not mandatory but it's beneficial (and preferred) to set it to a non-root user.
f
@Samhita Alla, @Kevin Su, @Yee, could you elaborate? I have a task that utilizes Ray running locally, and Ray needs to mkdir internally, which got denied when the task is run under the Spark service account (SA). I am thinking of switching to the default (root) USER for the Spark-capable image to prevent endless permission issues like this:
File "/opt/venv/lib/python3.9/site-packages/flytekit/exceptions/scopes.py", line 203, in user_entry_point
        return wrapped(*args, **kwargs)
      File "/opt/venv/lib/python3.9/site-packages/dai_mle_models/xgboost/xgboost_tune_wf.py", line 83, in xgb_tune
        tuner = tune.Tuner(
      File "/opt/venv/lib/python3.9/site-packages/ray/tune/tuner.py", line 152, in __init__
        self._local_tuner = TunerInternal(**kwargs)
      File "/opt/venv/lib/python3.9/site-packages/ray/tune/impl/tuner_internal.py", line 104, in __init__
        self._experiment_checkpoint_dir = self._setup_create_experiment_checkpoint_dir(
      File "/opt/venv/lib/python3.9/site-packages/ray/tune/impl/tuner_internal.py", line 273, in _setup_create_experiment_checkpoint_dir
        os.makedirs(path, exist_ok=True)
      File "/usr/lib/python3.9/os.py", line 215, in makedirs
        makedirs(head, exist_ok=exist_ok)
      File "/usr/lib/python3.9/os.py", line 225, in makedirs
        mkdir(name, mode)

Message:

    [Errno 13] Permission denied: '/ray_results'
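Side note: the failing call is Ray Tune creating its default results directory, which becomes /ray_results when HOME resolves to /. One way around this specific error, without changing the image user, might be to point Tune at a writable path. A minimal sketch, assuming a Ray 2.x API where RunConfig still accepts local_dir (newer releases use storage_path instead):

from ray import tune
from ray.air import RunConfig  # Ray 2.x; newer versions moved this to ray.train


def objective(config):
    # Hypothetical trainable: just returns a dummy metric.
    return {"score": config["x"] ** 2}


tuner = tune.Tuner(
    objective,
    param_space={"x": tune.uniform(0.0, 1.0)},
    # Write experiment results to a path the non-root Spark user can create,
    # instead of the default ~/ray_results (here: /ray_results).
    run_config=RunConfig(local_dir="/tmp/ray_results"),
)
results = tuner.fit()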
s
May I know why you are using the Spark service account to run Ray tasks?
f
@Samhita Alla, @Kevin Su, @Yee, isn't this a common scenario, where non-Spark tasks (that use Ray) and Spark tasks co-exist in a workflow and use the same Docker image, based off the Flyte Spark Dockerfile (https://github.com/flyteorg/flytesnacks/blob/master/cookbook/integrations/kubernetes/k8s_spark/Dockerfile)? Since that Dockerfile has:
# Set /root user and group
RUN chown -R ${spark_uid}:${spark_uid} /root
...
USER ${spark_uid}
Therefore the Ray task must also run under USER ${spark_uid}, and that is causing the failure.
k
The best practice is to use different images for those two tasks.
You can follow this guide to set multiple container images in a single workflow.
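A minimal sketch of that, using flytekit's per-task container_image argument; the image URIs below are placeholders for images you would build yourself (one with the Ray dependencies, one with the Spark plugin and spark_uid user):

from flytekit import task, workflow

# Placeholder image URIs; replace with your own registry/tags.
RAY_IMAGE = "ghcr.io/myorg/flyte-ray:latest"
SPARK_IMAGE = "ghcr.io/myorg/flyte-spark:latest"


@task(container_image=RAY_IMAGE)
def ray_task() -> int:
    # Ray/Tune work would live here.
    return 1


@task(container_image=SPARK_IMAGE)
def spark_task(x: int) -> int:
    # A real Spark task would also set task_config=Spark(...), which requires
    # flytekitplugins-spark at module import time (this is the catch discussed below).
    return x + 1


@workflow
def wf() -> int:
    return spark_task(x=ray_task())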
f
@Kevin Su, thanks. However, when I have non-Spark and Spark tasks in the same workflow file (with imports from both the custom and the Spark dependencies), I still have to build both sets of dependencies into the same image. So the two images end up identical, except that one uses the root user and the other uses the spark_uid user (USER ${spark_uid}). What's your comment on that?
k
One way to avoid importing the Spark dependencies in the Ray task is to import the Spark-specific package inside the @task body, like:
@task
def spark_task():
    import pyspark
    ...
Another way is to use a reference task in your workflow.
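A minimal sketch of the reference-task route; the project/domain/name/version values are placeholders pointing at a Spark task registered separately from its own image, so this file never imports the Spark plugin:

from flytekit import reference_task, task, workflow


# Placeholder coordinates for the Spark task registered from its own image.
@reference_task(
    project="flytesnacks",
    domain="development",
    name="my_project.spark_jobs.spark_task",
    version="v1",
)
def spark_task(n: int) -> float:
    ...


@task
def ray_task() -> int:
    # Ray work would live here.
    return 10


@workflow
def wf() -> float:
    return spark_task(n=ray_task())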
f
@Kevin Su, by Spark task I meant a PySpark task:
@task(
    task_config=Spark(
        spark_conf={...},
    ),
)
def a():
    ...
Importing pyspark in the task body won't work, correct?
s
Yeah, it won't. You need to import the plugins globally. @Kevin Su @Eduardo Apolinario (eapolinario), I think we need to improve the UX here. If all the tasks are associated with custom images, we need to enable some mechanism to simplify specifying the default image. Not sure if that's a possibility, though. In this case, there needs to be either a single image with the user set to root, or three images: a default image with the two plugins installed, one image for Spark, and one for Ray.
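To make the "globally" part concrete: the Spark config class is consumed by the @task decorator when the module is imported, so the plugin import cannot be pushed into the task body. A sketch, with a placeholder spark_conf:

from flytekit import task
from flytekitplugins.spark import Spark  # must be importable wherever this module is loaded


@task(
    task_config=Spark(
        spark_conf={"spark.executor.instances": "2"},  # placeholder conf
    ),
)
def my_spark_task() -> int:
    # pyspark itself could still be imported lazily here, but the
    # flytekitplugins-spark dependency above cannot be deferred.
    import pyspark  # noqa: F401
    return 1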
f
I used a single image with all dependencies combined and with the user set to root. It runs with the Spark SA, and a workflow with both Spark and non-Spark tasks (including Ray) runs fine.