Current state (please correct me where I'm wrong) ...
# slurm-flyte-wg
e
Current state (please correct me where I'm wrong) 🧵
3 task types: • SlurmTask: most simple, runs an existing script on the Slurm cluster • SlurmShellTask: define a script in the task body, which gets copied and run on Slurm cluster, supports input/output substitution • SlurmFunctionTask: passed in to regular @task config, allows defining the python function as you normally would and runs it on the Slurm cluster. This is the one I have the most questions about. ◦ It seems to require a container, I assume this is just for the calling env i.e. no docker deps on Slurm cluster ◦ The example defines a script in the config, is this required? If so, what are the gains over SlurmShellTask? I've uncovered some bugs with the basic ShellTask as I'm still trying to create a workflow out of those. I'll follow up in another thread once I verify my understanding of the above. The code looks much more fully-formed and production-ready overall though! I'm thinking we want to feature freeze around the above and work on stability / usability / documentation though. Let me know your thoughts!
f
this is amazing also @eager-processor-63090 we should probably demo at bioit world?
d
SlurmShellTask
has a bug about output prefix when we are going to write output to blob store.
Slurm function task is a superset of slurm shell task and SlurmTask, we can use slurm function task only
e
we can use slurm function task only
What do you mean by this? As in that's the only one that's currently functional? I saw the fk PR got merged for func task
@freezing-airport-6809 absolutely this, workspaces + notebooks, and the folding
d
yes only function task is merged now
for other task type
I'll do it with abao this week later
I'm working on running gpu task on a slurm gpu cluster
> we can use slurm function task only I mean we can use slurm function task to achieve what
slurm shell task
and
slurm task
can do
e
Interesting. And you're building your GPU task using function task?
Also I see SlurmShellTask and SlurmTask aren't included in that PR that merged. Are we going to orphan both then since FunctionTask is supposed to cover their functionality?
d
Interesting. And you're building your GPU task using function task?
yes doing with abao
the gpu setup is done on an aws ec2 cluster
Also I see SlurmShellTask and SlurmTask aren't included in that PR that merged. Are we going to orphan both then since FunctionTask is supposed to cover their functionality?
yes we are going to merge them after more testing
e
Okay. I'm guessing I should be building on function task then. I'm still curious where an image is required for function task.
d
nope its not required
e
Wow stunning docs, this is really great! I will throw out some assumptions and start over on the current "official path". Thanks for your help 🙏
d
yes no problem
e
Hmm unfortunately this will be a non-starter for a lot of folks. I'm going to keep working with the humble SlurmTask and hopefully we can get that somewhat production ready over the next month.
d
okok i will do it with abao tmr
e
I'll open a PR with all my pain points so we can discuss more concretely. Have a good evening!
d
okok
thank you Pryce
f
Great catch Pryce - that’s really odd