Current Problems
Scheduling by Core with Hyperthreading enabled
For the time being, we have disabled hyperthreading on the compute nodes. There had been too many issues with HT being enabled. We may look into this further in the future.
Multinode Jobs, which need ssh and do not use Slurm mechanisms (like srun) to get on the remote hosts
We have installed pam_slurm_adopt in the meantime. This allows you to ssh to nodes, which belong to your running jobs. Multinode jobs with CFX (fluent maybe as well) and STAR-CCM+ should be possible now.
X11-Forwarding
Some kind of unintuative X11-Forwarding is possible at the moment. There still does not exist something like sbatch --x11
but you can do native ssh-X11-Forwarding now that pam_slurm_adopt is installed.
You have two options for the time being:
- Use
salloc
to get an allocation of nodes. Start SSH with X-forwarding to the allocated hosts.- The drawback is, that
salloc
injects many Slurm variables into the environment, which still exist once the allocation ended. So please use a new shell forsalloc
- The drawback is, that
- Use a normal batch script, which includes e.g. a sleep command or similar, then use
ssh
to the remote nodes as soon as the job runs- This is the preferred way and we will provide a small program to ease this for you as soon as possible
- At the moment, we are distributing
guishell
throughout the cluster.
- At the moment, we are distributing
- This is the preferred way and we will provide a small program to ease this for you as soon as possible
Drawback of that method is, that you "just" do a ssh to one compute node, so it is a plain environment without all the slurm variables, you would expect in a real job.
"Full" X11-Forwarding would mean, that in the batchscript you can e.g. directly start Intel Vtune or an xterm, without any additional work. Also the environment set there is the real job environment. This is still to be implemented.