Dear PBS community,
First, I am sorry if this question has been asked before.
I am a new user of PBS Pro who is being tasked to manage a cluster.
I have some problem when a job tries to use more than
ncpus than it requests such as in the
following job script:
#!/bin/bash #PBS -q longnormal #PBS -l select=1:ncpus=20 cd $PBS_O_WORKDIR mpirun -n 40 ./a.out
This job will result in one node (each node have max 20 cores) having load about 4000 percent, although
resources_used.cpupercent is only showing 2000 percent.
Probably this can be avoided by using wrapper to
mpirun which I have seen somewhere during my experience as user in some HPC facilities several years ago (then I was only user, not an admin).
I want to know whether there are any solutions to avoid this (by using hooks or other alternatives).
Thank you very much in advance.
I am using PBSPro_18.104.22.168487