the need was to stop running (“R”) jobs in an emergency maintenance. I could set server scheduling to False, but “R” jobs will keep running (expected). scenario:
- dedicated time queue between 10 and 16z and jobs are “Q” during.
- at 16z an X number of jobs kicked in and started running.
- at 16:15z “set server scheduling = False” is issued.
Had to kill running and exiting jobs. There was 400+ jobs between the two and PBS was sluggish.
How can this be addressed in case of an emergency ? Would it be possible to hold/queue all of the X jobs ?