I have encountered what must be a pretty normal problem.
If a user submits a jobs from a directory where he/she does not have write access, then apparently the job gets stuck in “Exiting” state - presumably because the log files cannot be written to that dir at job-end(?) At this point, the jobs cannot be deleted with qdel, and they do not seem to actually end, so the lock up resources.
This raises a few questions from me:
How do I “clear” the system from such jobs.
How do I ensure that lusers cannot hang the entire system in this way. (Do I need to write a submitjob hook to check the write permissions on the submit-dir or something?)
I assume this is a pretty ordinary issue?