I am working on PP-315, and I have few questions regarding JobArray, before working
on the fix for this bug I need more understanding as well what would be the best approach.
Q1. Why deleted subjobs and finished subjobs are treated as expired, shouldn’t they be different ?
Q2. Why a Jobarray can not be moved to a different queue when it is in “begun” state while all the subjobs are either in expired or queued state. i.e none of the subjobs are running. ?
Q3. As far as I understand there are no restriction on the order of subjobs to be selected or the nodes/vnodes in which subjobs should run. Hence I don’t understand why then moving a Jobarray shouldn’t be valid when none of the subjobs are running ( which includes subjobs to be in queued or expired state)
Q4. When a subjob goes into execution, Parent job state changes to ‘B’, and if for some reason the running jobs gets requed, the Parent job still reamins in ‘B’ state even though all the subjobs are in 'Q’queued state. Is this behavior correct?
Also if I put hold on the arrayjob parent it will goto ‘H’ state from ‘B’ and when I release the parent job it goes to ‘Q’ instead of ‘B’. If the above behavior is correct then shouldn’t parent job move to ‘B’ state.