Each PBS restart creates a jump in the job ID

#1

Hi,
We have observed that the job ID jumps after each PBS restart. Is there any solution for this?
Database view:

83.dxc-login-01 | 9 | 1557218646
84.dxc-login-01 | 9 | 1557218877
85.dxc-login-01 | 9 | 1557218966
86.dxc-login-01 | 9 | 1557219031
87.dxc-login-01 | 9 | 1557223984
88.dxc-login-01 | 9 | 1557224004
89.dxc-login-01 | 9 | 1557230993
90.dxc-login-01 | 9 | 1557231566
91.dxc-login-01 | 9 | 1557234065
92.dxc-login-01 | 9 | 1557235055
93.dxc-login-01 | 9 | 1557235872
94.dxc-login-01 | 9 | 1557236133
95.dxc-login-01 | 9 | 1557236378
96.dxc-login-01 | 9 | 1557236909
97.dxc-login-01 | 9 | 1557237199
98.dxc-login-01 | 9 | 1557237211
99.dxc-login-01 | 9 | 1557237249
100.dxc-login-01 | 9 | 1557237461
101.dxc-login-01 | 9 | 1557237570
102.dxc-login-01 | 9 | 1557237699
103.dxc-login-01 | 9 | 1557237796
104.dxc-login-01 | 9 | 1557237917
105.dxc-login-01 | 9 | 1557238083
106.dxc-login-01 | 9 | 1557238810
107.dxc-login-01 | 9 | 1557239462
108.dxc-login-01 | 9 | 1557239869
1085.dxc-login-01 | 9 | 1557258012
1088.dxc-login-01 | 9 | 1557306775
1089.dxc-login-01 | 9 | 1557311226
1090.dxc-login-01 | 9 | 1557314460
1091.dxc-login-01 | 9 | 1557314537
1092.dxc-login-01 | 9 | 1557314736
1093.dxc-login-01 | 9 | 1557314943
1094.dxc-login-01 | 9 | 1557315230
1095.dxc-login-01 | 9 | 1557315818
1096.dxc-login-01 | 9 | 1557316069
1097.dxc-login-01 | 9 | 1557316165
1098.dxc-login-01 | 9 | 1557316231
1099.dxc-login-01 | 9 | 1557316560
1100.dxc-login-01 | 9 | 1557316747
1101.dxc-login-01 | 9 | 1557317041
1102.dxc-login-01 | 9 | 1557317498
1104.dxc-login-01 | 9 | 1557317577
1105.dxc-login-01 | 9 | 1557317600
1110.dxc-login-01 | 9 | 1557321260
1111.dxc-login-01 | 9 | 1557321296
1112.dxc-login-01 | 9 | 1557321416
1113.dxc-login-01 | 9 | 1557321505
1114.dxc-login-01 | 9 | 1557321838
1116.dxc-login-01 | 9 | 1557321915
1117.dxc-login-01 | 9 | 1557322147
1118.dxc-login-01 | 9 | 1557322340
2117.dxc-login-01 | 9 | 1557344963
2118.dxc-login-01 | 9 | 1557345518
2119.dxc-login-01 | 9 | 1557345857
2120.dxc-login-01 | 9 | 1557345994
3117.dxc-login-01 | 9 | 1557394266
3119.dxc-login-01 | 9 | 1557394543
3120.dxc-login-01 | 9 | 1557401033
3121.dxc-login-01 | 9 | 1557432493
3122.dxc-login-01 | 9 | 1557432973
3123.dxc-login-01 | 9 | 1557434199
3124.dxc-login-01 | 9 | 1557438191
3125.dxc-login-01 | 9 | 1557438273
3126.dxc-login-01 | 9 | 1557474582
3127.dxc-login-01 | 9 | 1557474596
3128.dxc-login-01 | 9 | 1557474617
3129.dxc-login-01 | 9 | 1557474623
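
To see exactly where the jumps occur in a listing like the one above, a small filter can help. This is only a sketch: the function name `find_jumps` and the default threshold of 100 are assumptions made for illustration, and the input is a short excerpt of the rows above in the same `JOBID.server | state | timestamp` layout.

```shell
# find_jumps: read "JOBID.server | state | timestamp" rows on stdin and
# report every place where the numeric job ID skips ahead by more than
# a threshold (default 100, an arbitrary value chosen for this sketch).
find_jumps() {
    awk -F'|' -v threshold="${1:-100}" '
        {
            id = $1
            sub(/\..*/, "", id)    # "1085.dxc-login-01 " -> "1085"
            id += 0                # force numeric comparison
            if (NR > 1 && id - prev > threshold)
                printf "jump of %d: %d -> %d\n", id - prev, prev, id
            prev = id
        }'
}

# Excerpt of the database view above (not the full listing).
find_jumps <<'EOF'
107.dxc-login-01 | 9 | 1557239462
108.dxc-login-01 | 9 | 1557239869
1085.dxc-login-01 | 9 | 1557258012
1088.dxc-login-01 | 9 | 1557306775
1118.dxc-login-01 | 9 | 1557322340
2117.dxc-login-01 | 9 | 1557344963
EOF
# prints:
# jump of 977: 108 -> 1085
# jump of 999: 1118 -> 2117
```

Comparing the timestamps on either side of each reported jump with the times PBS was restarted should show which restarts caused them.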

#2

This is done on purpose after an unclean server shutdown, to avoid potential job ID duplication, but it should not happen after a clean shutdown:

user1@osuse15:~> echo "sleep 10" | qsub
0.osuse15

osuse15:~ # killall -9 pbs_server.bin
osuse15:~ # pbs_server
Connecting to PBS dataservice…connected to PBS dataservice@osuse15

user1@osuse15:~> echo "sleep 10" | qsub
1000.osuse15

osuse15:~ # killall -15 pbs_server.bin
osuse15:~ # pbs_server
Connecting to PBS dataservice…connected to PBS dataservice@osuse15

user1@osuse15:~> echo "sleep 10" | qsub
1001.osuse15

osuse15:~ # qterm
osuse15:~ # pbs_server
Connecting to PBS dataservice…connected to PBS dataservice@osuse15

user1@osuse15:~> echo "sleep 10" | qsub
1002.osuse15

osuse15:~ # systemctl restart pbs
user1@osuse15:~> echo "sleep 10" | qsub
1003.osuse15

osuse15:~ # killall -11 pbs_server.bin
osuse15:~ # pbs_server
Connecting to PBS dataservice…connected to PBS dataservice@osuse15

user1@osuse15:~> echo "sleep 10" | qsub
2003.osuse15

So, how are you restarting PBS when you see this?

#3

OK, I need to restart PBS carefully. Thanks for the reply.