Monsoon status?

My UM nesting suite has been in the queue for 5 days now. As you can see below, it was running fine until then, but it stopped suddenly and has not run again for a few days.
Is something going on with Monsoon, or have I perhaps used up my allowance for the month?
Masaru

Hi Masaru,

I’m not sure why, but looking at the XCS queue, the jobs are being held for a reason I don’t fully understand.

rhatcher@xcs-c$ qstat_snapshot | grep u-ck605
8994277.xcs00   myosh      shared       Regn1_IberiaSea_RA2M_um_createbc_000.20190712T0000Z.u-ck605     --       1     4     15gb 00:20 H    --         job held, too many failed attempts to run
8994353.xcs00   myosh      shared       Regn1_IberiaSea_RA2M_um_createbc_001.20190712T0000Z.u-ck605     --       1     4     15gb 00:20 H    --         job held, too many failed attempts to run
8996125.xcs00   myosh      shared       glm_archive.20190712T0000Z.u-ck605     --       1     1      2gb 01:00 H    --         job held, too many failed attempts to run

I would suggest either contacting Monsoon and asking them what’s happened, or killing the jobs and then re-triggering them to see if that fixes the problem.
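
If it helps before killing anything, here is a minimal sketch for listing just the held job IDs for one suite. The function name held_job_ids is made up for illustration, and it assumes the column layout of the listing above (job ID in column 1, job name in column 4, state in column 10). It only prints IDs; it does not kill or re-trigger anything.

```shell
# Hypothetical helper (not an existing Monsoon tool): read a queue listing
# on stdin and print the job IDs of held jobs belonging to one suite.
# Assumes the layout seen above: ID in field 1, name in field 4,
# state in field 10, with "H" meaning held.
held_job_ids() {
    suite="$1"
    awk -v suite="$suite" '$10 == "H" && index($4, suite) > 0 { print $1 }'
}

# Example use (commented out, needs access to the XCS queue):
# qstat_snapshot | held_job_ids u-ck605
```

Reviewing the printed IDs first makes it easier to be sure that only the stuck tasks get killed and re-triggered.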

I also note that asci is running with a -12:00 hour fairshare boost.

Regards,
Ros.

Hi Ros,
Killing and retriggering worked. I was a bit scared of doing that.
Thanks.
Masaru
