Frequent Timeout and Submit Failures

Dear CMS,

My suites (u-dt221 and u-ds899) are experiencing more frequent timeout issues, regularly crashing due to time limit despite the cycles having ample wallclock time to complete under normal cirumstances.

In addition, over this weekend when tasks have failed on submit or due to time limit, I have tried to re-trigger them but they hang on “ready” status for a long time and fail to submit to ARCHER.

Are these common issues that other users are experiencing at the moment and is there a fix I can implement?

Thanks,
Alfred Wilson

Alfred

Others are experiencing slow running jobs - when this happens, please report it to ARCHER (cc CMS), send them the CYLC_BATCH_SYS_JOB_ID (from job.status).

What error is reported for the failure to submit?

Grenville

Hi Alfred,

We have had a message from Archer2 that there was a hardware issue over the weekend which was fixed early this morning.

Please try resubmiting your jobs now and do report any further issues to Archer2.

Annette

Hi Both,

Thanks for the feedback, I can confirm my jobs are running normally again. If I see any further problems, I will report them to ARCHER.

Alfred

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.