Install_ancil failing

HI,

I have two very related jobs which are both failing in install_ancil. They both are using prebuild which points to the same directory. When they run the workflow instal_ancil is failing and on puma/cylc gui I can’t see any .err or .out files – see /home/n02/n02/tetts/cylc-run/u-ds995/run2/log/job/20100901T0000Z/install_ancil/02 (or 01)

On archer2 I can see the log files which tells me that run_ancil is running out of time. See /work/n02/n02/tetts/cylc-run/u-ds995/run2/log/job/20100901T0000Z/install_ancil/02/job.err (or 01).

(ModelOptimisation) tetts@ln01:/work/n02/shared/tetts/test_run> more /work/n02/n02/tetts/cylc-run/u-ds995/run2/log/job/20100901T0000Z/install_ancil/02/job.err
slurmstepd: error: *** JOB 11015280 ON dvn01 CANCELLED AT 2025-10-02T10:29:57 DUE
TO TIME LIMIT ***

Everyone should have read access to those files.

So, what should I do? I suspect there is some temp file system problem somewhere…

Simon

I tried again – releasing the instal_ancil task and seems to have worked. I guess a glitch somewhere in the systems….

Simon

try giving it more time, PT2M maybe?

[[INSTALL_RESOURCE]]
    inherit = HPC_SERIAL
    execution time limit = PT1M

I had similar problems with other cases and reported to archer2 help desk. They told me it was because the n02 quota was full. And they’d tell n02 so!

Simon

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.