Job Submission on ARCHER2

Hi,

I am trying to run my first job on ARCHER2 and have copied the u-cc519 suite from the NCAS um-training guide, however the fcm_make2 is failing. I have followed the instructions to change username, account, reservation and queue but no other changes have been made.

This is the fcm_make2 job stderr:

Unloading /usr/local/share/epcc-module/epcc-module-loader

Warning: Unloading the epcc-setup-env module will stop many
modules being available on the system. If you do this by
accident, you can recover the situation with the command:

module load /work/y07/shared/archer2-modules/modulefiles-cse/epcc-setup-env

Unloading /work/y07/shared/archer2-modules/modulefiles-cse/epcc-setup-env
Unloading bolt/0.7
Loading bolt/0.7
Loading /work/y07/shared/archer2-modules/modulefiles-cse/epcc-setup-env
Loading cray-hdf5/1.12.0.2
Loading cray-netcdf/4.7.4.2
[FAIL] use = /home1/home/n02/n02/ros/cylc-run/u-cc519/share/fcm_make: incorrect value in declaration
[FAIL] config-file=/work/n02/n02/s2261584/cylc-run/u-cj055/share/fcm_make/fcm-make2.cfg:6
[FAIL] /home1/home/n02/n02/ros/cylc-run/u-cc519/share/fcm_make/.fcm-make/ctx.gz: cannot retrieve cache
[FAIL] No such file or directory at /lus/cls01095/work/y07/shared/umshared/software/fcm-2019.09.0/bin/…/lib/FCM/System/Make/Share/Dest.pm line 106, <$handle> line 6.

[FAIL] fcm make -C /work/n02/n02/s2261584/cylc-run/u-cj055/share/fcm_make -n 2 -j 32 --ignore-lock # return-code=2
Received signal ERR
cylc (scheduler - 2021-11-02T11:33:39Z): CRITICAL Task job script received signal ERR at 2021-11-02T11:33:39Z
cylc (scheduler - 2021-11-02T11:33:39Z): CRITICAL failed at 2021-11-02T11:33:39Z

Best,
Hannah

Hi Hannah,

We are currently working on all the practical exercises/suites to iron out any issues ahead of a training course in a couple of weeks’ time so they are in a slight state of flux. The error you detail above has been fixed since you checked the suite out.

Best Regards,
Ros

Hi Ros,

Thank you!

Best,
Hannah

Hi,

Just another question regarding job submission on ARCHER2: I am now trying to run a copy of u-be303/archer2 (UKESM AMIP) and am receiving a RosePOpen error of ‘return-code=255, stderr= Host key verification failed’ with the key mentioned being @login.archer2.ac.uk.

Following the advice given for ticket #2865 which produced a similar error, I have checked my known_hosts file. Here I only have the @login-4c.archer2.ac.uk login, not the @login.archer2.ac.uk. Within the rose editor I am only able to select Archer2 as a host site and can no longer find the option as in the copy of u-cc519 suite to change HPC_HOST to the login-4c.archer2,.ac.uk option.

Thanks,
Hannah

Hi Hannah,

If you use grep -r login.archer2.ac.uk * in the rose suite directory you will find it in the site/archer2.rc file.

Regards,
Ros.