I do not think the changes in fcm_make_pp and fcm_make_um (see snapshot) will work.
For pp- there is no branch with that name, so you might have to find out the appropriate one by looking at recent ARCHER suites.
For um- the config_revision is essential, otherwise the build will try to use code from head of the trunk and cause a mismatch with the setup.
I added the config_revision back in but the fcm_make_um error has persisted.
For the fcm_make_pp error, I tried with Ros’ postproc_2.4_archer2_jasmin_rewrite@5092 and also with pp_sources blank and, on both occasions, it generated the same error as above.
I haven’t found another vn13.3 Archer2 suite - I was hoping to use vn13.3 to make use of Dan Grosvenor’s updated hygroscopicity fix.
The name-spaces error in fcm_make_um particularly confusing.
should have said - that’s in this section
[[UMBUILD]]
[[[environment]]]
CONFIG = ncas-ex-cce
OPENMP= true
OPTIM = safe
PREBUILD =
ROSE_APP_OPT_CONF_KEYS = archer2
The changes that Mohit and Grenville may have got your suite working.
If your suite is nudged, another option could be to take a look at my suite u-db533 - this is a UM13.1 nudged UKESM1.1 AMIP suite on ARCHER2. It should be possible to upgrade this to UM13.3 and then apply the differences from your Monsoon2 suite to get a working equivalent.
Thanks, Grenville. I removed the ROSE_APP_OPT_CONF_KEYS = archer2 in the UMBUILD section which in my AMIP suite is closest to your coupled suite u-dj927 but the I got the same error messages.
I then removed it from the ATMOS_RESOURCE section as well and got the same error messages. Finally I removed it from the EXTRACT_RESOURCE section, which left the [[Environment]] subsection blank and the run failed as before.
The suite.rc file also has:
ROSE_APP_OPT_CONF_KEYS = {{CONFIG_OPT}} {{BITCOMP_NRUN_OPT}}
Should that be removed/altered too>
Thanks, Luke. I’m not planning to run nudged but could try upgrading the suite. Is there any documentation on how to do that?
Similarly delete the contents of the app/fcm_make_um/opt/rose-app-archer2.conf as it is still picking up the UM12.1 config branch due to a higher level setting.
section as I get the below error when I try to run
[FAIL] cylc validate -o /tmp/tmp4FDt5S --strict u-dk142 # return-code=1, stderr= [FAIL] Jinja2Error: [FAIL] File “/home/n02/n02/jweber/cylc-run/u-dk142/jdma.rc”, line 1, in top-level template code [FAIL] {% if RUN and POSTPROC and PPTRANSFER and JDMA %} [FAIL] UndefinedError: ‘RUN’ is undefined
I haven’t done the pp transfer tasks yet as I need to work out how and where to put the data on Jasmin. Are they required for the jdma?
Thanks, Ros. u-dk142 now runs but doesn’t seem to archive anything on Archer2 (I have a directory on work /work/n02/n02/jweber/archive) - can this be done separately to the pptransfer to Jasmin?
I will also look to get the Jasmin transfer working.
The postproc step hasn’t run successfully yet and this is the step that puts data in the /work/n02/n02/jweber/archive directory.
If you look in the job.err file for the postproc task you will see that it ran out memory.
In site/archer2.rc in section [[POSTPROC_RESOURCE]] try up’ing the memory requested for this task by adding
[[[directives]]]
--mem = 25Gb
Reload the suite and retrigger the postproc task.
Once we’ve got postproc working, if you’re transferring the data to JASMIN we should then configure the suite so it automatically deletes the data from ARCHER2 once it has been successfully transferred.
Let me know once you’ve got postproc working ok and I’ll tell you what to do next.
Thanks, Ros. u-dk142 now sends output to the archive directory. I will follow the steps for setting up the transfer to Jasmin and let you know if I have any problems.
error: Unable to check destination url for sync: gsiftp://gridftp1.jasmin.ac.uku-dk142/19800101T0000Z/
globus_xio: Unable to connect to gridftp1.jasmin.ac.uku-dk142:2811
globus_xio: globus_libc_getaddrinfo failed.
globus_common: Name or service not known
I thought I had done all the changes but realised the transfer_dir had not been defined. When I added that, u-dk142 throw an error (below) in the atmos_main (despite completing it successfully before) so I am trying to work why that has happened.
? Error code: 58
? Error from routine: UKCA_CHEMISTRY_CTL
? Error message: ERROR: Number of chemical active species /= jpctr
? Error from processor: 464
Once I have solved that, I will return to the pptransfer challenge and let you know how I get one.
I’ve switched to use u-dk384 which is a copy of u-dk142@301081 before I made the stash changes which caused the above atmos_main error. u-dk384 has all the modification listed in the Configuring PPTransfer link, including the transfer dir and completes atmos_main.
I also checked the credential is valid.
However, it is still failing on pptransfer with
Lmod is automatically replacing “cce/15.0.0” with “gcc/11.2.0”.
Due to MODULEPATH changes, the following have been reloaded:
error: Unable to check destination url for sync: gsiftp://gridftp1.jasmin.ac.uk/gws/nopw/j04/sheffield/jweber/archive/u-dk384/19800101T0000Z/
globus_ftp_client: the server responded with an error
530 530-Login incorrect. : globus_gss_assist: Gridmap lookup failure: Could not map /DC=uk/DC=ac/DC=jasmin/O=STFC RAL/CN=jmw240
530-
530 End.
I’ve just seen that my user role for the hpxfer server has expired. When I try to apply to extend it, I am asked for “The IP address from which you will be accessing the high-performance transfer machines. If you are not sure what to put here, please contact your local network administrator.”
Would this be my reading IP address or an Archer2 address?
Hi James,
On that page it should give a dummy IP address you can enter if you going from ARCHER2 as they already know about it, otherwise ARCHER2 IP address is fine.