UKCA recon failed

Hi there,

I made a copy (u-do789) of my own suite and changed only the restart files, replacing them with new ones I generated. It stopped at “recon” with the error message:
???!!!???!!!???!!!???!!!???!!! ERROR ???!!!???!!!???!!!???!!!???!!!
? Error code: 24
? Error from routine: io:buffin
? Error message: Error in buffin errorCode= 1.00 len=0/256
? Error from processor: 0
? Error number: 3
[0] exceptions: An non-exception application exit occured.

Could you help me work out what caused the crash? Thanks.

Best,
Xin

Hi Xin,

The AINITIAL_FILE setting in rose-suite.conf is blank, so Recon is unable to find the input file.
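
A quick way to catch this kind of problem before running the suite is to scan rose-suite.conf for settings that have been left empty. The following is only a minimal sketch: the [jinja2:suite.rc] section header and the RUN_LENGTH entry in the example text are assumptions for illustration, not taken from u-do789, and blank_settings is a hypothetical helper rather than part of Rose.

```python
def blank_settings(conf_text):
    """Return the names of settings whose value is empty or an empty quoted string."""
    blank = []
    for raw in conf_text.splitlines():
        line = raw.strip()
        # Skip empty lines, comments and section headers such as [jinja2:suite.rc].
        if not line or line.startswith("#") or line.startswith("["):
            continue
        if "=" not in line:
            continue
        name, value = line.split("=", 1)
        if value.strip() in ("", "''", '""'):
            blank.append(name.strip())
    return blank


# Illustrative content only; the real file lives in the suite directory.
example = """\
[jinja2:suite.rc]
AINITIAL_FILE=''
RUN_LENGTH='P1M'
"""
print(blank_settings(example))  # ['AINITIAL_FILE']
```

To check a real suite, read the file contents and pass them in, for example blank_settings(open("rose-suite.conf").read()).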


Mohit

Hi Mohit,

After adding AINITIAL_FILE to the suite, I committed the change with fcm commit and then did a cold run. However, I received new error messages:
ERROR: file not found: /home/n02/n02/emxin/cylc-run/u-do789/log/job/20200201T0000Z/install_cold/NN/job.out
Traceback (most recent call last):
  File "/home4/home/n02-puma/fcm/metomi/cylc-7.8.12/bin/cylc-cat-log", line 439, in <module>
    main()
  File "/home4/home/n02-puma/fcm/metomi/cylc-7.8.12/bin/cylc-cat-log", line 435, in main
    tmpfile_edit(out, options.geditor)
  File "/home4/home/n02-puma/fcm/metomi/cylc-7.8.12/bin/cylc-cat-log", line 265, in tmpfile_edit
    modtime1 = os.stat(tmpfile).st_mtime
TypeError: coercing to Unicode: need string or buffer, int found

Best,
Xin

Hi Xin,

That is Cylc complaining that it cannot find the log files, because the install_cold task could not be launched on Archer2. On PUMA, see ~emxin/cylc-run/u-do789/log/job/20200201T0000Z/install_cold/job-activity.log:

[jobs-submit cmd] (remote host select)
[jobs-submit ret_code] 1
[jobs-submit err]
rose host-select archer2: host selection failed:
COMMAND FAILED (124): rose host-select archer2
COMMAND STDERR: [WARN] ln01: (ssh failed)
[jobs-submit cmd] (remote host select)
[jobs-submit ret_code] 1
[jobs-submit err]
rose host-select archer2: host selection failed:
COMMAND FAILED (124): rose host-select archer2
COMMAND STDERR: [WARN] ln01: (ssh failed)

This looks like a problem connecting to Archer2, which may be temporary, as another task, ‘fcm_make2_um’, has run successfully on Archer2.
Try re-submitting the install_cold task.
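
If a task never produces job.out, the job-activity.log is the place to look, and failed submission attempts can be picked out of it automatically. The following is a rough sketch that assumes the log has the same layout as the excerpt above (a "[jobs-submit ret_code] N" line followed by the error text); failed_submissions is a hypothetical helper, not a Cylc command.

```python
import re

RET_CODE = re.compile(r"^\[jobs-submit ret_code\]\s*(-?\d+)")


def failed_submissions(log_text):
    """Return one dict per submission attempt that has a non-zero return code."""
    failures = []
    current = None  # the failed attempt currently collecting message lines
    for raw in log_text.splitlines():
        line = raw.strip()
        match = RET_CODE.match(line)
        if match:
            code = int(match.group(1))
            current = {"ret_code": code, "messages": []} if code != 0 else None
            if current is not None:
                failures.append(current)
        elif line.startswith("[jobs-submit cmd]"):
            # A new submission attempt starts here.
            current = None
        elif line.startswith("[jobs-submit err]"):
            # Keep any text that follows the tag on the same line.
            rest = line[len("[jobs-submit err]"):].strip()
            if current is not None and rest:
                current["messages"].append(rest)
        elif current is not None and line:
            current["messages"].append(line)
    return failures


# One attempt copied from the excerpt above.
sample = """\
[jobs-submit cmd] (remote host select)
[jobs-submit ret_code] 1
[jobs-submit err]
rose host-select archer2: host selection failed:
COMMAND FAILED (124): rose host-select archer2
COMMAND STDERR: [WARN] ln01: (ssh failed)
"""
for failure in failed_submissions(sample):
    print(failure["ret_code"], failure["messages"][-1])
```

A last message containing "ssh failed", as here, points to a connection problem rather than a fault in the suite itself.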


M