I’ve started getting file reading errors in the createbc step of u-co433. It only appears at the second cycle and appears to be related to the input.nl file not being shared between tasks (ie. only used in the first cycle but not subsequent ones). I’ve gotten this impression based on warnings about IOSCNTL and SHARED_NLIST not being set in the .err file and the fact that the FAIL error seems to be related to the read-in of the input.nl.
The fort.103 file seems to be empty and I can’t remember how to open the core file (could someone please remind me how?) to dig further. Have you come across this error before?
[0] exceptions: An non-exception application exit occured.
[0] exceptions: whilst in a serial region
[0] exceptions: Task had pid=0 on host
[0] exceptions: Program is “/work/n02/n02/dcase/random_bin/um-createbc.exe”
Warning in umPrintMgr: umPrintExceptionHandler : Handler Invoked
gc_abort (Processor 0): um_abort called
[FAIL] /work/n02/n02/dcase/random_bin/um-createbc.exe input.nl <<‘STDIN’
[FAIL]
[FAIL] ‘STDIN’ # return-code=-6
2022-10-18T10:17:27Z CRITICAL - failed/EXIT
please change permissions on /mnt/lustre/a2fs-work2/work/n02/n02/shakka/cylc-run/u-co433/work/20211201T1200Z/Antarctic_12km_ERA5_JULES10_cyc12_fcst24_um_createbc_000/core so we can read it.
This file looks dodgy /home/n02/n02/shakka/cylc-run/u-co433/work> xconv /work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um/ec_cb036
xconv can not determine its file type, same for home/n02/n02/shakka/cylc-run/u-co433/work> xconv /work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um/ec_cb048
ec_cb000, 012, and 024 look OK (xconv can read them)
ahh - just seen this – I’m guessing this is the problem?
grenvill@ln01:/work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um> ls -lrt
total 17976876
-rw-r–r-- 1 shakka n02 0 Sep 3 10:39 ec_cb048
-rw-r–r-- 1 shakka n02 0 Sep 3 10:39 ec_cb036
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb024
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb012
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb000
gdb /work/n02/n02/dcase/random_bin/um-createbc.exe core (but it didn’t help much here)