BUFFIN error in createbc

Hi CMS,

I’ve started getting file reading errors in the createbc step of u-co433. It only appears at the second cycle and appears to be related to the input.nl file not being shared between tasks (ie. only used in the first cycle but not subsequent ones). I’ve gotten this impression based on warnings about IOSCNTL and SHARED_NLIST not being set in the .err file and the fact that the FAIL error seems to be related to the read-in of the input.nl.

The fort.103 file seems to be empty and I can’t remember how to open the core file (could someone please remind me how?) to dig further. Have you come across this error before?

BUFFIN: Read Failed: No such file or directory

???
???!!!???!!!???!!!???!!!???!!! ERROR ???!!!???!!!???!!!???!!!???!!!
? Error code: 24
? Error from routine: io:buffin
? Error message: Error in buffin errorCode= 0.00 len=0/1
? Error from processor: 0
? Error number: 2
???

[0] exceptions: An non-exception application exit occured.
[0] exceptions: whilst in a serial region
[0] exceptions: Task had pid=0 on host
[0] exceptions: Program is “/work/n02/n02/dcase/random_bin/um-createbc.exe”
Warning in umPrintMgr: umPrintExceptionHandler : Handler Invoked
gc_abort (Processor 0): um_abort called
[FAIL] /work/n02/n02/dcase/random_bin/um-createbc.exe input.nl <<‘STDIN
[FAIL]
[FAIL] ‘STDIN’ # return-code=-6
2022-10-18T10:17:27Z CRITICAL - failed/EXIT

please change permissions on /mnt/lustre/a2fs-work2/work/n02/n02/shakka/cylc-run/u-co433/work/20211201T1200Z/Antarctic_12km_ERA5_JULES10_cyc12_fcst24_um_createbc_000/core so we can read it.

Hi Grenville, should be good to go now. can you also please remind me what program I need to use to read core files? E

Ella

This file looks dodgy
/home/n02/n02/shakka/cylc-run/u-co433/work> xconv /work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um/ec_cb036

xconv can not determine its file type, same for home/n02/n02/shakka/cylc-run/u-co433/work> xconv /work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um/ec_cb048

ec_cb000, 012, and 024 look OK (xconv can read them)

ahh - just seen this – I’m guessing this is the problem?

grenvill@ln01:/work/n02/n02/shakka/cylc-run/u-co433/share/cycle/20211201T1200Z/ec/um> ls -lrt
total 17976876
-rw-r–r-- 1 shakka n02 0 Sep 3 10:39 ec_cb048
-rw-r–r-- 1 shakka n02 0 Sep 3 10:39 ec_cb036
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb024
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb012
-rw-r–r-- 1 shakka n02 6138568704 Oct 18 14:42 ec_cb000

gdb /work/n02/n02/dcase/random_bin/um-createbc.exe core (but it didn’t help much here)

Grenville

Aha! Have deleted the offending files and the step seems to be running smoothly…

Thanks Grenville!
E

This topic was automatically closed 2 days after the last reply. New replies are no longer allowed.