Hi CMS team,
My suites u-dr800 and u-dr928 both fail with some python errors that didn’t use to occur.. have there been any python updates on ARCHER2?
the tasks that fail are
unicicles_gris; unicicles_ais; postproc_atmos
For the unicicles tasks, the errors are:
srun: error: nid005710: tasks 0-21,23-47,49-127: Exited with exit code 127 srun: launch/slurm: _step_signal: Terminating StepId=11209862.0 /work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_unicicles/unicicles/bin/unicicles: error while loading shared libraries: libpython3.9.so.1.0: cannot open shared object file: No such file or directory /work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_unicicles/unicicles/bin/unicicles: error while loading shared libraries: libpython3.9.so.1.0: cannot open shared object file: No such file or directory slurmstepd: error: *** STEP 11209862.0 ON nid005710 CANCELLED AT 2025-10-16T18:01:37 *** srun: error: nid005710: tasks 22,48: Exited with exit code 127 [FAIL] $IEXECDIR/unicicles_wrapper # return-code=127 2025-10-16T17:01:38Z CRITICAL - failed/ERR
Note this happened before, this was a fix from Grenville:
in site/archer2-unicicles.rc add --export=all thus
ROSE_LAUNCHER_PREOPTS = --hint=nomultithread --distribution=block:block --cpu-bind=cores --nodes=1 --ntasks=128 --export=allin [[UNI_EXEC_RESOURCE]]
This currently is included in the suite.
Then Grenville wrote:
same for [[CAP_RESOURCE]] in site/archer2-unicicles.rc
ROSE_LAUNCHER_PREOPTS = --hint=nomultithread --distribution=block:block --cpu-bind=cores --nodes=1 --ntasks=1 –export=all Fixed elsewhere - nothing needed
Could it be that this now needs an update elsewhere? I saw another ticket about issues with CAP ( Issue with CAP9.1 after archer2 update )
For postproc_atmos, the error I get is:
Traceback (most recent call last):
File “/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/main_pp.py”, line 119, in
main()
File “/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/main_pp.py”, line 112, in main
run_postproc()
File “/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/main_pp.py”, line 83, in run_postproc
getattr(model, meth)()
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/timer.py”, line 115, in wrapper
out = function(*args, **kw)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/atmos.py”, line 491, in do_meaning
icode = self.update_meanfile(meanfile, setend)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/timer.py”, line 115, in wrapper
out = function(*args, **kw)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/atmos.py”, line 445, in update_meanfile
rcode = climatemean.create_mean(meanfile,
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/timer.py”, line 115, in wrapper
out = function(*args, **kw)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/climatemean.py”, line 256, in create_mean
icode, output = target_app(meanfile, **kwargs)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/timer.py”, line 115, in wrapper
out = function(*args, **kw)
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/atmos_transform.py”, line 275, in create_um_mean
load_mule = [mule.load_umfile(f) for f in meanfile.component_files]
File “/mnt/lustre/a2fs-work2/work/n02/n02/adittus/cylc-run/u-dr928/run2/share/fcm_make_pp/build/bin/atmos_transform.py”, line 275, in
load_mule = [mule.load_umfile(f) for f in meanfile.component_files]
File “/work/y07/shared/umshared/lib/python3.9/mule/init.py”, line 1845, in load_umfile
result = _load_umfile(file_path, open_file)
File “/work/y07/shared/umshared/lib/python3.9/mule/init.py”, line 1836, in _load_umfile
raise ValueError(msg)
ValueError: Unknown dataset_type 0, supported types are dict_keys([1, 2, 3, 4, 5])
[FAIL] main_pp.py atmos # return-code=1
2025-10-17T21:20:43Z CRITICAL - failed/ERR
Thanks for any help!
Cheers,
Andrea