Hi CMS Helpdesk,
I am trying to run suite u-bl274 but am getting an error after running the command rose suite-run:
[FAIL] ssh -oBatchMode=yes -oStrictHostKeyChecking=no -oConnectTimeout=8 -n {ARCHER_USERNAME}}@ln02 env\ ROSE_VERSION=2019.01.3\ CYLC_VERSION=7.8.12\ bash\ -l\ -c\ \'\"$0\"\ \"$@\"\'\ rose\ suite-run\ -vv\ -n\ u-bl274\ --run=run\ --remote=uuid=b2df5a44-b5f2-4400-a1f1-281cc1a0d159,now-str=20250514T134101Z,root-dir=\'$DATADIR\' # return-code=255, stderr=
[FAIL] remote username contains invalid characters
I am not sure how to fix this error, so any help would be greatly appreciated. In case you need it, my ARCHER2 username is gcj_archer2.
For context, this suite is a few years old and was configured to run on niwa-cray, so I have been trying to reconfigure it to run on ARCHER2. There may therefore be other places that still need to be changed, but this error is stumping me.
Thanks for your help,
Gabrielle
In case you want the full debug info, here it is:
[gcj_archer2@puma2 u-bl274]$ rose suite-run -v
[INFO] Configuration: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/
[INFO] file: rose-suite.conf
[INFO] export CYLC_VERSION=7.8.12
[INFO] export ROSE_ORIG_HOST=puma2.archer2.ac.uk
[INFO] export ROSE_SITE=
[INFO] export ROSE_VERSION=2019.01.3
[INFO] create: log.20250514T134101Z
[INFO] delete: log
[INFO] symlink: log.20250514T134101Z <= log
[INFO] log.20250514T121135Z.tar.gz <= log.20250514T121135Z
[INFO] delete: log.20250514T121135Z/
[INFO] create: log/suite
[INFO] create: log/rose-conf
[INFO] symlink: rose-conf/20250514T144101-run.conf <= log/rose-suite-run.conf
[INFO] symlink: rose-conf/20250514T144101-run.version <= log/rose-suite-run.version
[INFO] unchanged: ana/mule_cumf.py
[INFO] source: svn://puma2.archer2.ac.uk/um.xm/main/trunk/rose-stem/ana/mule_cumf.py@50973 (fcm:um.xm_tr/rose-stem/ana/mule_cumf.py@vn11.0)
[INFO] unchanged: app
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/app
[INFO] unchanged: bin
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/bin
[INFO] unchanged: bin/create_means.py
[INFO] source: svn://puma2.archer2.ac.uk/moci.xm/main/trunk/Utilities/lib/create_means.py@2546 (fcm:moci.xm_tr/Utilities/lib/create_means.py@2546)
[INFO] unchanged: meta
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/meta
[INFO] unchanged: rose-suite.info
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/rose-suite.info
[INFO] unchanged: tests-graph.rc
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/tests-graph.rc
[INFO] unchanged: tests-runtime.rc
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/tests-runtime.rc
[INFO] unchanged: ukesm-graph.rc
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/ukesm-graph.rc
[INFO] unchanged: ukesm-runtime.rc
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/ukesm-runtime.rc
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/site
[INFO] install: site
[INFO] source: /home4/home/n02-puma/gcj_archer2/roses/u-bl274/site
[INFO] REGISTERED u-bl274 -> /home/n02/n02/gcj_archer2/cylc-run/u-bl274
[INFO] Valid for cylc-7.8.12
[INFO] WARNING - deprecated items were automatically upgraded in 'suite definition':
[INFO] WARNING - * (6.11.0) [runtime][RETRIES][retry delays] -> [runtime][RETRIES][job][execution retry delays] - value unchanged
[FAIL] ssh -oBatchMode=yes -oStrictHostKeyChecking=no -oConnectTimeout=8 -n {ARCHER_USERNAME}}@ln02 env\ ROSE_VERSION=2019.01.3\ CYLC_VERSION=7.8.12\ bash\ -l\ -c\ \'\"$0\"\ \"$@\"\'\ rose\ suite-run\ -vv\ -n\ u-bl274\ --run=run\ --remote=uuid=b2df5a44-b5f2-4400-a1f1-281cc1a0d159,now-str=20250514T134101Z,root-dir=\'$DATADIR\' # return-code=255, stderr=
[FAIL] remote username contains invalid characters
Gabrielle
Gabrielle
The HPC directives aren’t right for ARCHER2 (it uses Slurm, not PBS).
You need something like:
submission retry delays = 3*PT5M,3*PT30M,3*PT3H
batch system = slurm
[[[directives]]]
--chdir=/work/n02/n02/{{ARCHER_USERNAME}}
--partition=standard
--qos={{ARCHER_QUEUE}}
--account={{ARCHER_GROUP}}
--export=none
Also note the mismatched } in:
[[[remote]]]
host = $(rose host-select {{ SITE }})
owner = {ARCHER_USERNAME}}
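The failing ssh command shows the literal string {ARCHER_USERNAME}}@ln02, which suggests Jinja2 never substituted the variable because the opening brace pair is incomplete. A sketch of the corrected section, assuming the missing { is the only fault in it and using the same doubled-brace form as the directives above:

```
[[[remote]]]
    host = $(rose host-select {{ SITE }})
    # both braces doubled so Jinja2 substitutes the variable
    owner = {{ARCHER_USERNAME}}
```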
You need single quotes around the ARCHER username and account group values.
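As a sketch of the quoting, assuming these variables are set in the [jinja2:suite.rc] section of rose-suite.conf (the usual place in a Rose suite, though I haven't confirmed it for u-bl274). The group value below is a placeholder, not a real budget code:

```
[jinja2:suite.rc]
# quoted, so Jinja2 treats the value as a string rather than a variable name
ARCHER_USERNAME='gcj_archer2'
# placeholder: replace with your own ARCHER2 budget code
ARCHER_GROUP='your-budget-code'
```

Without the quotes, the value is read as the name of another Jinja2 variable, which is normally undefined.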
SCRATCH is an unbound variable (I didn’t investigate).
It may help to compare with a working ARCHER job. I think the choice of ARCHER2 queue won’t work as it stands: it needs some extra Jinja, so have a look in u-cc654 or hard-wire the queue.
Grenville