Error after archer2 restart

returning to run an old suite after archer2 was off line, I am getting the error below.

Please could you assist.

regards,

Jeremy

[jgrist02@puma2 u-da510]$ rose suite-run

[INFO] export CYLC_VERSION=7.8.12

[INFO] export ROSE_ORIG_HOST=puma2.archer2.ac.uk

[INFO] export ROSE_SITE=

[INFO] export ROSE_VERSION=2019.01.3

[INFO] create: log.20250925T130245Z

[INFO] delete: log

[INFO] symlink: log.20250925T130245Z <= log

[INFO] log.20250826T162129Z.tar.gz <= log.20250826T162129Z

[INFO] delete: log.20250826T162129Z/

[INFO] create: log/suite

[INFO] create: log/rose-conf

[INFO] symlink: rose-conf/20250925T140246-run.conf <= log/rose-suite-run.conf

[INFO] symlink: rose-conf/20250925T140246-run.version <= log/rose-suite-run.version

[INFO] install: ana

[INFO] source: /home4/home/n02-puma/jgrist02/roses/u-da510/ana

[INFO] REGISTERED u-da510 → /home/n02/n02/jgrist02/cylc-run/u-da510

[FAIL] bash -ec H=$(rose\ host-select\ archer2);\ echo\ $H # return-code=1, stderr=

[FAIL] [WARN] ln02: (ssh failed)

[FAIL] [WARN] ln01: (ssh failed)

[FAIL] [WARN] ln04: (ssh failed)

[FAIL] [WARN] ln03: (ssh failed)

[FAIL] [FAIL] No hosts selected.

[jgrist02@puma2 u-da510]$

Please use the helpdesk search - something this problem has occurred before and may have the answer:

see https://cms-helpdesk.ncas.ac.uk/t/archer2-restart-ssh-fail/1805

Grenville

Hi -

I looked at the thread.

[jgrist02@puma2 ~]$ ssh-add -l

The agent has no identities.

[jgrist02@puma2 ~]$

but my directory has following contents:

[jgrist02@puma2 ~]$ ls ~/.ssh/

authorized_keys environment.puma2.archer2.ac.uk id_rsa id_rsa_archerum.pub id_rsa_jasmin.pub known_hosts

config environment.pumanew.novalocal id_rsa_archerum id_rsa_jasmin id_rsa.pub ssh-setup

[jgrist02@puma2 ~]$

Am I missing the relevant file or is it one of those?

Jeremy

Hi Jeremy,

Out of those you list, I’d guess it’s the id_rsa one, but it’s impossible for us to tell. Try it and see.

This is why we always suggest when you create an ssh-key you name it sensibly so you can easily tell what machine it relates to.

Cheers,
Ros.

Thank you. I ‘ve applied the command to id_rsa, which cleared the [FAIL] error.

However, on the gui I’m getting ‘submit-failed’ after some time of retrying and waiting, did not seem to appear in the slurm queue - although some initial files appeared on archer2.

with many thanks for your help.

Jeremy

Jeremy

The error is spelled out on puma in /home/n02/n02/jgrist02/cylc-run/u-da510/log/job/19500401T0000Z/fcm_make2_um/10/job-activity.log

Please avail yourself of the information provided by ARCHER about budgets ( General FAQ - ARCHER2 User Documentation )

Grenville

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.