Hi CMS,
I’m encountering some issues with pptransfer to JASMIN via Globus. There are two issues really.
Firstly, the transfer jobs aren’t completing on the first submission (job(01)). If I log in to Globus Online, the error that keeps cropping up is: endpoint too busy
. The below is copied from the Globus event log.
Error (transfer)
Endpoint: JASMIN Default Collection (a2f53b7f-1b4e-4dce-9b7c-349ae760fee0)
Server: 130.246.1.6:443
Command: PASV
Message: The server may be too busy
Explanation: The endpoint has reached its maximum number of connections available to transfer data. This could be a transient problem which can be ignored.
---
Details: 500-Command failed.\r\r\n500- : globus_ftp_control_local_pasv failed.\r\n500-globus_xio: globus_l_xio_tcp_bind failed.\r\n500-globus_xio: System error in bind: Address already in use\r\n500-globus_xio: A system call failed: Address already in use\r\n500 End.\r\n
Is this an error that you are aware of or that other users are experiencing?
Secondly, when the first job times out on ARCHER due to the above, subsequent re-tries also fail but for a different reason: A transfer with identical paths has not yet completed
. The following is from pptransfer/NN/job.err
:
[WARN] file:atmospp.nl: skip missing optional source: namelist:moose_arch
[WARN] file:atmospp.nl: skip missing optional source: namelist:script_arch
[WARN] [SUBPROCESS]: Command: globus transfer --format unix --jmespath task_id --recursive --fail-on-quota-errors --sync-level checksum --label u-dp788/18740101T0000Z --verify-checksum --notify off 3e90d018-0d05-461a-bbaf-aab605283d21:/work/n02/n02/ajw1g19/archive/u-dp788/18740101T0000Z a2f53b7f-1b4e-4dce-9b7c-349ae760fee0:/gws/nopw/j04/mh_gsp/Model_Output/u-dp788/18740101T0000Z
[SUBPROCESS]: Error = 1:
Globus CLI Error: A Transfer API Error Occurred.
HTTP status: 409
request_id: UjMZgjDbH
code: Conflict
message: A transfer with identical paths has not yet completed
[WARN] Transfer command failed: globus transfer --format unix --jmespath 'task_id' --recursive --fail-on-quota-errors --sync-level checksum --label u-dp788/18740101T0000Z --verify-checksum --notify off 3e90d018-0d05-461a-bbaf-aab605283d21:/work/n02/n02/ajw1g19/archive/u-dp788/18740101T0000Z a2f53b7f-1b4e-4dce-9b7c-349ae760fee0:/gws/nopw/j04/mh_gsp/Model_Output/u-dp788/18740101T0000Z
[ERROR] transfer.py: Globus Error: Network or server error occurred (Globus ReturnCode=1)
[FAIL] Command Terminated
[FAIL] Terminating PostProc...
[FAIL] transfer.py <<'__STDIN__'
[FAIL]
[FAIL] '__STDIN__' # return-code=1
2025-06-05T14:04:55Z CRITICAL - failed/EXIT
Meanwhile, on Globus, the original transfer operation still shows as active.
Can you please advise?
Regards,
Alfred