Postproc failing - Connection closed

Hi,

I am running an N512 UM run but my postproc has failed multiple times with the following:

[WARN]  [SUBPROCESS]: Command: rsync -av --stats --rsync-path=mkdir -p /gws/nopw/j04/hiresgw/dg/archer_transfers/u-cg647/19910101T0000Z && rsync /work/n02/n02/dangalea/archive/u-cg647/19910101T0000Z/ hpxfer1.jasmin.ac.uk:/gws/nopw/j04/hiresgw/dg/archer_transfers/u-cg647/19910101T0000Z
[SUBPROCESS]: Error = 10:
	
            Access to this system is monitored and restricted to
            authorised users.   If you do not have authorisation
            to use  this system,  you should not  proceed beyond
            this point and should disconnect immediately.

            Unauthorised use could lead to prosecution.

    (See also - http://www.stfc.ac.uk/aup)

sending incremental file list
./
cg647a.pf1990dec.pp
Connection to hpxfer1.jasmin.ac.uk closed by remote host.
rsync: [sender] write error: Broken pipe (32)
rsync error: error in socket IO (code 10) at io.c(829) [sender=3.1.3]

[WARN]  Transfer command failed: rsync -av --stats --rsync-path="mkdir -p /gws/nopw/j04/hiresgw/dg/archer_transfers/u-cg647/19910101T0000Z && rsync" /work/n02/n02/dangalea/archive/u-cg647/19910101T0000Z/ hpxfer1.jasmin.ac.uk:/gws/nopw/j04/hiresgw/dg/archer_transfers/u-cg647/19910101T0000Z
[ERROR]  transfer.py: Unknown Error - Return Code=10
[FAIL]  Command Terminated
[FAIL] Terminating PostProc...
[FAIL] transfer.py # return-code=1

It seems that JASMIN is closing the connection during the rsync, but I'm not sure why. Would you be able to help?

Regards,
Daniel

Hi Daniel,

Looking at the logs, pptransfer for cycle 19910101T0000Z succeeded on try 09, and the pptransfer for the next cycle is now running. JASMIN services were “at risk” today because of planned system maintenance, so that could have been the cause.
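
For reference, rsync's return code 10 is documented in its man page as "Error in socket I/O", which fits a connection dropped at the remote end rather than a problem with the command itself. As a rough illustration only (this is not how transfer.py works; the names `TRANSIENT_RSYNC_CODES`, `is_transient` and `run_with_retries` are hypothetical), a wrapper could separate transient network failures from other errors and retry only the former:

```python
import subprocess
import time

# Exit codes listed in the rsync man page that usually point to a
# transient network problem rather than a bad command or missing files.
TRANSIENT_RSYNC_CODES = {
    10: "error in socket I/O",
    12: "error in rsync protocol data stream",
    30: "timeout in data send/receive",
    35: "timeout waiting for daemon connection",
}


def is_transient(return_code):
    """Return True if the rsync exit code suggests retrying may help."""
    return return_code in TRANSIENT_RSYNC_CODES


def run_with_retries(cmd, max_tries=3, wait_seconds=60,
                     runner=subprocess.call, sleep=time.sleep):
    """Run cmd, retrying only on transient rsync exit codes.

    Assumes max_tries >= 1. runner and sleep are injectable so the
    logic can be exercised without touching the network.
    """
    rc = None
    for attempt in range(1, max_tries + 1):
        rc = runner(cmd)
        # Stop on success, or on an error that a retry is unlikely to fix.
        if rc == 0 or not is_transient(rc):
            return rc
        if attempt < max_tries:
            sleep(wait_seconds)
    return rc
```

With something like this, a code 10 failure during a maintenance window would be retried after a pause, while a non-transient code such as 23 (partial transfer due to error) would be reported straight away.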

Cheers,
Ros.

Hi,

Thanks for that. It happened a few times yesterday too, but all seems well now.

Regards,
Daniel

Hi Daniel,

Most, if not all, of yesterday's attempts for that task failed because they hit the CPU time limit, rather than because of problems connecting to JASMIN. The pptransfer task currently runs in the background on the ARCHER2 login nodes, and there were ARCHER2 issues yesterday.

Hopefully it will continue ok now.

Cheers,
Ros.
