Sorry for the hopelessly vague title.
I've apparently hit a problem in /projects/v45/apps/payu/aek with access-om2-01.
See run dir: /short/v45/aek156/access-om2/control/01deg_jra55_ryf
.
On the weekend run 29 (job 3477651) crashed after <40min walltime.
archive/restart029 was created but there was no work dir or archive/output029.
see archive/pbs_logs/01deg_jra55_ryf.e3477651:
WARNING: no update with \d+ (\d+) \d+ i2o.nc.
WARNING: no update with \d+ (\d+) \d+ o2i.nc.
WARNING: no update with \w{4} \w{4} LAG=\+(\d+).
WARNING: no update with \d+ (\d+) \d+ i2o.nc.
WARNING: no update with \d+ (\d+) \d+ o2i.nc.
WARNING: no update with \w{4} \w{4} LAG=\+(\d+).
Currently Loaded Modulefiles:
1) payu/aek 2) python/2.7.6 3) openmpi/1.6.3 4) pbs
Traceback (most recent call last):
File "/jobfs/local/pbs/mom_priv/jobs/3477651.r-man2.SC", line 9, in <module>
run_cmd.runscript()
File "/projects/v45/apps/payu/aek/lib/payu/subcommands/run_cmd.py", line 135, in runscript
expt.archive()
File "/projects/v45/apps/payu/aek/lib/payu/experiment.py", line 603, in archive
self.model.archive()
File "/projects/v45/apps/payu/aek/lib/payu/models/access.py", line 187, in archive
shutil.copy2(o2i_src, o2i_dst)
File "/apps/python/2.7.6/lib/python2.7/shutil.py", line 130, in copy2
copyfile(src, dst)
File "/apps/python/2.7.6/lib/python2.7/shutil.py", line 82, in copyfile
with open(src, 'rb') as fsrc:
IOError: [Errno 2] No such file or directory: '/short/v45/aek156/access-om2/work/01deg_jra55_ryf/ocean/o2i.nc'
I thought there might have been something wrong with run 28, so I re-ran 28 and then tried 29 again today (job 3562946) but it failed in exactly the same way. (I kept the previous restart029 in restart029-3477651)