Jobsub ID 202177.32@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID: 202177.32@justin-prod-sched02.dune.hep.ac.uk
Workflow ID: 6807
Stage ID: 1
User name: calcuttj@fnal.gov
HTCondor Group: group_dune
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4193255424 (3999 MiB)
    Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-05-09 21:11:40
Site: UK_Edinburgh
Entry: DUNE_UK_SGridECDF_ce1
Last heartbeat: 2025-05-09 22:32:25
From worker node:
    Hostname: node2b03.ecdf.ed.ac.uk
    cpuinfo: Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4193255424 (3999 MiB)
    Wall seconds limit: 171000 (47 hours)
    GPU:
    Inner Apptainer?: True
Job state: jobscript_error
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2025-05-09 22:13:04
Input files: monte-carlo-006807-000822
Jobscript:
    Exit code: 1
    Real time: 0m (0s)
    CPU time: 0m (0s = 0%)
    Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-05-09 22:32:25
Saved logs: justin-logs:202177.32-justin-prod-sched02.dune.hep.ac.uk.logs.tgz

Jobscript log (last 10,000 characters)

Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Justin processors: 1
did_pfn_rse monte-carlo-006807-000822 000822 MONTECARLO
32 202177
2025-05-09 23:13:22,194	WARNING	Waiting 0.25s due to reason: server returned 503 
2025-05-09 23:13:23,489	ERROR	ConnectionError: ('Connection aborted.', RemoteDisconnected('Remote end closed connection without response'))
2025-05-09 23:13:39,072	WARNING	Waiting 1.0s due to reason: server returned 503 
Traceback (most recent call last):
  File "/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/ea2508d6d6e358db4f4fe9eb0564660bea41fb50/merge_g4bl.py", line 5, in <module>
    rc = ReplicaClient(account='justinreadonly')
  File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/rucio/client/baseclient.py", line 206, in __init__
    self.__authenticate()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/rucio/client/baseclient.py", line 974, in __authenticate
    self.__get_token()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/rucio/client/baseclient.py", line 867, in __get_token
    if not self.__get_token_x509():
  File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/rucio/client/baseclient.py", line 738, in __get_token_x509
    raise exc_cls(exc_msg)
rucio.common.exception.RucioException: An unknown exception occurred.
Details: no error information passed (http status code: 503)
Exiting with error
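
The traceback above shows the jobscript exiting because the Rucio client constructor failed during authentication while the server was returning HTTP 503. One possible mitigation, sketched here under assumptions, is to wrap the constructor in a retry loop with exponential backoff. The helper name `retry_with_backoff` and its parameters are hypothetical and not part of merge_g4bl.py, Rucio, or justIN. The attempt count and delays are illustrative only.

```python
import time


def retry_with_backoff(make_client, attempts=5, base_delay=1.0, factor=2.0,
                       retry_on=(Exception,), sleep=time.sleep):
    """Call make_client() until it succeeds or the attempts run out.

    Hypothetical helper: waits base_delay seconds after the first failure
    and multiplies the wait by factor after each further failure.
    """
    delay = base_delay
    for attempt in range(1, attempts + 1):
        try:
            return make_client()
        except retry_on as exc:
            if attempt == attempts:
                # Out of attempts: re-raise so the job still fails visibly.
                raise
            print(f"attempt {attempt} failed ({exc}); retrying in {delay:.1f}s")
            sleep(delay)
            delay *= factor


# Possible use in the jobscript (assumes the rucio package is importable,
# as it is in the traceback above):
#
#   from rucio.client.replicaclient import ReplicaClient
#   from rucio.common.exception import RucioException
#   rc = retry_with_backoff(lambda: ReplicaClient(account='justinreadonly'),
#                           retry_on=(RucioException,))
```

Passing the constructor as a callable keeps the helper independent of Rucio, so it can be exercised without network access. Whether retrying is appropriate depends on how long the 503 condition lasts; a sustained outage would still end in the same error after the final attempt.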
justIN time: 2025-05-23 00:44:02 UTC       justIN version: 01.03.01