justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.

Jobsub ID 202173.32@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID202173.32@justin-prod-sched02.dune.hep.ac.uk
Workflow ID6807
Stage ID1
User namecalcuttj@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
GPUNo
RSS bytes4193255424 (3999 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-05-09 21:10:23
SiteUS_FNAL-FermiGrid
EntryFNAL_GPGrid_ce03_mcore_op_duneonly
Last heartbeat2025-05-09 22:23:38
From worker nodeHostnamedunegli-4875927-0-fnpc23039.fnal.gov
cpuinfoAMD EPYC 7543 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4193255424 (3999 MiB)
Wall seconds limit172800 (48 hours)
GPU
Inner Apptainer?True
Job stateoutputting_failed
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2025-05-09 22:01:59
Input filesmonte-carlo-006807-000734
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-05-09 22:23:38
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
Justin processors: 1
did_pfn_rse monte-carlo-006807-000734 000734 MONTECARLO
32 202173
2025-05-09 22:02:17,092	WARNING	Waiting 0.25s due to reason: server returned 503 
2025-05-09 22:02:38,313	ERROR	ConnectionError: ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))
2025-05-09 22:02:55,244	WARNING	Waiting 0.5s due to reason: server returned 503 
2025-05-09 22:03:11,163	WARNING	Waiting 1.0s due to reason: server returned 503 
Querying usertests:calcuttj_g4bl_prod_full_neg1_042425-w6449s1p1 for 10 files
Query: files from usertests:calcuttj_g4bl_prod_full_neg1_042425-w6449s1p1 where dune.output_status=confirmed ordered skip 7330 limit 10
Getting names and metadata
done
{'beam.momentum': 1.0, 'beam.polarity': -1, 'core.data_stream': 'g4beamline', 'core.data_tier': 'root-tuple', 'core.file_format': 'root', 'core.file_type': 'mc', 'core.group': 'dune', 'core.run_type': 'ehn1-beam-np04', 'dune.output_status': 'confirmed', 'retention.class': 'physics', 'retention.status': 'active', 'core.runs': [202173], 'core.runs_subruns': [20217300032]}
Getting paths from rucio
Traceback (most recent call last):
  File "/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/ea2508d6d6e358db4f4fe9eb0564660bea41fb50/merge_g4bl.py", line 348, in <module>
    do_merge(args)
  File "/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/ea2508d6d6e358db4f4fe9eb0564660bea41fb50/merge_g4bl.py", line 93, in do_merge
    reps = rc.list_replicas(names, schemes=['root'])
  File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/rucio/client/replicaclient.py", line 207, in list_replicas
    raise exc_cls(exc_msg)
rucio.common.exception.RucioException: An unknown exception occurred.
Details: no error information passed (http status code: 503)
Exiting with error
justIN time: 2025-08-14 21:52:40 UTC       justIN version: 01.03.02