21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 377221.10@justin-prod-sched01.dune.hep.ac.uk
Jobsub ID | 377221.10@justin-prod-sched01.dune.hep.ac.uk | |
Workflow ID | 6489 | |
Stage ID | 1 | |
User name | avizcaya@fnal.gov | |
HTCondor Group | group_dune | |
Requested | Processors | 1 |
GPU | No | |
RSS bytes | 4193255424 (3999 MiB) | |
Wall seconds limit | 80000 (22 hours) | |
Submitted time | 2025-04-29 19:29:27 | |
Site | CERN | |
Entry | CMSHTPC_T2_CH_CERN_ce515 | |
Last heartbeat | 2025-04-29 19:51:02 | |
From worker node | Hostname | b9p02p8722.cern.ch |
cpuinfo | AMD EPYC 7543 32-Core Processor | |
OS release | Scientific Linux release 7.9 (Nitrogen) | |
Processors | 1 | |
RSS bytes | 4193255424 (3999 MiB) | |
Wall seconds limit | 343800 (95 hours) | |
GPU | ||
Inner Apptainer? | True | |
Job state | jobscript_error | |
Allocator name | justin-allocator-pro.dune.hep.ac.uk | |
Started | 2025-04-29 19:31:02 | |
Input files | monte-carlo-006489-000303 | |
Jobscript | Exit code | 1 |
Real time | 0m (0s) | |
CPU time | 0m (0s = 0%) | |
Max RSS bytes | 0 (0 MiB) | |
Outputting started | ||
Output files | ||
Finished | 2025-04-29 19:32:31 | |
Saved logs | justin-logs:377221.10-justin-prod-sched01.dune.hep.ac.uk.logs.tgz | |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/ Justin processors: 1 did_pfn_rse monte-carlo-006489-000303 000303 MONTECARLO 10 377221 [31;1m2025-04-29 21:31:13,312 ERROR ConnectionError: ('Connection aborted.', ConnectionResetError(104, 'Connection reset by peer'))[0m [33;1m2025-04-29 21:31:57,785 WARNING Waiting 0.25s due to reason: server returned 503 [0m [33;1m2025-04-29 21:32:13,743 WARNING Waiting 0.5s due to reason: server returned 503 [0m [33;1m2025-04-29 21:32:30,135 WARNING Waiting 1.0s due to reason: server returned 503 [0m Querying usertests:avizcaya_g4bl_prod_041125-w6399s1p1 for 10 files Query: files from usertests:avizcaya_g4bl_prod_041125-w6399s1p1 where dune.output_status=confirmed ordered skip 3020 limit 10 Getting names and metadata done {'beam.momentum': 5.0, 'beam.polarity': -1, 'core.data_stream': 'g4beamline', 'core.data_tier': 'root-tuple', 'core.file_format': 'root', 'core.file_type': 'mc', 'core.group': 'dune', 'core.run_type': 'ehn1-beam-np04', 'dune.output_status': 'confirmed', 'retention.class': 'physics', 'retention.status': 'active', 'core.runs': [377221], 'core.runs_subruns': [37722100010]} Getting paths from rucio Traceback (most recent call last): File "/cvmfs/fifeuser2.opensciencegrid.org/sw/dune/04714e6ef575ca47529605518ed919ebaf29bea8/merge_g4bl.py", line 117, in <module> reps = rc.list_replicas(names, schemes=['root']) File "/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v37_1_0_post1/NULL/lib/python3.9/site-packages/rucio/client/replicaclient.py", line 205, in list_replicas raise exc_cls(exc_msg) rucio.common.exception.RucioException: An unknown exception occurred. Details: no error information passed (http status code: 503) Exiting with error