21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.

Jobsub ID 232877.144@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID: 232877.144@justin-prod-sched02.dune.hep.ac.uk
Workflow ID: 8046
Stage ID: 1
User name: higuera@fnal.gov
HTCondor Group: group_dune.prod_mcsim
Requested:
  Processors: 1
  GPU: No
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2025-07-01 03:13:10
Site: NL_SURFsara
Entry: DUNE_SurfSARA_arc02
Last heartbeat: 2025-07-01 07:54:47
From worker node:
  Hostname: wn-lb-11.gina.surf.nl
  cpuinfo: AMD EPYC 9754 128-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 129600 (36 hours)
  GPU:
  Inner Apptainer?: True
Job state: jobscript_error
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2025-07-01 03:37:49
Input files: fardet-vd:prodmarley_nue_cc_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250205T004639Z_gen_002845_supernova_g4stage1_g4stage2_detsim_reco.root
Jobscript:
  Exit code: 139
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
  Max RSS bytes: 0 (0 MiB)
Outputting started:
Output files:
Finished: 2025-07-01 07:54:47
Saved logs: justin-logs:232877.144-justin-prod-sched02.dune.hep.ac.uk.logs.tgz

Jobscript log (last 10,000 characters)

he same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  01-Jul-2025 09:31:22 CEST run: 5055 subRun: 0 event: 28448
Comparing two wires in the same plane: return failure
%MSG
Begin processing the 9th record. run: 5055 subRun: 0 event: 28449 at 01-Jul-2025 09:33:41 CEST
Error: A CaloHitList is empty
Error: A CaloHitList is empty
PandoraContentApi::GetList(*this, listname, pCaloHitList) return STATUS_CODE_NOT_INITIALIZED
    in function: Infer
    in file:     /scratch/workspace/build-larsoft/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/AlmaLinux-9.4/build/larpandoracontent/v04_12_00-buildFW/src/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 194
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0004, LArDLVertexing, STATUS_CODE_NOT_INITIALIZED
DLVertexing: Input vertex list is empty! Can't perform pass 2

========================================================================================================================================
TimeTracker printout (sec)                                Min           Avg           Max         Median          RMS         nEvts   
========================================================================================================================================
Full event                                             0.0136845      1538.42       2148.99       1790.94       617.639         9     
----------------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                                 0.0103744     0.0450614     0.145334      0.0220596     0.0482932        9     
reco:pandora:StandardPandora                            1117.44       1463.45       1908.41       1540.55       284.375         9     
reco:pandoraTrack:LArPandoraTrackCreation             0.000234822    0.0252439     0.167107     0.00496913     0.0508008        9     
reco:pandoraShower:LArPandoraModularShowerCreation    0.000248543   0.00394465     0.0158418    0.00117757    0.00570586        9     
reco:pandoracalo:Calorimetry                          0.000340722   0.00202091    0.00483263    0.000574442   0.00177404        9     
reco:pandorapid:Chi2ParticleID                        4.6069e-05    0.00027679    0.00179232    6.3656e-05    0.000537204       9     
reco:linecluster:LineCluster                           0.989896       1.07428       1.49683       1.01799      0.160063         8     
reco:trajcluster:TrajCluster                           0.359485      0.411842      0.476376      0.409507      0.0336805        8     
reco:pmtracktc:PMAlgTrackMaker                          7.0189        7.56297       8.24286       7.51869      0.426344         8     
reco:emtrkmichelid:EmTrackMichelId                      136.576       213.295       255.216       234.485       42.7764         8     
reco:solarflash:SolarOpFlash                            1.23428       1.31631       1.48661       1.26908       0.09844         8     
[art]:TriggerResults:TriggerResultInserter            2.4246e-05    4.75297e-05   0.00010724    4.5258e-05    2.44762e-05       8     
end_path:out1:RootOutput                               5.378e-06    1.20094e-05   3.5894e-05     9.299e-06    9.25307e-06       8     
end_path:out1:RootOutput(write)                        0.252277       0.27759      0.302548      0.279398      0.0156548        8     
========================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 4038.31 MB
  Peak resident set size usage (VmHWM): 2574.62 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 01-Jul-2025 09:53:38 CEST ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TNetXNGFile::ReadBuffer
        [ERROR] Server responded with an error: [3012] Failed to open file (Pool unavailable [1010])
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module LineCluster/linecluster run: 5055 subRun: 0 event: 28449
    ---- FileReadError END
    Exception going through path reco
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 8 entries while sim::OpDetDivRecs_sipmAr10ppmExt__detsim. has 10 entries.
  ROOT severity: 2000
---- FatalRootError END
---- FatalRootError BEGIN
  Fatal Root Error: TNetXNGFile::Close
  [ERROR] Server responded with an error: [3012] Failed to open file (Pool unavailable [1010])
  ROOT severity: 3000
---- FatalRootError END
%MSG
../justin-jobscript: line 74:  1139 Segmentation fault      (core dumped) lar -c reco2_supernova_dunevd10kt_1x8x14_3view_30deg_prod2024.fcl $FILE -o ${reco2_name}.root -n -1
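
The jobscript exit code 139 recorded for this job is consistent with the segmentation fault on the last log line, assuming the usual shell convention that an exit status above 128 means the process was terminated by signal number (status - 128). The sketch below decodes a status that way; `decode_exit_status` is a hypothetical helper written for illustration and is not part of justIN.

```python
import signal


def decode_exit_status(status: int) -> str:
    """Describe a shell-style exit status (assumed convention: 128 + N = signal N)."""
    if status > 128:
        signum = status - 128
        try:
            # Map the signal number to its symbolic name on this platform.
            name = signal.Signals(signum).name
        except ValueError:
            # Unknown signal number on this platform: fall back to the raw number.
            name = f"signal {signum}"
        return f"killed by {name}"
    return "success" if status == 0 else f"exited with code {status}"


# Exit code taken from the job summary for this page.
print(decode_exit_status(139))
```

On Linux, signal 11 is SIGSEGV, which matches the "Segmentation fault (core dumped)" message reported for the `lar` process.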
justIN time: 2025-08-14 18:48:01 UTC       justIN version: 01.03.02