
21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.

Jobsub ID 224233.48@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID: 224233.48@justin-prod-sched02.dune.hep.ac.uk
Workflow ID: 7709
Stage ID: 1
User name: lwhite86@fnal.gov
HTCondor Group: group_dune
Requested:
    Processors: 1
    GPU: No
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 3600 (1 hours)
Submitted time: 2025-06-18 10:28:22
Site: CERN
Entry: CMSHTPC_T2_CH_CERN_ce509
Last heartbeat: 2025-06-18 10:32:40
From worker node:
    Hostname: b9p02p9149.cern.ch
    cpuinfo: AMD EPYC 7543 32-Core Processor
    OS release: Scientific Linux release 7.9 (Nitrogen)
    Processors: 1
    RSS bytes: 4194304000 (4000 MiB)
    Wall seconds limit: 343800 (95 hours)
    GPU:
    Inner Apptainer?: True
Job state: finished
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2025-06-18 10:29:38
Input files: fardet-hd:nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root
Jobscript:
    Exit code: 0
    Real time: 2m (161s)
    CPU time: 1m (81s = 50%)
    Max RSS bytes: 1448820736 (1381 MiB)
Outputting started: 2025-06-18 10:32:19
Output files: https://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/07709/1/001/trackShowerCountingValidation_nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root
Finished: 2025-06-18 10:32:40
Saved logs: justin-logs:224233.48-justin-prod-sched02.dune.hep.ac.uk.logs.tgz
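
The bracketed figures in the job summary above (MiB values and the CPU percentage) can be reproduced from the raw numbers. The following is a minimal sketch, not justIN's own code: the helper names `to_mib` and `cpu_efficiency_percent` are hypothetical, and the truncating (rather than rounding) behaviour is an assumption inferred from 1448820736 bytes being shown as 1381 MiB.

```python
# Raw values copied from the job summary above.
requested_rss_bytes = 4194304000
max_rss_bytes = 1448820736
real_seconds = 161
cpu_seconds = 81

MIB = 1024 * 1024  # bytes per mebibyte


def to_mib(n_bytes: int) -> int:
    """Whole mebibytes, discarding any fraction (assumed rule, see lead-in)."""
    return n_bytes // MIB


def cpu_efficiency_percent(cpu: int, real: int) -> int:
    """CPU time as a whole-number percentage of real (wall-clock) time."""
    return (100 * cpu) // real


print(to_mib(requested_rss_bytes))                         # requested RSS in MiB
print(to_mib(max_rss_bytes))                               # peak RSS in MiB
print(cpu_efficiency_percent(cpu_seconds, real_seconds))   # CPU efficiency in %
```

With one requested processor, a CPU efficiency near 50% means the job spent roughly half of its wall-clock time not using the CPU.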

Jobscript log (last 10,000 characters)

readout planes.
GeoApaChannelGroupService::ctor: Group 11 (apa11) has 2560 channels from 4/4 readout planes.
DuneToolManager::fclFilename: Taking fcl name from command line: lar -c /cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d1d8dcabd439683fbeb6f0240d24cb2054a5b42/runPandoraValidation.fcl root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root
AcdDigitReader::ctor:     LogLevel: 1
DuneToolManager::getPrivate: ERROR: Tool name is blank
StandardRawDigitExtractService::ctor: Retrieved digit read tool digitReader
StandardRawDigitExtractService::ctor: StandardRawDigitExtractService:
StandardRawDigitExtractService::ctor:         LogLevel: 1
StandardRawDigitExtractService::ctor:    DigitReadTool: digitReader
StandardRawDigitExtractService::ctor:   PedestalOption: 1
StandardRawDigitExtractService::ctor:     FlagStuckOff: 0
StandardRawDigitExtractService::ctor:      FlagStuckOn: 0
StandardRawDigitPrepService::ctor: Fetching extract service.
StandardRawDigitPrepService::ctor: Fetching channel status provider.
StandardRawDigitPrepService::ctor:   Channel status provider: @0xb67cb20
StandardRawDigitPrepService::ctor:   Extract service: @0xcd60530
StandardRawDigitPrepService::ctor: Fetching deconvolution service.
StandardRawDigitPrepService::ctor:   Deconvolution service: @0x3d831f0
StandardRawDigitPrepService::ctor: Fetching ROI building service.
StandardRawDigitPrepService::ctor:   ROI building service: @0x3df7140
StandardRawDigitPrepService::ctor: Fetching wire building service.
StandardRawDigitPrepService::ctor:   Wire building service: @0xb1df890
StandardRawDigitPrepService::ctor: StandardRawDigitPrepService:
StandardRawDigitPrepService::ctor:              LogLevel: 1
StandardRawDigitPrepService::ctor:               SkipBad: 1
StandardRawDigitPrepService::ctor:             SkipNoisy: 0
StandardRawDigitPrepService::ctor:   ChannelStatusOnline: 0
StandardRawDigitPrepService::ctor:          DoMitigation: 0
StandardRawDigitPrepService::ctor:  DoEarlySignalFinding: 0
StandardRawDigitPrepService::ctor:        DoNoiseRemoval: 0
StandardRawDigitPrepService::ctor:       DoDeconvolution: 1
StandardRawDigitPrepService::ctor:  DoPedestalAdjustment: 0
StandardRawDigitPrepService::ctor:                 DoROI: 1
StandardRawDigitPrepService::ctor:               DoWires: 1
StandardRawDigitPrepService::ctor:                DoDump: 0
StandardRawDigitPrepService::ctor:  DoIntermediateStates: 0
StandardRawDigitPrepService::ctor: No display tools.
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak).
Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak).
18-Jun-2025 12:30:40 CEST  Initiating request to open input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root"
18-Jun-2025 12:30:50 CEST  Opened input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root"
Begin processing the 1st record. run: 1105 subRun: 1 event: 78101 at 18-Jun-2025 12:30:51 CEST
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_U_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_V_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_W_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_U_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_V_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_W_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d1d8dcabd439683fbeb6f0240d24cb2054a5b42/pandora_track_shower_counting_network_v0.pt'
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, unknown exception
PandoraContentApi::GetList(*this, m_inputPfoListName, pPfoList) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /exp/dune/app/users/lwhite86/DUNE-FD/pandoraEventClassification/srcs/larpandoracontent/larpandoradlcontent/LArEventClassification/CNNTrackShowerCountingValidationAlgorithm.cc line#: 52
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_NOT_INITIALIZED
18-Jun-2025 12:31:09 CEST  Opened output file with pattern "%ifb_reco2.root"
Begin processing the 2nd record. run: 1105 subRun: 1 event: 78102 at 18-Jun-2025 12:31:13 CEST
 - Error! Bin 849 outside range 0 to 848
  - Underlying values: 517.5, 121.272, 517.5, 0.4667
PandoraMonitoring, only able to use default TApplication (limited functionality).
Begin processing the 3rd record. run: 1105 subRun: 1 event: 78103 at 18-Jun-2025 12:31:21 CEST
Begin processing the 4th record. run: 1105 subRun: 1 event: 78104 at 18-Jun-2025 12:31:29 CEST
Begin processing the 5th record. run: 1105 subRun: 1 event: 78105 at 18-Jun-2025 12:31:36 CEST
Begin processing the 6th record. run: 1105 subRun: 1 event: 78106 at 18-Jun-2025 12:31:42 CEST
Begin processing the 7th record. run: 1105 subRun: 1 event: 78107 at 18-Jun-2025 12:31:53 CEST
Begin processing the 8th record. run: 1105 subRun: 1 event: 78108 at 18-Jun-2025 12:31:59 CEST
Begin processing the 9th record. run: 1105 subRun: 1 event: 78109 at 18-Jun-2025 12:32:06 CEST
Skipping event as it does not have enough hits or associated primary particles to make a training sample
iter->second->Run() throw STATUS_CODE_FAILURE
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
Begin processing the 10th record. run: 1105 subRun: 1 event: 78110 at 18-Jun-2025 12:32:12 CEST
18-Jun-2025 12:32:19 CEST  Closed output file "nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2_reco2.root"
18-Jun-2025 12:32:19 CEST  Closed input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                      5.63882       8.32053       20.4111       6.31888       4.31791        10     
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                         0.0350232     0.0481042     0.0539208     0.0479187    0.00511223       10     
reco:pandora2:StandardPandora                   3.10165       5.09809        17.71         3.424        4.2524         10     
[art]:TriggerResults:TriggerResultInserter     1.188e-05    2.20071e-05    5.076e-05     1.507e-05    1.17912e-05      10     
end_path:out1:RootOutput                       2.65e-06      3.869e-06     1.046e-05     3.075e-06    2.23056e-06      10     
end_path:out1:RootOutput(write)                 2.46041       3.17409       5.86969       2.72788      0.973184        10     
================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 2451.17 MB
  Peak resident set size usage (VmHWM): 1448.82 MB
====================================================================================================
Art has completed and will exit with status 0.
lar exit code 0
total 103268
-rw-r--r--. 1 duneprd np-comp       216 Jun 18 12:29 all-input-dids.txt
-rw-r--r--. 1 duneprd np-comp         0 Jun 18 12:30 debugprod.log
-rw-r--r--. 1 duneprd np-comp     33215 Jun 18 12:32 jobscript.log
-rw-r--r--. 1 duneprd np-comp       187 Jun 18 12:32 justin-processed-pfns.txt
drwxr-xr-x. 4 duneprd np-comp        48 Jun 18 12:29 larpandoracontent
-rw-r--r--. 1 duneprd np-comp 105681032 Jun 18 12:32 nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2_reco2.root
-rw-r--r--. 1 duneprd np-comp       519 Jun 18 12:32 reco2_hist.root
-rw-r--r--. 1 duneprd np-comp      8422 Jun 18 12:32 trackShowerCountingValidation_nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root
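
The log above ends with `lar exit code 0` even though it contains several `Failure in algorithm` lines, so the exit code alone does not show them. A minimal sketch for tallying such lines follows; it assumes the log text is already available as a string, the function name `count_algorithm_failures` is hypothetical, and the regular expression is fitted only to the line format seen in this log.

```python
import re
from collections import Counter

# Matches lines such as:
#   Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
FAILURE_RE = re.compile(r"^Failure in algorithm (\w+), (\w+), (.+)$")


def count_algorithm_failures(log_text: str) -> Counter:
    """Count failure lines, keyed by (algorithm name, reported reason)."""
    counts = Counter()
    for line in log_text.splitlines():
        match = FAILURE_RE.match(line.strip())
        if match:
            _alg_id, alg_name, reason = match.groups()
            counts[(alg_name, reason)] += 1
    return counts


# Sample lines copied from the jobscript log above.
sample_log = """\
Failure in algorithm Alg0004, LArCNNTrackShowerCounting, unknown exception
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_NOT_INITIALIZED
Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE
lar exit code 0
"""

failures = count_algorithm_failures(sample_log)
for (alg_name, reason), n in sorted(failures.items()):
    print(f"{alg_name}: {reason} x{n}")
```

Keying on the algorithm name and reason, rather than the `Alg0004`-style identifier, groups repeated failures of the same kind together.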
justIN time: 2025-08-13 18:38:19 UTC       justIN version: 01.03.02