21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 224233.48@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 224233.48@justin-prod-sched02.dune.hep.ac.uk | |
Workflow ID | 7709 | |
Stage ID | 1 | |
User name | lwhite86@fnal.gov | |
HTCondor Group | group_dune | |
Requested | Processors | 1 |
GPU | No | |
RSS bytes | 4194304000 (4000 MiB) | |
Wall seconds limit | 3600 (1 hours) | |
Submitted time | 2025-06-18 10:28:22 | |
Site | CERN | |
Entry | CMSHTPC_T2_CH_CERN_ce509 | |
Last heartbeat | 2025-06-18 10:32:40 | |
From worker node | Hostname | b9p02p9149.cern.ch |
cpuinfo | AMD EPYC 7543 32-Core Processor | |
OS release | Scientific Linux release 7.9 (Nitrogen) | |
Processors | 1 | |
RSS bytes | 4194304000 (4000 MiB) | |
Wall seconds limit | 343800 (95 hours) | |
GPU | ||
Inner Apptainer? | True | |
Job state | finished | |
Allocator name | justin-allocator-pro.dune.hep.ac.uk | |
Started | 2025-06-18 10:29:38 | |
Input files | fardet-hd:nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root | |
Jobscript | Exit code | 0 |
Real time | 2m (161s) | |
CPU time | 1m (81s = 50%) | |
Max RSS bytes | 1448820736 (1381 MiB) | |
Outputting started | 2025-06-18 10:32:19 | |
Output files | https://fndcadoor.fnal.gov:2880/dune/scratch/users/lwhite86/07709/1/001/trackShowerCountingValidation_nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root | |
Finished | 2025-06-18 10:32:40 | |
Saved logs | justin-logs:224233.48-justin-prod-sched02.dune.hep.ac.uk.logs.tgz | |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
readout planes. GeoApaChannelGroupService::ctor: Group 11 (apa11) has 2560 channels from 4/4 readout planes. DuneToolManager::fclFilename: Taking fcl name from command line: lar -c /cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d1d8dcabd439683fbeb6f0240d24cb2054a5b42/runPandoraValidation.fcl root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root AcdDigitReader::ctor: LogLevel: 1 DuneToolManager::getPrivate: ERROR: Tool name is blank StandardRawDigitExtractService::ctor: Retrieved digit read tool digitReader StandardRawDigitExtractService::ctor: StandardRawDigitExtractService: StandardRawDigitExtractService::ctor: LogLevel: 1 StandardRawDigitExtractService::ctor: DigitReadTool: digitReader StandardRawDigitExtractService::ctor: PedestalOption: 1 StandardRawDigitExtractService::ctor: FlagStuckOff: 0 StandardRawDigitExtractService::ctor: FlagStuckOn: 0 StandardRawDigitPrepService::ctor: Fetching extract service. StandardRawDigitPrepService::ctor: Fetching channel status provider. StandardRawDigitPrepService::ctor: Channel status provider: @0xb67cb20 StandardRawDigitPrepService::ctor: Extract service: @0xcd60530 StandardRawDigitPrepService::ctor: Fetching deconvolution service. StandardRawDigitPrepService::ctor: Deconvolution service: @0x3d831f0 StandardRawDigitPrepService::ctor: Fetching ROI building service. StandardRawDigitPrepService::ctor: ROI building service: @0x3df7140 StandardRawDigitPrepService::ctor: Fetching wire building service. StandardRawDigitPrepService::ctor: Wire building service: @0xb1df890 StandardRawDigitPrepService::ctor: StandardRawDigitPrepService: StandardRawDigitPrepService::ctor: LogLevel: 1 StandardRawDigitPrepService::ctor: SkipBad: 1 StandardRawDigitPrepService::ctor: SkipNoisy: 0 StandardRawDigitPrepService::ctor: ChannelStatusOnline: 0 StandardRawDigitPrepService::ctor: DoMitigation: 0 StandardRawDigitPrepService::ctor: DoEarlySignalFinding: 0 StandardRawDigitPrepService::ctor: DoNoiseRemoval: 0 StandardRawDigitPrepService::ctor: DoDeconvolution: 1 StandardRawDigitPrepService::ctor: DoPedestalAdjustment: 0 StandardRawDigitPrepService::ctor: DoROI: 1 StandardRawDigitPrepService::ctor: DoWires: 1 StandardRawDigitPrepService::ctor: DoDump: 0 StandardRawDigitPrepService::ctor: DoIntermediateStates: 0 StandardRawDigitPrepService::ctor: No display tools. Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak). Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak). Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak). 18-Jun-2025 12:30:40 CEST Initiating request to open input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root" 18-Jun-2025 12:30:50 CEST Opened input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root" Begin processing the 1st record. run: 1105 subRun: 1 event: 78101 at 18-Jun-2025 12:30:51 CEST Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_U_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_V_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_W_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_U_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_V_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_W_v04_06_00.pt' Loaded the TorchScript model '/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d1d8dcabd439683fbeb6f0240d24cb2054a5b42/pandora_track_shower_counting_network_v0.pt' Failure in algorithm Alg0004, LArCNNTrackShowerCounting, unknown exception PandoraContentApi::GetList(*this, m_inputPfoListName, pPfoList) return STATUS_CODE_NOT_INITIALIZED in function: Run in file: /exp/dune/app/users/lwhite86/DUNE-FD/pandoraEventClassification/srcs/larpandoracontent/larpandoradlcontent/LArEventClassification/CNNTrackShowerCountingValidationAlgorithm.cc line#: 52 iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED in function: RunAlgorithm in file: /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235 Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_NOT_INITIALIZED 18-Jun-2025 12:31:09 CEST Opened output file with pattern "%ifb_reco2.root" Begin processing the 2nd record. run: 1105 subRun: 1 event: 78102 at 18-Jun-2025 12:31:13 CEST - Error! Bin 849 outside range 0 to 848 - Underlying values: 517.5, 121.272, 517.5, 0.4667 PandoraMonitoring, only able to use default TApplication (limited functionality). Begin processing the 3rd record. run: 1105 subRun: 1 event: 78103 at 18-Jun-2025 12:31:21 CEST Begin processing the 4th record. run: 1105 subRun: 1 event: 78104 at 18-Jun-2025 12:31:29 CEST Begin processing the 5th record. run: 1105 subRun: 1 event: 78105 at 18-Jun-2025 12:31:36 CEST Begin processing the 6th record. run: 1105 subRun: 1 event: 78106 at 18-Jun-2025 12:31:42 CEST Begin processing the 7th record. run: 1105 subRun: 1 event: 78107 at 18-Jun-2025 12:31:53 CEST Begin processing the 8th record. run: 1105 subRun: 1 event: 78108 at 18-Jun-2025 12:31:59 CEST Begin processing the 9th record. run: 1105 subRun: 1 event: 78109 at 18-Jun-2025 12:32:06 CEST Skipping event as it does not have enough hits or associated primary particles to make a training sample iter->second->Run() throw STATUS_CODE_FAILURE in function: RunAlgorithm in file: /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235 Failure in algorithm Alg0005, LArCNNTrackShowerCountingValidation, STATUS_CODE_FAILURE Begin processing the 10th record. run: 1105 subRun: 1 event: 78110 at 18-Jun-2025 12:32:12 CEST 18-Jun-2025 12:32:19 CEST Closed output file "nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2_reco2.root" 18-Jun-2025 12:32:19 CEST Closed input file "root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/fardet-hd/ac/e3/nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root" ================================================================================================================================ TimeTracker printout (sec) Min Avg Max Median RMS nEvts ================================================================================================================================ Full event 5.63882 8.32053 20.4111 6.31888 4.31791 10 -------------------------------------------------------------------------------------------------------------------------------- source:RootInput(read) 0.0350232 0.0481042 0.0539208 0.0479187 0.00511223 10 reco:pandora2:StandardPandora 3.10165 5.09809 17.71 3.424 4.2524 10 [art]:TriggerResults:TriggerResultInserter 1.188e-05 2.20071e-05 5.076e-05 1.507e-05 1.17912e-05 10 end_path:out1:RootOutput 2.65e-06 3.869e-06 1.046e-05 3.075e-06 2.23056e-06 10 end_path:out1:RootOutput(write) 2.46041 3.17409 5.86969 2.72788 0.973184 10 ================================================================================================================================ ==================================================================================================== MemoryTracker summary (base-10 MB units used) Peak virtual memory usage (VmPeak) : 2451.17 MB Peak resident set size usage (VmHWM): 1448.82 MB ==================================================================================================== Art has completed and will exit with status 0. lar exit code 0 total 103268 -rw-r--r--. 1 duneprd np-comp 216 Jun 18 12:29 all-input-dids.txt -rw-r--r--. 1 duneprd np-comp 0 Jun 18 12:30 debugprod.log -rw-r--r--. 1 duneprd np-comp 33215 Jun 18 12:32 jobscript.log -rw-r--r--. 1 duneprd np-comp 187 Jun 18 12:32 justin-processed-pfns.txt drwxr-xr-x. 4 duneprd np-comp 48 Jun 18 12:29 larpandoracontent -rw-r--r--. 1 duneprd np-comp 105681032 Jun 18 12:32 nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2_reco2.root -rw-r--r--. 1 duneprd np-comp 519 Jun 18 12:32 reco2_hist.root -rw-r--r--. 1 duneprd np-comp 8422 Jun 18 12:32 trackShowerCountingValidation_nutau_dune10kt_1x2x6_1105_781_20230826T143808Z_gen_g4_detsim_hitreco__20240406T121347Z_reco2.root