justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 292802.1@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID292802.1@justin-prod-sched01.dune.hep.ac.uk
Workflow ID4123
Stage ID1
User nameimawby@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit43200 (12 hours)
Submitted time2024-11-16 21:37:21
SiteUS_Wisconsin
EntryHCCHTPC_US_Wisconsin_osg01_rhel7
Last heartbeat2024-11-17 00:11:15
From worker nodeHostnamee4044
cpuinfoAMD EPYC 7763 64-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit82800 (23 hours)
Inner Apptainer?True
Job stateoutputting_failed
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-11-16 21:38:38
Input filesfardet-hd:nue_dune10kt_1x2x6_1115_373_20230827T163357Z_gen_g4_detsim_hitreco__20240221T073004Z_reco2.root
JobscriptExit code0
Real time2h (9142s)
CPU time3h (11672s = 127%)
Outputting started2024-11-17 00:11:00
Output files
Finished2024-11-17 00:11:15
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

Justin processors: 1
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_02
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/f2/db/nue_dune10kt_1x2x6_1115_373_20230827T163357Z_gen_g4_detsim_hitreco__20240221T073004Z_reco2.root
lar exit code 0
=== Start last 100 lines of lar log file ===
Output 5: 0.994388, 0.0055844, 2.53742e-05, 1.88199e-06, 
Output 6: 0.999378, 0.000583534, 3.12404e-05, 7.73181e-06, 

PandoraContentApi::GetList(*this, m_inputHitListName, pCaloHitList) return STATUS_CODE_NOT_INITIALIZED
    in function: GetVolumeIdToHitListMap
    in file:     /exp/dune/app/users/imawby/dunesw_v09_91_02d00/srcs/larpandoracontent/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 271
this->GetVolumeIdToHitListMap(volumeIdToHitListMap) return STATUS_CODE_NOT_INITIALIZED
    in function: Run
    in file:     /exp/dune/app/users/imawby/dunesw_v09_91_02d00/srcs/larpandoracontent/larpandoracontent/LArControlFlow/MasterAlgorithm.cc line#: 165
iter->second->Run() throw STATUS_CODE_NOT_INITIALIZED
    in function: RunAlgorithm
    in file:     /scratch/workspace/build-larbase/BUILDTYPE/prof/QUAL/s131-e26/label1/swarm/label2/SLF7/build/pandora/v03_16_00l/src/pandora-v03-16-00/PandoraSDK-v03-04-01/src/Api/PandoraContentApiImpl.cc line#: 235
Failure in algorithm Alg0002, LArDLMaster, STATUS_CODE_NOT_INITIALIZED
%MSG-w ShowerPCADirection:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
0 spacepoints in shower, not calculating direction
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 18:10:39 CST run: 1115 subRun: 1 event: 37399
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
Boundary wire vector sizes: 988, 953, 912
minwire 0: 1547
minwire 1: 658
minwire 2: 1380
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.109129, 
Output 1: 0.000758401, 0.989464, 0.00650774, 0.00326972, 
Output 2: 0.00748816, 0.34093, 0.64966, 0.001922, 
Output 3: 0.248158, 0.564725, 0.17168, 0.0154365, 
Output 4: 0.0567514, 0.301045, 0.565999, 0.0762044, 
Output 5: 0.937876, 0.0615619, 0.000535691, 2.61393e-05, 
Output 6: 0.999102, 0.000799034, 6.7124e-05, 3.15373e-05, 

Boundary wire vector sizes: 596, 636, 577
minwire 0: 1009
minwire 1: 1330
minwire 2: 986
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.319845, 
Output 1: 6.45516e-05, 0.993303, 0.00650983, 0.000122903, 
Output 2: 0.00534249, 0.70864, 0.285025, 0.000992833, 
Output 3: 0.0451812, 0.60576, 0.331087, 0.0179721, 
Output 4: 0.0285176, 0.969673, 0.00178839, 2.11787e-05, 
Output 5: 0.999715, 0.000277917, 5.7487e-06, 1.48201e-06, 
Output 6: 0.990415, 0.00909217, 0.000403529, 8.95788e-05, 

%MSG-e DUNEAna:  dunereco/FDSelections/CCNuSelection:ccnuselection@BeginModule  16-Nov-2024 18:10:55 CST run: 1115 subRun: 1 event: 37400
 Failed to find product with label caldata ... returning empty vector
%MSG
%MSG-e DUNEAna:  dunereco/FDSelections/CCNuSelection:ccnuselection@BeginModule  16-Nov-2024 18:10:55 CST run: 1115 subRun: 1 event: 37400
 Failed to find product with label caldata ... returning empty vector
%MSG
16-Nov-2024 18:10:59 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/f2/db/nue_dune10kt_1x2x6_1115_373_20230827T163357Z_gen_g4_detsim_hitreco__20240221T073004Z_reco2.root"
16-Nov-2024 18:10:59 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/f2/db/nue_dune10kt_1x2x6_1115_373_20230827T163357Z_gen_g4_detsim_hitreco__20240221T073004Z_reco2.root"

TrigReport ---------- Event summary -------------
TrigReport Events total = 100 passed = 100 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        100        100          0 ccnuselection

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 11629.373184 Real = 9093.646857

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 5087.49 VmHWM = 2557.61

Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
justIN time: 2024-11-17 06:32:56 UTC       justIN version: 01.01.09