
Jobsub ID 100232.2@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID: 100232.2@justin-prod-sched02.dune.hep.ac.uk
Workflow ID: 4117
Stage ID: 1
User name: imawby@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 43200 (12 hours)
Submitted time: 2024-11-16 22:27:12
Site: US_Wisconsin
Entry: HCCHTPC_US_Wisconsin_osg01_rhel7
Last heartbeat: 2024-11-16 23:58:45
From worker node:
  Hostname: e2553
  cpuinfo: AMD EPYC 7763 64-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 82800 (23 hours)
  Inner Apptainer?: True
Job state: outputting_failed
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2024-11-16 22:28:12
Input files: fardet-hd:nue_dune10kt_1x2x6_1063_720_20230823T132247Z_gen_g4_detsim_hitreco__20240221T030525Z_reco2.root
Jobscript:
  Exit code: 0
  Real time: 1h (5419s)
  CPU time: 2h (9098s = 167%)
Outputting started: 2024-11-16 23:58:32
Output files:
Finished: 2024-11-16 23:58:45

Jobscript log (last 10,000 characters)

Justin processors: 1
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_02
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof

local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/baeee8261a966c7bf48ffdcd205bcc1e0b341bf3/localProducts_larsoft_v09_91_02_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/2c/c8/nue_dune10kt_1x2x6_1063_720_20230823T132247Z_gen_g4_detsim_hitreco__20240221T030525Z_reco2.root
lar exit code 0
=== Start last 100 lines of lar log file ===
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 17:58:15 CST run: 1063 subRun: 1 event: 72098
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 17:58:15 CST run: 1063 subRun: 1 event: 72098
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
<ERROR>                          : 0-th variable of the event is NaN --> return MVA value -999, 
<ERROR>                          :  that's all I can do, please fix or remove this event.
Boundary wire vector sizes: 81, 98, 86
minwire 0: 2521
minwire 1: 168
minwire 2: 2741
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.135992, 
Output 1: 0.00259577, 1.94605e-05, 0.00234897, 0.995036, 
Output 2: 0.289404, 0.0300018, 0.00159287, 0.679002, 
Output 3: 0.127466, 0.668551, 0.193682, 0.0103014, 
Output 4: 0.0309523, 0.964728, 0.00421082, 0.000109073, 
Output 5: 0.999927, 6.12108e-05, 6.87595e-06, 5.2337e-06, 
Output 6: 0.983226, 0.0147331, 0.00172857, 0.000312275, 

Boundary wire vector sizes: 899, 850, 823
minwire 0: 367
minwire 1: 2311
minwire 2: 194
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.0411366, 
Output 1: 0.000197497, 0.996915, 0.00177771, 0.00110921, 
Output 2: 0.0199576, 0.137333, 0.842559, 0.000149996, 
Output 3: 0.592983, 0.406215, 0.000771337, 3.10729e-05, 
Output 4: 0.503904, 0.485377, 0.0103114, 0.000408017, 
Output 5: 0.00960309, 0.983734, 0.00661915, 4.32923e-05, 
Output 6: 0.999937, 5.41606e-05, 7.24461e-06, 2.02838e-06, 

%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 17:58:28 CST run: 1063 subRun: 1 event: 72100
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder:   LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule  16-Nov-2024 17:58:28 CST run: 1063 subRun: 1 event: 72100
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
Boundary wire vector sizes: 177, 264, 207
minwire 0: 309
minwire 1: 1943
minwire 2: 194
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max wires due to vertex determination failure: 2379, 2878
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary: 
Output 0: 0.284754, 
Output 1: 0.000116553, 0.000223625, 0.00227978, 0.99738, 
Output 2: 0.387929, 0.264621, 0.0329292, 0.314521, 
Output 3: 0.0288031, 0.0894458, 0.34641, 0.535341, 
Output 4: 0.900956, 0.096917, 0.00197704, 0.000150216, 
Output 5: 0.0648014, 0.931036, 0.00399739, 0.00016534, 
Output 6: 0.988625, 0.0106524, 0.000558554, 0.000164325, 

16-Nov-2024 17:58:30 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/2c/c8/nue_dune10kt_1x2x6_1063_720_20230823T132247Z_gen_g4_detsim_hitreco__20240221T030525Z_reco2.root"
16-Nov-2024 17:58:30 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/2c/c8/nue_dune10kt_1x2x6_1063_720_20230823T132247Z_gen_g4_detsim_hitreco__20240221T030525Z_reco2.root"

TrigReport ---------- Event summary -------------
TrigReport Events total = 100 passed = 100 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        100        100          0 ccnuselection

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 9068.809086 Real = 5385.879744

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 5251.67 VmHWM = 2439.26

Art has completed and will exit with status 0.
=== End last 100 lines of lar log file ===
justIN time: 2024-11-17 06:23:56 UTC       justIN version: 01.01.09