Jobsub ID 92147.55@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 92147.55@justin-prod-sched02.dune.hep.ac.uk |
Workflow ID | 3938 |
Stage ID | 1 |
User name | imawby@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
RSS bytes | 4194304000 (4000 MiB) |
Wall seconds limit | 43200 (12 hours) |
Submitted time | 2024-11-06 16:39:30 |
Site | US_FNAL-FermiGrid |
Entry | FNAL_GPGrid_ce03_mcore_op_duneonly |
Last heartbeat | 2024-11-06 17:32:26 |
From worker node | Hostname | dunegli-3991106-0-fnpc9075.fnal.gov |
cpuinfo | Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 4194304000 (4000 MiB) |
Wall seconds limit | 172800 (48 hours) |
Inner Apptainer? | True |
Job state | jobscript_error |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-06 16:41:53 |
Input files | fardet-hd:nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root |
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Outputting started | |
Output files | |
Finished | 2024-11-06 17:32:26 |
Saved logs | justin-logs:92147.55-justin-prod-sched02.dune.hep.ac.uk.logs.tgz |
Jobscript log (last 10,000 characters)
Justin processors: 1
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v09_91_02
MRB_QUALS=e26:prof
MRB_TOP=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7
MRB_SOURCE=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/srcs
MRB_BUILDDIR=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/localProducts_larsoft_v09_91_02_e26_prof
PRODUCTS=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/localProducts_larsoft_v09_91_02_e26_prof:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/localProducts_larsoft_v09_91_02_e26_prof
local product directory is /cvmfs/fifeuser1.opensciencegrid.org/sw/dune/0b7ef0f90224874d955352540c7c9e5d7b54b4e7/localProducts_larsoft_v09_91_02_e26_prof
----------- this block should be empty ------------------
---------------------------------------------------------
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/8b/d2/nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root
../justin-jobscript: line 64: 1399 Aborted (core dumped) lar -c $FCL_FILE $events_option "$pfn" > ${fname}_reco_${now}.log 2>&1
lar exit code 134
=== Start last 100 lines of lar log file ===
: Booked classifier "BDT" of type: "BDT"
06-Nov-2024 17:05:04 UTC Initiating request to open input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/8b/d2/nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root"
06-Nov-2024 17:05:04 UTC Initiating request to open input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/8b/d2/nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root"
06-Nov-2024 17:05:26 UTC Opened input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/8b/d2/nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root"
06-Nov-2024 17:05:26 UTC Opened input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-hd/8b/d2/nutau_dune10kt_1x2x6_1064_227_20230823T141116Z_gen_g4_detsim_hitreco__20240220T193106Z_reco2.root"
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_U_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_V_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_1_W_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_U_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_V_v04_06_00.pt'
Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_HD_Accel_2_W_v04_06_00.pt'
daughterClusterListU.size(): 7
daughterClusterListV.size(): 8
daughterClusterListW.size(): 11
daughterClusterListU.size(): 7
daughterClusterListV.size(): 8
daughterClusterListW.size(): 11
daughterClusterListU.size(): 7
daughterClusterListV.size(): 8
daughterClusterListW.size(): 11
daughterClusterListU.size(): 7
daughterClusterListV.size(): 8
daughterClusterListW.size(): 11
daughterClusterListU.size(): 0
daughterClusterListV.size(): 0
daughterClusterListW.size(): 0
daughterClusterListU.size(): 0
daughterClusterListV.size(): 0
daughterClusterListW.size(): 0
daughterClusterListU.size(): 0
daughterClusterListV.size(): 0
daughterClusterListW.size(): 0
%MSG-w ShowerPCADirection: LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule 06-Nov-2024 17:05:55 UTC run: 1064 subRun: 1 event: 22701
0 spacepoints in shower, not calculating direction
%MSG
%MSG-e ShowerProducedPtrsHolder: LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule 06-Nov-2024 17:05:55 UTC run: 1064 subRun: 1 event: 22701
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder: LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule 06-Nov-2024 17:05:55 UTC run: 1064 subRun: 1 event: 22701
Trying to add data product: InitialTrack. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder: LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule 06-Nov-2024 17:05:55 UTC run: 1064 subRun: 1 event: 22701
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
%MSG-e ShowerProducedPtrsHolder: LArPandoraModularShowerCreation:pandoraShowerRedo@BeginModule 06-Nov-2024 17:05:55 UTC run: 1064 subRun: 1 event: 22701
Trying to add data product: ShowerPCA. This element does not exist in the element holder
%MSG
: Rebuilding Dataset Default
<ERROR> : 0-th variable of the event is NaN --> return MVA value -999,
<ERROR> : that's all I can do, please fix or remove this event.
<ERROR> : 0-th variable of the event is NaN --> return MVA value -999,
<ERROR> : that's all I can do, please fix or remove this event.
<ERROR> : 0-th variable of the event is NaN --> return MVA value -999,
<ERROR> : that's all I can do, please fix or remove this event.
<ERROR> : 0-th variable of the event is NaN --> return MVA value -999,
<ERROR> : that's all I can do, please fix or remove this event.
<ERROR> : 0-th variable of the event is NaN --> return MVA value -999,
<ERROR> : that's all I can do, please fix or remove this event.
: Rebuilding Dataset Default
daughterClusterListU.size(): 118
daughterClusterListV.size(): 92
daughterClusterListW.size(): 125
daughterClusterListU.size(): 116
daughterClusterListV.size(): 89
daughterClusterListW.size(): 124
daughterClusterListU.size(): 111
daughterClusterListV.size(): 77
daughterClusterListW.size(): 117
daughterClusterListU.size(): 98
daughterClusterListV.size(): 61
daughterClusterListW.size(): 97
daughterClusterListU.size(): 0
daughterClusterListV.size(): 1
daughterClusterListW.size(): 1
daughterClusterListU.size(): 0
daughterClusterListV.size(): 1
daughterClusterListW.size(): 1
daughterClusterListU.size(): 0
daughterClusterListV.size(): 1
daughterClusterListW.size(): 1
Boundary wire vector sizes: 841, 856, 845
minwire 0: 1019
minwire 1: 1397
minwire 2: 416
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 499
Classifier summary:
Output 0: 0.349452,
Output 1: 0.00369769, 0.000121513, 0.0210017, 0.975179,
Output 2: 0.490939, 0.148579, 0.206933, 0.153549,
Output 3: 0.801054, 0.194545, 0.00422345, 0.000176816,
Output 4: 0.881157, 0.11745, 0.00137484, 1.77933e-05,
Output 5: 0.990677, 0.00836585, 0.000840953, 0.00011644,
Output 6: 0.997821, 0.00197232, 0.000162605, 4.36442e-05,
: Rebuilding Dataset Default
: Rebuilding Dataset Default
HT SHOWER: NOT ALL ORIENTATIONS COVERED
terminate called without an active exception
=== End last 100 lines of lar log file ===
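
Note on the exit code: the `Aborted (core dumped)` message and `lar exit code 134` in the log above are consistent with the common shell convention that a process killed by signal N is reported with exit status 128 + N. Here 134 - 128 = 6, which is SIGABRT on Linux, matching the `terminate called without an active exception` line at the end of the lar log. A minimal sketch of that decoding follows; `describe_exit_code` is a hypothetical helper name used for illustration only, not part of justIN or LArSoft.

```python
import signal


def describe_exit_code(code: int) -> str:
    """Interpret a shell-reported exit status.

    Assumes the usual shell convention (status > 128 means the process
    was killed by signal number status - 128). This is an illustrative
    helper, not something provided by justIN or LArSoft.
    """
    if code > 128:
        signum = code - 128
        try:
            # Look up the signal name on the current platform.
            name = signal.Signals(signum).name
        except ValueError:
            name = f"unknown signal {signum}"
        return f"terminated by {name}"
    return f"exited with status {code}"


if __name__ == "__main__":
    # 134 is the lar exit code reported in the jobscript log.
    print(describe_exit_code(134))
    # 1 is the jobscript exit code shown in the job table.
    print(describe_exit_code(1))
```

Run on a Linux worker node, the first call should report SIGABRT. The jobscript's own exit code of 1 is an ordinary non-zero status rather than a signal.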