Jobsub ID 301707.70@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID: 301707.70@justin-prod-sched01.dune.hep.ac.uk
Workflow ID: 4213
Stage ID: 1
User name: ismerio@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2024-11-23 04:33:23
Site: UK_Imperial
Entry: DUNE_T2_UK_London_IC_ceprod01
Last heartbeat: 2024-11-23 04:46:50
From worker node:
  Hostname: wj47.grid.hep.ph.ic.ac.uk
  cpuinfo: Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 4194304000 (4000 MiB)
  Wall seconds limit: 171000 (47 hours)
  Inner Apptainer?: True
Job state: outputting_failed
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2024-11-23 04:34:50
Input files: fardet-hd:atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515285_270_20231201T122216Z_gen_g4_detsim_hitreco__20240507T191911Z_reco2.root
Jobscript:
  Exit code: 0
  Real time: 11m (700s)
  CPU time: 1m (106s = 15%)
Outputting started: 2024-11-23 04:46:31
Output files:
Finished: 2024-11-23 04:46:50

Jobscript log (last 10,000 characters)

/srcs/larpandoracontent/larpandoradlcontent/LArVertex/DlVertexingAlgorithm.cc line#: 624
Failure in algorithm Alg0003, LArDLVertexing, STATUS_CODE_NOT_INITIALIZED
> Running Algorithm: Alg0003, LArNeutrinoEventValidation
---RAW-MATCHING-OUTPUT--------------------------------------------------------------------------
Found: 13 with KE=0.356449 GeV, E=0.462108 GeV, p2=0.20238
Found: 2212 with KE=0.347103 GeV, E=1.28538 GeV, p2=0.771836
OTHER_INTERACTION (Nuance 1000, Nu 1, CR 0)
IsLost (NNuLosses: 2) 
Parent Neutrino: 14
Neutrino Energy: 0.931254
Neutrino Momentum:   x: 0.425439  y: 0.52459  z: 0.641131 length: 0.931258
PrimaryId 1, Nu 1, CR 0, MCPDG 13, Energy 0.462108, Dist. 156.606, nMCHits 727 (269, 187, 271)
-No matched Pfo
Parent Neutrino: 14
Neutrino Energy: 0.931254
Neutrino Momentum:   x: 0.425439  y: 0.52459  z: 0.641131 length: 0.931258
PrimaryId 2, Nu 1, CR 0, MCPDG 2212, Energy 1.28538, Dist. 30.6536, nMCHits 86 (36, 36, 14)
-No matched Pfo

------------------------------------------------------------------------------------------------

---INTERPRETED-MATCHING-OUTPUT------------------(my version)-----------------------------------------
Found: 13 with KE=0.356449 GeV, E=0.462108 GeV, p2=0.20238
Found: 2212 with KE=0.347103 GeV, E=1.28538 GeV, p2=0.771836
OTHER_INTERACTION (Nuance 1000, Nu 1, CR 0)
IsLost (NNuLosses: 2) 
Parent Neutrino: 14
Neutrino Energy: 0.931254
Neutrino Momentum:   x: 0.425439  y: 0.52459  z: 0.641131 length: 0.931258
PrimaryId 1, Nu 1, CR 0, MCPDG 13, Energy 0.462108, Dist. 156.606, nMCHits 727 (269, 187, 271)
-No matched Pfo
Parent Neutrino: 14
Neutrino Energy: 0.931254
Neutrino Momentum:   x: 0.425439  y: 0.52459  z: 0.641131 length: 0.931258
PrimaryId 2, Nu 1, CR 0, MCPDG 2212, Energy 1.28538, Dist. 30.6536, nMCHits 86 (36, 36, 14)
-No matched Pfo

---SUMMARY--------------------------------------------------------------------------------------
#CorrectNu: 0/1, Fraction: 0
#Lost: 1 
------------------------------------------------------------------------------------------------

Found: 13 with KE=0.356449 GeV, E=0.462108 GeV, p2=0.20238
PandoraMonitoring, only able to use default TApplication (limited functionality).
Found: 2212 with KE=0.347103 GeV, E=1.28538 GeV, p2=0.771836
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:42 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:42 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:42 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
0 enabled hits in AverageDist2 calculation.
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Track empty.
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtrack@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Hit sorting problem.
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:43 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:44 GMT run: 50515285 subRun: 1 event: 27001
0 enabled hits in AverageDist2 calculation.
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:44 GMT run: 50515285 subRun: 1 event: 27001
Track empty.
%MSG
%MSG-e pma::Track3D:  PMAlgTrackMaker:pmtracktc@BeginModule  23-Nov-2024 04:40:44 GMT run: 50515285 subRun: 1 event: 27001
Hit sorting problem.
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:44 GMT run: 50515285 subRun: 1 event: 27001
1st wire C:0 T:7 P:2 W:500 does not exist (max wire number: 480)
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:44 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:45 GMT run: 50515285 subRun: 1 event: 27001
1st wire C:0 T:7 P:2 W:480 does not exist (max wire number: 480)
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:45 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:45 GMT run: 50515285 subRun: 1 event: 27001
1st wire C:0 T:7 P:2 W:804 does not exist (max wire number: 480)
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:45 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e WireIDIntersectionCheck:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:45 GMT run: 50515285 subRun: 1 event: 27001
Comparing two wires in the same plane: return failure
%MSG
%MSG-e pma::Track3D:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:46 GMT run: 50515285 subRun: 1 event: 27001
0 enabled hits in AverageDist2 calculation.
%MSG
%MSG-e pma::Track3D:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:46 GMT run: 50515285 subRun: 1 event: 27001
Track empty.
%MSG
%MSG-e pma::Track3D:  PMAlgTrajFitter:pmtrajfittc@BeginModule  23-Nov-2024 04:40:46 GMT run: 50515285 subRun: 1 event: 27001
TuneFullTree failed.
%MSG
Boundary wire vector sizes: 339, 259, 315
minwire 0: 747
minwire 1: 1859
minwire 2: 506
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 199
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 199
Used alternate method to get min and max tdcs due to vertex determination failure: 0, 199
Could not find serving_default in model signatures.
[libprotobuf FATAL /cvmfs/larsoft.opensciencegrid.org/products/protobuf/v3_21_12a/Linux64bit+3.10-2.17-e26/include/google/protobuf/map.h:1300] CHECK failed: it != end(): key not found: serving_default
23-Nov-2024 04:41:26 GMT  Opened output file with pattern "atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515285_270_20231201T122216Z_gen_g4_detsim_hitreco__20240507T191911Z_reco2_reco_data_2024-11-23T_043455Z.root"
23-Nov-2024 04:46:29 GMT  Closed input file "root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/fardet-hd/62/69/atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515285_270_20231201T122216Z_gen_g4_detsim_hitreco__20240507T191911Z_reco2.root"
Malformed TimeTracker database.  The TimeEvent table is empty, but
the TimeModule table is not.  This can happen if an exception has
been thrown from a module while processing the first event.  Any
saved database file is suspect and should not be used.

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 7078.28 MB
  Peak resident set size usage (VmHWM): 1231.74 MB
====================================================================================================
%MSG-s ArtException:  PostEndJob 23-Nov-2024 04:46:30 GMT ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- StdException BEGIN
      An exception was thrown while processing module CVNEvaluator/cvneva run: 50515285 subRun: 1 event: 27001
      CHECK failed: it != end(): key not found: serving_default
    ---- StdException END
    Exception going through path reco
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 0 entries while sim::OpDetDivRecs_opdigi__detsim. has 100 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
=== End last 100 lines of lar log file ===
RootOutput-d977-5c56-253b-357e.root
Validation_ccnc_atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515285_270_20231201T122216Z_gen_g4_detsim_hitreco__20240507T191911Z_reco2.root
all-input-dids.txt
atmnu_max_weighted_randompolicy_dune10kt_1x2x6_50515285_270_20231201T122216Z_gen_g4_detsim_hitreco__20240507T191911Z_reco2_reco_2024-11-23T_043455Z.log
debugprod.log
jobscript.log
justin-processed-pfns.txt
reco2_hist.root
MyPandoraSettings_Master_Atmos_DUNEFD.xml
MyPandoraSettings_Master_DUNEFD.xml
build_slf7.x86_64
localProducts_larsoft_v09_91_02_e26_prof
setup_env-testreco.sh
srcs
temp.txt
temp2.txt
test
work
justIN time: 2024-11-23 15:12:37 UTC       justIN version: 01.01.09