justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 270223.0@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID270223.0@justin-prod-sched01.dune.hep.ac.uk
Workflow ID3586
Stage ID1
User namecalcuttj@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
RSS bytes4193255424 (3999 MiB)
Wall seconds limit18000 (5 hours)
Submitted time2024-10-05 17:27:25
SiteUS_FNAL-FermiGrid
EntryFNAL_GPGrid_ce04_mcore_op_duneonly
Last heartbeat2024-10-05 20:22:09
From worker nodeHostnamedunegli-4177744-0-fnpc17156.fnal.gov
cpuinfoIntel(R) Xeon(R) Gold 6140 CPU @ 2.30GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4193255424 (3999 MiB)
Wall seconds limit172800 (48 hours)
Inner Apptainer?True
Job statejobscript_error
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-10-05 19:02:49
Input fileshd-protodune:np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322.hdf5
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-10-05 20:22:09
Saved logsjustin-logs:270223.0-justin-prod-sched01.dune.hep.ac.uk.logs.tgz
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

ut nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
RawFrameSource: got 10240 raw::RawDigit objects
	input nticks=5859 keeping as is
Retagger: tagging trace set: raw with 10240 traces, 0 summary
wclsFrameSaver: saving raw::RawDigits tagged "raw"
wclsFrameSaver: found empty channel masks for "bad"
05-Oct-2024 20:17:29 UTC  Closed output file "np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322_reco_stage1.root"

===================================================================================================================================
TimeTracker printout (sec)                           Min           Avg           Max         Median          RMS         nEvts   
===================================================================================================================================
Full event                                         33.5594       105.903       371.118       85.7166       80.7752        40     
-----------------------------------------------------------------------------------------------------------------------------------
source:HDF5RawInput3(read)                       2.3222e-05    5.20356e-05   0.000151863   4.9483e-05    2.02014e-05      40     
produce:tpcrawdecoder:PDHDTPCReader                13.6922       19.5382       123.115       16.558        16.6897        40     
produce:triggerrawdecoder:PDHDTriggerReader3      0.0540946     0.0617006     0.314905      0.0551069     0.0405498       40     
produce:timingrawdecoder:PDHDTimingRawDecoder     0.0271617     0.0275231     0.0280708     0.0274992    0.00020287       40     
produce:pdhddaphne:DAPHNEReaderPDHD              0.00039256    0.000571837   0.00113246    0.000512798   0.000154756      40     
produce:fembfilter:PDHDFEMBFilter                7.8823e-05    0.000120296   0.00037939    0.000106849   5.25105e-05      40     
produce:wclsdatahdfilter:WireCellToolkit           11.828        16.0012       26.8572       14.1514       3.89719        40     
[art]:TriggerResults:TriggerResultInserter       1.5007e-05    3.57557e-05   0.000100396   3.2664e-05    1.58684e-05      40     
end_path:out1:RootOutput                          3.084e-06    6.8177e-06    2.4406e-05     6.551e-06    3.51449e-06      40     
end_path:out1:RootOutput(write)                    4.65156       70.272        336.013       48.2079       78.9774        40     
===================================================================================================================================

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 3101.42 MB
  Peak resident set size usage (VmHWM): 1590.36 MB
  Details saved in: 'mem.db'
====================================================================================================
Art has completed and will exit with status 0.
The following module labels are either not assigned to any path,
or they have been assigned to ignored path(s):
  calibana
  crtreco
  crttag
  opdec
  opflash
  ophitspe
  opslicer
  pdhddaphne
  timingrawdecoder
  tpcrawdecoder
  triggerrawdecoder
Info in <TGeoManager::Import>: Reading geometry from file: /cvmfs/dune.opensciencegrid.org/products/dune/dunecore/v09_91_02d01/gdml/protodunehd_v6_refactored.gdml
Info in <TGeoManager::TGeoManager>: Geometry GDMLImport, Geometry imported from GDML created
Info in <TGeoManager::SetTopVolume>: Top volume is volWorld. Master volume is volWorld
Info in <TGeoNavigator::BuildCache>: --- Maximum geometry depth set to 100
Info in <TGeoManager::CheckGeometry>: Fixing runtime shapes...
Info in <TGeoManager::CheckGeometry>: ...Nothing to fix
Info in <TGeoManager::CloseGeometry>: Counting nodes...
Info in <TGeoManager::Voxelize>: Voxelizing...
Info in <TGeoManager::CloseGeometry>: Building cache...
Info in <TGeoManager::CountLevels>: max level = 5, max placements = 1148
Info in <TGeoManager::CloseGeometry>: 25608 nodes/ 5535 volume UID's in Geometry imported from GDML
Info in <TGeoManager::CloseGeometry>: ----------------modeler ready----------------
DAPHNE Channel Map: Building DAPHNE channel map from file DAPHNE_test5_ChannelMap_v1.txt
PD2HD Channel Map: Building TPC wiremap from file PD2HDChannelMap_WIBEth_electronics_v1.txt
tf_graph loaded ProtoBuf graph with status: OK
Inputer: "wclsRawFrameSource"
Outputer: "wclsFrameSaver:spsaver"
wclsFrameSaver: promising to produce recob::Wires named "gauss"
wclsFrameSaver: promising to produce recob::Wires named "wiener"
wclsFrameSaver: promising to produce channel summary named "threshold"
05-Oct-2024 20:19:11 UTC  Initiating request to open input file "np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322_reco_stage1.root"
05-Oct-2024 20:19:15 UTC  Opened input file "np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322_reco_stage1.root"
job begin...
job begin...
Scale HitLimit based on readout window size 5859
HitLimit = 19530
Begin processing the 1st record. run: 27766 subRun: 1 event: 2121 at 05-Oct-2024 20:19:15 UTC
05-Oct-2024 20:21:26 UTC  Opened output file with pattern "%ifb_reco_stage2_%tc_keepup.root"
05-Oct-2024 20:21:31 UTC  Closed input file "np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322_reco_stage1.root"
Malformed TimeTracker database.  The TimeEvent table is empty, but
the TimeModule table is not.  This can happen if an exception has
been thrown from a module while processing the first event.  Any
saved database file is suspect and should not be used.

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 3712.1 MB
  Peak resident set size usage (VmHWM): 1351.81 MB
  Details saved in: 'mem.db'
====================================================================================================
%MSG-s ArtException:  PostEndJob 05-Oct-2024 20:21:43 UTC ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- ScheduleExecutionFailure BEGIN
    Path: ProcessingStopped.
    ---- FileReadError BEGIN
      ---- FatalRootError BEGIN
        Fatal Root Error: TFile::ReadBuffer
        error reading all requested bytes from file np04hd_raw_run027766_0053_dataflow0_datawriter_0_20240705T215322_reco_stage1.root, got 12018 of 56539684
        ROOT severity: 3000
      ---- FatalRootError END
      
      The above exception was thrown while processing module WireCellToolkit/wclsdatahd run: 27766 subRun: 1 event: 2121
    ---- FileReadError END
    Exception going through path produce
  ---- ScheduleExecutionFailure END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 0 entries while dunedaq::trgdataformats::TriggerPrimitives_triggerrawdecoder_daqinTAs_pdhdkeepupstage1. has 40 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
Error in reco2
justIN time: 2024-11-24 12:29:47 UTC       justIN version: 01.01.09