justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 97364.0@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID97364.0@justin-prod-sched02.dune.hep.ac.uk
Workflow ID4103
Stage ID1
User nameamoor@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2024-11-14 23:28:17
SiteUK_RAL-Tier1
EntryLIGO_UK_RAL_arc_ce04
Last heartbeat2024-11-15 01:09:14
From worker nodeHostnamedune001-2910002.0-lcg2725.gridpp.rl.ac.uk
cpuinfoAMD EPYC 9654 96-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit216000 (60 hours)
Inner Apptainer?True
Job stateoutputting_failed
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-11-14 23:30:03
Input filesusertests:000852_reco_data_2024-11-14T_093508Z.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-11-15 01:09:14
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/usertests/51/17/000852_reco_data_2024-11-14T_093508Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
=== Start last 100 lines of lar log file ===
	Brem : 280312
	conv : 45346
	Ion : 121089
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 00:13:41 UTC run: 20000031 subRun: 0 event: 847 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 00:13:41 UTC run: 20000031 subRun: 0 event: 847 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 00:13:41 UTC run: 20000031 subRun: 0 event: 847 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  15-Nov-2024 00:13:42 UTC run: 20000031 subRun: 0 event: 847
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 212310818
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  15-Nov-2024 00:13:44 UTC run: 20000031 subRun: 0 event: 847
Random seed for this event, engine 'elecDrift': 618930817
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 00:14:20 UTC run: 20000031 subRun: 0 event: 847
Random seed for this event, engine 'PDFastSim.photon': 166905747
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 00:14:20 UTC run: 20000031 subRun: 0 event: 847
Random seed for this event, engine 'PDFastSim.scinttime': 512777870
%MSG
IonAndScint endJob.
15-Nov-2024 01:08:23 UTC  Closed input file "root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/usertests/51/17/000852_reco_data_2024-11-14T_093508Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00478234      6.85664       3308.05      0.233617       114.13         847    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        5.6826e-05    0.000631238    0.0380242    0.000128594   0.00352122       847    
simulate:rns:RandomNumberSaver                1.1998e-05    2.76543e-05   0.00022434    2.5338e-05    1.08773e-05      847    
simulate:largeant:larg4Main                   0.00384071     0.383187       33.1725      0.200049       1.32744        847    
simulate:IonAndScint:IonAndScint              6.1333e-05    0.00582918      2.96572     0.000123978    0.102349        847    
simulate:elecDrift:SimDriftElectrons          2.1803e-05     0.068132       35.9204     4.1643e-05       1.242         847    
simulate:PDFastSim:PDFastSimPAR               3.1798e-05      6.30157       3235.99     6.3676e-05      111.618        847    
[art]:TriggerResults:TriggerResultInserter     5.088e-06    9.78993e-06   4.4657e-05     9.374e-06    3.45612e-06      847    
end_path:out1:RootOutput                       1.332e-06    3.68084e-06   3.0887e-05     3.565e-06    1.54641e-06      847    
end_path:out1:RootOutput(write)               0.000418512     0.09686       5.59263     0.00332348     0.347155        846    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 15-Nov-2024 01:08:23 UTC  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)           212310818   IonAndScint.ISCalcAlg
        (per event)           166905747   PDFastSim.photon
        (per event)           512777870   PDFastSim.scinttime
        (per event)           618930817   elecDrift
        (per event)           173096556   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 12202.6 MB
  Peak resident set size usage (VmHWM): 9983.81 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 847 passed = 847 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        847        847          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 5755.436445 Real = 5838.948498

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 12202.6 VmHWM = 9983.81

%MSG-s ArtException:  PostEndJob 15-Nov-2024 01:08:56 UTC ModuleEndJob
---- EventProcessorFailure BEGIN
  EventProcessor: an exception occurred during current event processing
  ---- FatalRootError BEGIN
    Fatal Root Error: TBufferFile::WriteByteCount
    bytecount too large (more than 1073741822)
    ROOT severity: 3000
  ---- FatalRootError END
---- EventProcessorFailure END
---- FatalRootError BEGIN
  Fatal Root Error: TTree::SetEntries
  Tree branches have different numbers of entries, eg EventAuxiliary has 846 entries while art::TriggerResults_TriggerResults__MUSUNGen. has 1000 entries.
  ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 2693748
-rw-r--r-- 1 dune001 dune 2755320966 Nov 15 01:08 RootOutput-fde4-157e-1167-f6df.root
-rw-r--r-- 1 dune001 dune    3057048 Nov 15 01:08 000852_reco_data_2024-11-14T_093508Z_reco_2024-11-14T_233008Z.log
-rw-r--r-- 1 dune001 dune       7528 Nov 15 01:09 jobscript.log
-rw-r--r-- 1 dune001 dune        519 Nov 15 01:08 g4_hist.root
-rw-r--r-- 1 dune001 dune        104 Nov 14 23:30 all-input-dids.txt
-rw-r--r-- 1 dune001 dune          0 Nov 15 01:08 000852_reco_data_2024-11-14T_093508Z_reco_data_2024-11-14T_233008Z.root.ext.json
-rw-r--r-- 1 dune001 dune          0 Nov 15 01:09 000852_reco_data_2024-11-14T_093508Z_reco_data_2024-11-14T_233008Z.root.json
justIN time: 2024-11-22 08:53:08 UTC       justIN version: 01.01.09