Jobsub ID 97261.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 97261.0@justin-prod-sched02.dune.hep.ac.uk |
Workflow ID | 4103 |
Stage ID | 1 |
User name | amoor@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 80000 (22 hours) |
Submitted time | 2024-11-14 22:42:03 |
Site | UK_RAL-Tier1 |
Entry | LIGO_UK_RAL_arc_ce03 |
Last heartbeat | 2024-11-15 00:05:06 |
From worker node | Hostname | dune001-3137520.0-lcg2610.gridpp.rl.ac.uk |
cpuinfo | AMD EPYC 7763 64-Core Processor |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 216000 (60 hours) |
Inner Apptainer? | True |
Job state | outputting_failed |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-14 22:43:06 |
Input files | usertests:000338_reco_data_2024-11-14T_092915Z.root
|
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Outputting started | |
Output files | |
Finished | 2024-11-15 00:05:06 |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
AFM g4 jobscript.
Input PFN = root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/usertests/c0/76/000338_reco_data_2024-11-14T_092915Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
=== Start last 100 lines of lar log file ===
Brem : 191667
conv : 31460
Ion : 84396
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 23:19:02 UTC run: 20000031 subRun: 0 event: 795 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 23:19:02 UTC run: 20000031 subRun: 0 event: 795 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 23:19:02 UTC run: 20000031 subRun: 0 event: 795 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService: IonAndScint:IonAndScint@BeginModule 14-Nov-2024 23:19:02 UTC run: 20000031 subRun: 0 event: 795
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 56695298
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService: SimDriftElectrons:elecDrift@BeginModule 14-Nov-2024 23:19:04 UTC run: 20000031 subRun: 0 event: 795
Random seed for this event, engine 'elecDrift': 373412406
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 14-Nov-2024 23:19:30 UTC run: 20000031 subRun: 0 event: 795
Random seed for this event, engine 'PDFastSim.photon': 244322826
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 14-Nov-2024 23:19:30 UTC run: 20000031 subRun: 0 event: 795
Random seed for this event, engine 'PDFastSim.scinttime': 284352464
%MSG
IonAndScint endJob.
15-Nov-2024 00:04:21 UTC Closed input file "root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/usertests/c0/76/000338_reco_data_2024-11-14T_092915Z.root"
================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
================================================================================================================================
Full event 0.00659068 5.97343 2747.23 0.238985 97.9872 795
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read) 6.7402e-05 0.000187634 0.00196469 0.000121334 0.000197225 795
simulate:rns:RandomNumberSaver 1.7331e-05 2.92151e-05 0.00033224 2.6351e-05 1.35448e-05 795
simulate:largeant:larg4Main 0.00497528 0.47879 34.7995 0.209212 1.78296 795
simulate:IonAndScint:IonAndScint 7.8472e-05 0.00479978 2.35344 0.000124423 0.083833 795
simulate:elecDrift:SimDriftElectrons 3.6761e-05 0.051431 25.89 4.6831e-05 0.921868 795
simulate:PDFastSim:PDFastSimPAR 4.6582e-05 5.35459 2684.18 6.8412e-05 95.7144 795
[art]:TriggerResults:TriggerResultInserter 6.78e-06 1.10062e-05 4.7812e-05 9.44e-06 3.683e-06 795
end_path:out1:RootOutput 1.68e-06 3.7967e-06 2.23e-05 3.67e-06 1.08863e-06 795
end_path:out1:RootOutput(write) 0.000552616 0.0831498 6.22424 0.00377906 0.361738 794
================================================================================================================================
%MSG-i NuRandomService: RootOutput:out1@EndJob 15-Nov-2024 00:04:21 UTC ModuleEndJob
Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
algorithm version: EventTimestamp_v1
Configured value Last value ModuleLabel.InstanceName
(per event) 56695298 IonAndScint.ISCalcAlg
(per event) 244322826 PDFastSim.photon
(per event) 284352464 PDFastSim.scinttime
(per event) 373412406 elecDrift
(per event) 568976875 largeant
%MSG
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 10159.2 MB
Peak resident set size usage (VmHWM): 7187.54 MB
====================================================================================================
TrigReport ---------- Event summary -------------
TrigReport Events total = 795 passed = 795 failed = 0
TrigReport ---------- Modules in End-path ----------
TrigReport Run Success Error Name
TrigReport 795 795 0 out1
TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 4733.253027 Real = 4778.270703
MemReport ---------- Memory summary [base-10 MB] ------
MemReport VmPeak = 10159.2 VmHWM = 7187.54
%MSG-s ArtException: PostEndJob 15-Nov-2024 00:04:45 UTC ModuleEndJob
---- EventProcessorFailure BEGIN
EventProcessor: an exception occurred during current event processing
---- FatalRootError BEGIN
Fatal Root Error: TBufferFile::WriteByteCount
bytecount too large (more than 1073741822)
ROOT severity: 3000
---- FatalRootError END
---- EventProcessorFailure END
---- FatalRootError BEGIN
Fatal Root Error: TTree::SetEntries
Tree branches have different numbers of entries, eg EventAuxiliary has 794 entries while art::TriggerResults_TriggerResults__MUSUNGen. has 1000 entries.
ROOT severity: 2000
---- FatalRootError END
%MSG
Art has completed and will exit with status 1.
=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
main()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
mddict = expSpecificMetadata.getmetadata()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
jobt = self.get_job(proc)
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 1869056
-rw-r--r-- 1 dune001 dune 1911021178 Nov 15 00:04 RootOutput-c17d-6783-ad72-1cab.root
-rw-r--r-- 1 dune001 dune 2873555 Nov 15 00:04 000338_reco_data_2024-11-14T_092915Z_reco_2024-11-14T_224313Z.log
-rw-r--r-- 1 dune001 dune 7492 Nov 15 00:04 jobscript.log
-rw-r--r-- 1 dune001 dune 519 Nov 15 00:04 g4_hist.root
-rw-r--r-- 1 dune001 dune 104 Nov 14 22:43 all-input-dids.txt
-rw-r--r-- 1 dune001 dune 0 Nov 15 00:04 000338_reco_data_2024-11-14T_092915Z_reco_data_2024-11-14T_224313Z.root.ext.json
-rw-r--r-- 1 dune001 dune 0 Nov 15 00:04 000338_reco_data_2024-11-14T_092915Z_reco_data_2024-11-14T_224313Z.root.json