Jobsub ID 290666.1@justin-prod-sched01.dune.hep.ac.uk
Jobsub ID | 290666.1@justin-prod-sched01.dune.hep.ac.uk |
Workflow ID | 4103 |
Stage ID | 1 |
User name | amoor@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 80000 (22 hours) |
Submitted time | 2024-11-15 14:36:53 |
Site | US_UChicago |
Entry | Engage_US_MWT2_uiuc_gk02_condce_mcore |
Last heartbeat | 2024-11-15 18:28:08 |
From worker node | Hostname | uct2-c641.mwt2.org |
cpuinfo | AMD EPYC 7302 16-Core Processor |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 86400 (24 hours) |
Inner Apptainer? | True |
Job state | outputting_failed |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-15 14:39:16 |
Input files | usertests:000044_reco_data_2024-11-14T_092338Z.root
|
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Outputting started | |
Output files | |
Finished | 2024-11-15 18:28:08 |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72: 1126 Aborted (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries: larg4Main:largeant@BeginModule 15-Nov-2024 09:15:00 CST run: 20000031 subRun: 0 event: 189 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService: larg4Main:largeant@BeginModule 15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189
Not Stored Process summary:
compt : 2264906
Pair : 40
annihil : 109728
phot : 733527
Brem : 674065
conv : 109686
Ion : 290441
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService: IonAndScint:IonAndScint@BeginModule 15-Nov-2024 09:17:16 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 200029968
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService: SimDriftElectrons:elecDrift@BeginModule 15-Nov-2024 09:17:26 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'elecDrift': 809992814
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 15-Nov-2024 09:19:15 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.photon': 700470739
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 15-Nov-2024 09:19:15 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.scinttime': 240459937
%MSG
IonAndScint endJob.
15-Nov-2024 12:27:28 CST Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root"
================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
================================================================================================================================
Full event 0.0089442 71.4918 11530.6 0.306073 842.305 189
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read) 8.8667e-05 0.000276039 0.00391642 0.000192111 0.000401706 189
simulate:rns:RandomNumberSaver 1.6641e-05 3.63836e-05 0.000369914 3.3143e-05 2.63224e-05 189
simulate:largeant:larg4Main 0.00676062 1.25236 136.214 0.27889 9.92486 189
simulate:IonAndScint:IonAndScint 7.993e-05 0.0626243 10.189 0.00016505 0.74379 189
simulate:elecDrift:SimDriftElectrons 3.729e-05 0.666398 109.514 5.331e-05 7.98918 189
simulate:PDFastSim:PDFastSimPAR 4.235e-05 69.2555 11274.7 7.4279e-05 823.41 189
[art]:TriggerResults:TriggerResultInserter 6.231e-06 1.31315e-05 4.3922e-05 1.1892e-05 4.19859e-06 189
end_path:out1:RootOutput 1.593e-06 4.23597e-06 2.097e-05 3.877e-06 1.62731e-06 189
end_path:out1:RootOutput(write) 0.000691649 0.255202 30.6022 0.00580222 2.24022 188
================================================================================================================================
%MSG-i NuRandomService: RootOutput:out1@EndJob 15-Nov-2024 12:27:28 CST ModuleEndJob
Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
algorithm version: EventTimestamp_v1
Configured value Last value ModuleLabel.InstanceName
(per event) 200029968 IonAndScint.ISCalcAlg
(per event) 700470739 PDFastSim.photon
(per event) 240459937 PDFastSim.scinttime
(per event) 809992814 elecDrift
(per event) 480316838 largeant
%MSG
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 20029.8 MB
Peak resident set size usage (VmHWM): 17921.5 MB
====================================================================================================
TrigReport ---------- Event summary -------------
TrigReport Events total = 189 passed = 189 failed = 0
TrigReport ---------- Modules in End-path ----------
TrigReport Run Success Error Name
TrigReport 189 189 0 out1
TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 13243.663732 Real = 13547.118907
MemReport ---------- Memory summary [base-10 MB] ------
MemReport VmPeak = 20029.8 VmHWM = 17921.5
terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
what(): ---- FatalRootError BEGIN
Fatal Root Error: TBufferFile::AutoExpand
Request to expand to a negative size, likely due to an integer overflow: 0x80000022 for a max of 0x7ffffffe.
ROOT severity: 6000
---- FatalRootError END
=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
main()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
mddict = expSpecificMetadata.getmetadata()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
jobt = self.get_job(proc)
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 907952
-rw-r--r--. 1 dune osgvo 928978914 Nov 15 12:27 RootOutput-024e-b841-9f88-706f.root
-rw-r--r--. 1 dune osgvo 734289 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_2024-11-15T_143929Z.log
-rw-r--r--. 1 dune osgvo 7749 Nov 15 12:27 jobscript.log
-rw-r--r--. 1 dune osgvo 274 Nov 15 08:40 TFileService-15f8-c87a-5646-e7fe.root
-rw-r--r--. 1 dune osgvo 104 Nov 15 08:39 all-input-dids.txt
-rw-r--r--. 1 dune osgvo 0 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_143929Z.root.ext.json
-rw-r--r--. 1 dune osgvo 0 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_143929Z.root.json