Jobsub ID 97052.69@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 97052.69@justin-prod-sched02.dune.hep.ac.uk |
Workflow ID | 4103 |
Stage ID | 1 |
User name | amoor@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 80000 (22 hours) |
Submitted time | 2024-11-14 14:45:21 |
Site | US_FNAL-FermiGrid |
Entry | FNAL_GPGrid_ce04_mcore_op_duneonly |
Last heartbeat | 2024-11-14 21:57:55 |
From worker node | Hostname | dunegli-4216118-0-fnpc19106.fnal.gov |
cpuinfo | AMD EPYC 7502 32-Core Processor |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 172800 (48 hours) |
Inner Apptainer? | True |
Job state | jobscript_error |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-14 17:16:06 |
Input files | usertests:000194_reco_data_2024-11-14T_092713Z.root
|
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Outputting started | |
Output files | |
Finished | 2024-11-14 21:57:55 |
Saved logs | justin-logs:97052.69-justin-prod-sched02.dune.hep.ac.uk.logs.tgz |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72: 1129 Aborted (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries: larg4Main:largeant@BeginModule 14-Nov-2024 17:43:28 UTC run: 20000031 subRun: 0 event: 721 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService: larg4Main:largeant@BeginModule 14-Nov-2024 17:46:35 UTC run: 20000031 subRun: 0 event: 721
Not Stored Process summary:
compt : 4308273
annihil : 199212
Pair : 104
phot : 1311068
Brem : 1202872
conv : 199100
Ion : 483457
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 17:46:35 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 17:46:35 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 14-Nov-2024 17:46:35 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService: IonAndScint:IonAndScint@BeginModule 14-Nov-2024 17:46:35 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 647198573
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService: SimDriftElectrons:elecDrift@BeginModule 14-Nov-2024 17:46:50 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'elecDrift': 184186539
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 14-Nov-2024 17:49:03 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.photon': 47091852
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 14-Nov-2024 17:49:03 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.scinttime': 773342013
%MSG
IonAndScint endJob.
14-Nov-2024 21:57:31 UTC Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root"
================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
================================================================================================================================
Full event 0.00750244 23.2189 15215.7 0.267913 566.237 721
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read) 9.375e-05 0.00022364 0.00349838 0.000164071 0.000243386 721
simulate:rns:RandomNumberSaver 1.923e-05 3.37344e-05 0.000292382 3.228e-05 1.50192e-05 721
simulate:largeant:larg4Main 0.00476834 0.681264 186.753 0.240263 7.0225 721
simulate:IonAndScint:IonAndScint 0.000108451 0.0217425 14.4728 0.000143272 0.538598 721
simulate:elecDrift:SimDriftElectrons 3.371e-05 0.201258 133.303 4.266e-05 4.96093 721
simulate:PDFastSim:PDFastSimPAR 5.257e-05 22.2406 14881.1 6.4161e-05 553.801 721
[art]:TriggerResults:TriggerResultInserter 8.82e-06 1.18882e-05 4.824e-05 1.064e-05 3.73318e-06 721
end_path:out1:RootOutput 2.41e-06 4.64545e-06 0.000191031 3.87e-06 9.10917e-06 721
end_path:out1:RootOutput(write) 0.000634904 0.0732695 4.32744 0.00401608 0.247501 720
================================================================================================================================
%MSG-i NuRandomService: RootOutput:out1@EndJob 14-Nov-2024 21:57:31 UTC ModuleEndJob
Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
algorithm version: EventTimestamp_v1
Configured value Last value ModuleLabel.InstanceName
(per event) 647198573 IonAndScint.ISCalcAlg
(per event) 47091852 PDFastSim.photon
(per event) 773342013 PDFastSim.scinttime
(per event) 184186539 elecDrift
(per event) 777949297 largeant
%MSG
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 28042.6 MB
Peak resident set size usage (VmHWM): 21239.9 MB
====================================================================================================
TrigReport ---------- Event summary -------------
TrigReport Events total = 721 passed = 721 failed = 0
TrigReport ---------- Modules in End-path ----------
TrigReport Run Success Error Name
TrigReport 721 721 0 out1
TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 16634.744476 Real = 16788.862277
MemReport ---------- Memory summary [base-10 MB] ------
MemReport VmPeak = 28042.6 VmHWM = 21239.9
terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
what(): ---- FatalRootError BEGIN
Fatal Root Error: TBufferFile::AutoExpand
Request to expand to a negative size, likely due to an integer overflow: 0x8000002e for a max of 0x7ffffffe.
ROOT severity: 6000
---- FatalRootError END
=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
main()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
mddict = expSpecificMetadata.getmetadata()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
jobt = self.get_job(proc)
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 1116684
-rw-r--r-- 1 dunegli fnalgrid 1140843484 Nov 14 21:57 RootOutput-b690-30ec-e6a2-1c6b.root
-rw-r--r-- 1 dunegli fnalgrid 2612135 Nov 14 21:57 000194_reco_data_2024-11-14T_092713Z_reco_2024-11-14T_171610Z.log
-rw-r--r-- 1 dunegli fnalgrid 7751 Nov 14 21:57 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid 274 Nov 14 17:16 TFileService-108d-362a-d537-4e16.root
-rw-r--r-- 1 dunegli fnalgrid 104 Nov 14 17:16 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid 0 Nov 14 21:57 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-14T_171610Z.root.ext.json
-rw-r--r-- 1 dunegli fnalgrid 0 Nov 14 21:57 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-14T_171610Z.root.json