Jobsub ID 97529.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 97529.0@justin-prod-sched02.dune.hep.ac.uk |
Workflow ID | 4103 |
Stage ID | 1 |
User name | amoor@fnal.gov |
HTCondor Group | group_dune |
Requested | Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 80000 (22 hours) |
Submitted time | 2024-11-15 02:56:42 |
Site | US_FNAL-FermiGrid |
Entry | FNAL_GPGrid_ce04_mcore_op_duneonly |
Last heartbeat | 2024-11-15 06:23:28 |
From worker node | Hostname | dunegli-4216307-0-fnpc9032.fnal.gov |
cpuinfo | Intel(R) Xeon(R) CPU E5-2680 v4 @ 2.40GHz |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 8388608000 (8000 MiB) |
Wall seconds limit | 172800 (48 hours) |
Inner Apptainer? | True |
Job state | outputting_failed |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-15 02:59:52 |
Input files | usertests:000044_reco_data_2024-11-14T_092338Z.root |
Jobscript | Exit code | 1 |
Real time | 0m (0s) |
CPU time | 0m (0s = 0%) |
Outputting started | |
Output files | |
Finished | 2024-11-15 06:23:28 |
Jobscript log (last 10,000 characters)
AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72: 1126 Aborted (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries: larg4Main:largeant@BeginModule 15-Nov-2024 03:30:46 UTC run: 20000031 subRun: 0 event: 189 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService: larg4Main:largeant@BeginModule 15-Nov-2024 03:32:38 UTC run: 20000031 subRun: 0 event: 189
Not Stored Process summary:
compt : 2264906
Pair : 40
annihil : 109728
phot : 733527
Brem : 674065
conv : 109686
Ion : 290441
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 03:32:38 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 03:32:38 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction: larg4Main:largeant@BeginModule 15-Nov-2024 03:32:38 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService: IonAndScint:IonAndScint@BeginModule 15-Nov-2024 03:32:39 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 200029968
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService: SimDriftElectrons:elecDrift@BeginModule 15-Nov-2024 03:32:48 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'elecDrift': 809992814
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 15-Nov-2024 03:34:23 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.photon': 700470739
%MSG
%MSG-i NuRandomService: PDFastSimPAR:PDFastSim@BeginModule 15-Nov-2024 03:34:23 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.scinttime': 240459937
%MSG
IonAndScint endJob.
15-Nov-2024 06:22:20 UTC Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root"
================================================================================================================================
TimeTracker printout (sec) Min Avg Max Median RMS nEvts
================================================================================================================================
Full event 0.00924704 63.3778 10227.7 0.417766 746.438 189
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read) 5.9195e-05 0.0106831 0.676059 0.00013268 0.0661306 189
simulate:rns:RandomNumberSaver 1.5493e-05 2.79431e-05 0.000209418 2.3488e-05 1.6346e-05 189
simulate:largeant:larg4Main 0.00724528 1.15911 113.203 0.286249 8.26363 189
simulate:IonAndScint:IonAndScint 6.0583e-05 0.0582931 9.28456 0.000108625 0.677427 189
simulate:elecDrift:SimDriftElectrons 4.0298e-05 0.569369 94.4082 5.3048e-05 6.88059 189
simulate:PDFastSim:PDFastSimPAR 5.5142e-05 60.2743 10010.8 9.2265e-05 729.684 189
[art]:TriggerResults:TriggerResultInserter 7.095e-06 1.0996e-05 3.7379e-05 1.0067e-05 3.8234e-06 189
end_path:out1:RootOutput 1.303e-06 3.11487e-06 1.9621e-05 2.843e-06 1.99315e-06 189
end_path:out1:RootOutput(write) 0.000426906 1.30162 133.214 0.0087293 9.84685 188
================================================================================================================================
%MSG-i NuRandomService: RootOutput:out1@EndJob 15-Nov-2024 06:22:20 UTC ModuleEndJob
Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
algorithm version: EventTimestamp_v1
Configured value Last value ModuleLabel.InstanceName
(per event) 200029968 IonAndScint.ISCalcAlg
(per event) 700470739 PDFastSim.photon
(per event) 240459937 PDFastSim.scinttime
(per event) 809992814 elecDrift
(per event) 480316838 largeant
%MSG
====================================================================================================
MemoryTracker summary (base-10 MB units used)
Peak virtual memory usage (VmPeak) : 20035.8 MB
Peak resident set size usage (VmHWM): 18048.6 MB
====================================================================================================
TrigReport ---------- Event summary -------------
TrigReport Events total = 189 passed = 189 failed = 0
TrigReport ---------- Modules in End-path ----------
TrigReport Run Success Error Name
TrigReport 189 189 0 out1
TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 11743.348261 Real = 12063.082130
MemReport ---------- Memory summary [base-10 MB] ------
MemReport VmPeak = 20035.8 VmHWM = 18048.6
terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
what(): ---- FatalRootError BEGIN
Fatal Root Error: TBufferFile::AutoExpand
Request to expand to a negative size, likely due to an integer overflow: 0x80000022 for a max of 0x7ffffffe.
ROOT severity: 6000
---- FatalRootError END
=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
main()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
mddict = expSpecificMetadata.getmetadata()
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
jobt = self.get_job(proc)
File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 907948
-rw-r--r-- 1 dunegli fnalgrid 928978916 Nov 15 06:22 RootOutput-065c-cf99-0da7-1600.root
-rw-r--r-- 1 dunegli fnalgrid 734289 Nov 15 06:22 000044_reco_data_2024-11-14T_092338Z_reco_2024-11-15T_030005Z.log
-rw-r--r-- 1 dunegli fnalgrid 7749 Nov 15 06:22 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid 274 Nov 15 03:00 TFileService-e718-ff53-2283-08f8.root
-rw-r--r-- 1 dunegli fnalgrid 104 Nov 15 03:00 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid 0 Nov 15 06:22 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_030005Z.root.ext.json
-rw-r--r-- 1 dunegli fnalgrid 0 Nov 15 06:22 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_030005Z.root.json
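
Note on the numbers in the FatalRootError and MemoryTracker lines above: the following is a minimal Python sketch that only redoes the arithmetic on values copied from this log. The helper names (`as_signed_int32`, `summarize`) are hypothetical and not part of justIN, art, or ROOT, and the sketch does not claim to identify the root cause of the failure. It shows that the requested buffer size 0x80000022 lies just above the reported maximum 0x7ffffffe and is negative when read as a signed 32-bit integer, which is consistent with the "negative size" wording of the error, and that the reported peak resident set size (VmHWM) is more than twice the requested RSS limit.

```python
# Values copied verbatim from the log above.
REQUESTED_BUFFER = 0x80000022      # size in the TBufferFile::AutoExpand message
MAX_BUFFER = 0x7FFFFFFE            # maximum reported in the same message
REQUESTED_RSS_BYTES = 8388608000   # "RSS bytes" requested for the job (8000 MiB)
PEAK_RSS_MB = 18048.6              # MemoryTracker VmHWM, base-10 MB

INT32_MAX = 2**31 - 1


def as_signed_int32(value: int) -> int:
    """Reinterpret the low 32 bits of value as a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 2**32 if value > INT32_MAX else value


def summarize() -> dict:
    """Collect the derived quantities in one place (hypothetical helper)."""
    peak_rss_bytes = PEAK_RSS_MB * 1_000_000  # base-10 MB, as stated by MemoryTracker
    return {
        "requested_buffer_bytes": REQUESTED_BUFFER,
        "bytes_over_maximum": REQUESTED_BUFFER - MAX_BUFFER,
        "requested_as_int32": as_signed_int32(REQUESTED_BUFFER),
        "requested_rss_mib": REQUESTED_RSS_BYTES / 2**20,
        "peak_to_requested_rss": peak_rss_bytes / REQUESTED_RSS_BYTES,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value}")
```

Running the script prints one derived quantity per line. The ratio of peak to requested RSS mixes the base-10 MB used by MemoryTracker with the byte count from the job request, so it is an approximate comparison only.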