justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 289902.0@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID289902.0@justin-prod-sched01.dune.hep.ac.uk
Workflow ID4103
Stage ID1
User nameamoor@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2024-11-14 21:58:08
SiteUS_FNAL-FermiGrid
EntryFNAL_GPGrid_ce04_mcore_op_duneonly
Last heartbeat2024-11-15 02:16:03
From worker nodeHostnamedunegli-4216308-0-fnpc23043.fnal.gov
cpuinfoAMD EPYC 7543 32-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit172800 (48 hours)
Inner Apptainer?True
Job stateoutputting_failed
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-11-14 21:59:56
Input filesusertests:000194_reco_data_2024-11-14T_092713Z.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-11-15 02:16:03
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72:  1128 Aborted                 (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries:  larg4Main:largeant@BeginModule  14-Nov-2024 22:33:07 UTC run: 20000031 subRun: 0 event: 721 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService:  larg4Main:largeant@BeginModule  14-Nov-2024 22:36:15 UTC run: 20000031 subRun: 0 event: 721
Not Stored Process summary:
	compt : 4308273
	annihil : 199212
	Pair : 104
	phot : 1311068
	Brem : 1202872
	conv : 199100
	Ion : 483457
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 22:36:15 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 22:36:15 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 22:36:15 UTC run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  14-Nov-2024 22:36:15 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 647198573
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  14-Nov-2024 22:36:25 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'elecDrift': 184186539
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  14-Nov-2024 22:38:10 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.photon': 47091852
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  14-Nov-2024 22:38:10 UTC run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.scinttime': 773342013
%MSG
IonAndScint endJob.
15-Nov-2024 02:15:43 UTC  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.0063957      21.1257       13335.1      0.333683       496.264        721    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        5.4043e-05    0.000851898    0.204511     0.000138833   0.00827971       721    
simulate:rns:RandomNumberSaver                1.2603e-05    3.29376e-05   0.000315147   2.7913e-05    1.57354e-05      721    
simulate:largeant:larg4Main                   0.00512001     0.713048       188.347      0.249852       7.09258        721    
simulate:IonAndScint:IonAndScint              5.9984e-05     0.0202428      9.65015     0.000171725    0.360835        721    
simulate:elecDrift:SimDriftElectrons          2.6309e-05     0.161406       104.912     5.5075e-05      3.90442        721    
simulate:PDFastSim:PDFastSimPAR               3.3283e-05      19.625        13032.1      6.869e-05      484.997        721    
[art]:TriggerResults:TriggerResultInserter     5.02e-06     1.3405e-05    4.4685e-05    1.2734e-05    4.40643e-06      721    
end_path:out1:RootOutput                       1.192e-06    5.06725e-06   1.9277e-05     4.989e-06     1.683e-06       721    
end_path:out1:RootOutput(write)               0.000471854    0.600635       23.9037      0.0055423      2.52283        720    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 15-Nov-2024 02:15:43 UTC  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)           647198573   IonAndScint.ISCalcAlg
        (per event)            47091852   PDFastSim.photon
        (per event)           773342013   PDFastSim.scinttime
        (per event)           184186539   elecDrift
        (per event)           777949297   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 28040.7 MB
  Peak resident set size usage (VmHWM): 21559.4 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 721 passed = 721 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        721        721          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 14697.646716 Real = 15276.324855

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 28040.7 VmHWM = 21559.4

terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
  what():  ---- FatalRootError BEGIN
  Fatal Root Error: TBufferFile::AutoExpand
  Request to expand to a negative size, likely due to an integer overflow: 0x8000002e for a max of 0x7ffffffe.
  ROOT severity: 6000
---- FatalRootError END

=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 1117772
-rw-r--r-- 1 dunegli fnalgrid 1140843482 Nov 15 02:15 RootOutput-580e-ee59-a805-c3a3.root
-rw-r--r-- 1 dunegli fnalgrid    2612135 Nov 15 02:15 000194_reco_data_2024-11-14T_092713Z_reco_2024-11-14T_220000Z.log
-rw-r--r-- 1 dunegli fnalgrid       7751 Nov 15 02:15 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid        274 Nov 14 22:00 TFileService-ff29-4ff5-a3ae-b663.root
-rw-r--r-- 1 dunegli fnalgrid        104 Nov 14 21:59 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid          0 Nov 15 02:15 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-14T_220000Z.root.ext.json
-rw-r--r-- 1 dunegli fnalgrid          0 Nov 15 02:15 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-14T_220000Z.root.json
justIN time: 2024-11-17 14:44:05 UTC       justIN version: 01.01.09