
Jobsub ID 290666.1@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID             290666.1@justin-prod-sched01.dune.hep.ac.uk
Workflow ID           4103
Stage ID              1
User name             amoor@fnal.gov
HTCondor Group        group_dune
Requested
  Processors          1
  RSS bytes           8388608000 (8000 MiB)
  Wall seconds limit  80000 (22 hours)
Submitted time        2024-11-15 14:36:53
Site                  US_UChicago
Entry                 Engage_US_MWT2_uiuc_gk02_condce_mcore
Last heartbeat        2024-11-15 18:28:08
From worker node
  Hostname            uct2-c641.mwt2.org
  cpuinfo             AMD EPYC 7302 16-Core Processor
  OS release          Scientific Linux release 7.9 (Nitrogen)
  Processors          1
  RSS bytes           8388608000 (8000 MiB)
  Wall seconds limit  86400 (24 hours)
  Inner Apptainer?    True
Job state             outputting_failed
Allocator name        justin-allocator-pro.dune.hep.ac.uk
Started               2024-11-15 14:39:16
Input files           usertests:000044_reco_data_2024-11-14T_092338Z.root
Jobscript
  Exit code           1
  Real time           0m (0s)
  CPU time            0m (0s = 0%)
Outputting started
Output files
Finished              2024-11-15 18:28:08
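
The parenthesised values in the summary above (8000 MiB, 22 hours, 24 hours) are unit conversions of the raw byte and second counts. The following is a minimal sketch of those conversions, not part of justIN; the helper names are hypothetical. It also expresses the RSS request in base-10 MB, the unit used by the MemoryTracker summary in the jobscript log below, so the two figures can be compared directly. Whether the difference is related to the job failure is not stated on this page.

```python
# Hypothetical helpers; the names are illustrative and not taken from justIN.

def bytes_to_mib(n_bytes: int) -> float:
    """Convert bytes to MiB (1 MiB = 2**20 bytes)."""
    return n_bytes / 2**20


def bytes_to_decimal_mb(n_bytes: int) -> float:
    """Convert bytes to base-10 MB (1 MB = 10**6 bytes)."""
    return n_bytes / 10**6


def seconds_to_hours(seconds: int) -> float:
    """Convert seconds to hours."""
    return seconds / 3600


# Values copied from the summary table and the MemoryTracker summary.
requested_rss_bytes = 8388608000
requested_wall_seconds = 80000
worker_wall_seconds = 86400
peak_rss_decimal_mb = 17921.5  # VmHWM, reported in base-10 MB

requested_rss_mib = bytes_to_mib(requested_rss_bytes)
requested_rss_mb = bytes_to_decimal_mb(requested_rss_bytes)
ratio = peak_rss_decimal_mb / requested_rss_mb

print(f"Requested RSS: {requested_rss_mib:.0f} MiB = {requested_rss_mb:.3f} MB")
print(f"Requested wall limit: {seconds_to_hours(requested_wall_seconds):.1f} h")
print(f"Worker wall limit: {seconds_to_hours(worker_wall_seconds):.1f} h")
print(f"Peak RSS / requested RSS: {ratio:.2f}")
```

With these numbers the reported peak resident set size is roughly 2.1 times the requested RSS.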

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72:  1126 Aborted                 (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries:  larg4Main:largeant@BeginModule  15-Nov-2024 09:15:00 CST run: 20000031 subRun: 0 event: 189 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService:  larg4Main:largeant@BeginModule  15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189
Not Stored Process summary:
	compt : 2264906
	Pair : 40
	annihil : 109728
	phot : 733527
	Brem : 674065
	conv : 109686
	Ion : 290441
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 09:17:15 CST run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  15-Nov-2024 09:17:16 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 200029968
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  15-Nov-2024 09:17:26 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'elecDrift': 809992814
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 09:19:15 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.photon': 700470739
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 09:19:15 CST run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.scinttime': 240459937
%MSG
IonAndScint endJob.
15-Nov-2024 12:27:28 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.0089442      71.4918       11530.6      0.306073       842.305        189    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        8.8667e-05    0.000276039   0.00391642    0.000192111   0.000401706      189    
simulate:rns:RandomNumberSaver                1.6641e-05    3.63836e-05   0.000369914   3.3143e-05    2.63224e-05      189    
simulate:largeant:larg4Main                   0.00676062      1.25236       136.214       0.27889       9.92486        189    
simulate:IonAndScint:IonAndScint               7.993e-05     0.0626243      10.189      0.00016505      0.74379        189    
simulate:elecDrift:SimDriftElectrons           3.729e-05     0.666398       109.514      5.331e-05      7.98918        189    
simulate:PDFastSim:PDFastSimPAR                4.235e-05      69.2555       11274.7     7.4279e-05      823.41         189    
[art]:TriggerResults:TriggerResultInserter     6.231e-06    1.31315e-05   4.3922e-05    1.1892e-05    4.19859e-06      189    
end_path:out1:RootOutput                       1.593e-06    4.23597e-06    2.097e-05     3.877e-06    1.62731e-06      189    
end_path:out1:RootOutput(write)               0.000691649    0.255202       30.6022     0.00580222      2.24022        188    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 15-Nov-2024 12:27:28 CST  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)           200029968   IonAndScint.ISCalcAlg
        (per event)           700470739   PDFastSim.photon
        (per event)           240459937   PDFastSim.scinttime
        (per event)           809992814   elecDrift
        (per event)           480316838   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 20029.8 MB
  Peak resident set size usage (VmHWM): 17921.5 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 189 passed = 189 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        189        189          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 13243.663732 Real = 13547.118907

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 20029.8 VmHWM = 17921.5

terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
  what():  ---- FatalRootError BEGIN
  Fatal Root Error: TBufferFile::AutoExpand
  Request to expand to a negative size, likely due to an integer overflow: 0x80000022 for a max of 0x7ffffffe.
  ROOT severity: 6000
---- FatalRootError END

=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 907952
-rw-r--r--. 1 dune osgvo 928978914 Nov 15 12:27 RootOutput-024e-b841-9f88-706f.root
-rw-r--r--. 1 dune osgvo    734289 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_2024-11-15T_143929Z.log
-rw-r--r--. 1 dune osgvo      7749 Nov 15 12:27 jobscript.log
-rw-r--r--. 1 dune osgvo       274 Nov 15 08:40 TFileService-15f8-c87a-5646-e7fe.root
-rw-r--r--. 1 dune osgvo       104 Nov 15 08:39 all-input-dids.txt
-rw-r--r--. 1 dune osgvo         0 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_143929Z.root.ext.json
-rw-r--r--. 1 dune osgvo         0 Nov 15 12:27 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-15T_143929Z.root.json
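
The FatalRootError in the lar log above reports a request to expand a buffer to 0x80000022 against a maximum of 0x7ffffffe, and describes the result as a negative size. The sketch below only illustrates the arithmetic behind that message. It assumes the size is held in a signed 32-bit integer, which is what the message suggests but is not confirmed here; the helper name is hypothetical and this is not ROOT code.

```python
# Illustration of the arithmetic in the error message only.
# Assumption: the buffer size is stored as a signed 32-bit integer.

INT32_MAX = 2**31 - 1  # 0x7fffffff


def as_signed_int32(value: int) -> int:
    """Reinterpret the low 32 bits of value as a signed 32-bit integer."""
    value &= 0xFFFFFFFF
    return value - 2**32 if value >= 2**31 else value


requested = 0x80000022  # size requested, from the log
limit = 0x7FFFFFFE      # maximum size, from the log

print(f"requested = {requested} bytes")
print(f"limit     = {limit} bytes")
print(f"over limit by {requested - limit} bytes")
print(f"as signed 32-bit: {as_signed_int32(requested)}")
```

The requested size is 36 bytes above the reported maximum, and its bit pattern reads as a negative number when interpreted as a signed 32-bit value.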
justIN time: 2024-11-17 08:15:10 UTC       justIN version: 01.01.09