justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 289756.44@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID289756.44@justin-prod-sched01.dune.hep.ac.uk
Workflow ID4103
Stage ID1
User nameamoor@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2024-11-14 14:44:18
SiteUS_FNAL-T1
EntryCMSHTPC_T1_US_FNAL_condce_opp1_whole
Last heartbeat2024-11-14 22:59:26
From worker nodeHostnamedunegli-36753-0-cmswn2153.fnal.gov
cpuinfoAMD Opteron(tm) Processor 6376
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes8388608000 (8000 MiB)
Wall seconds limit171000 (47 hours)
Inner Apptainer?True
Job stateoutputting_failed
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-11-14 16:35:36
Input filesusertests:000044_reco_data_2024-11-14T_092338Z.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-11-14 22:59:26
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72:  1126 Aborted                 (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries:  larg4Main:largeant@BeginModule  14-Nov-2024 17:32:32 UTC run: 20000031 subRun: 0 event: 189 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService:  larg4Main:largeant@BeginModule  14-Nov-2024 17:37:12 UTC run: 20000031 subRun: 0 event: 189
Not Stored Process summary:
	compt : 2264906
	Pair : 40
	annihil : 109728
	phot : 733527
	Brem : 674065
	conv : 109686
	Ion : 290441
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 17:37:12 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 17:37:12 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  14-Nov-2024 17:37:12 UTC run: 20000031 subRun: 0 event: 189 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  14-Nov-2024 17:37:13 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 200029968
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  14-Nov-2024 17:37:31 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'elecDrift': 809992814
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  14-Nov-2024 17:40:47 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.photon': 700470739
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  14-Nov-2024 17:40:47 UTC run: 20000031 subRun: 0 event: 189
Random seed for this event, engine 'PDFastSim.scinttime': 240459937
%MSG
IonAndScint endJob.
14-Nov-2024 22:58:06 UTC  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/af/40/000044_reco_data_2024-11-14T_092338Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                     0.0147495      118.769       19237.5      0.589531       1404.41        189    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        0.000130789   0.000329696    0.0144218    0.000192395   0.00104208       189    
simulate:rns:RandomNumberSaver                2.4314e-05    4.44891e-05   0.000360928   4.0184e-05    2.64924e-05      189    
simulate:largeant:larg4Main                    0.0123835      2.52509       280.823      0.521468       20.4511        189    
simulate:IonAndScint:IonAndScint              0.000135083    0.108784       18.1279     0.000210821     1.32064        189    
simulate:elecDrift:SimDriftElectrons          5.8615e-05      1.18306       195.478     8.2776e-05      14.2498        189    
simulate:PDFastSim:PDFastSimPAR               8.2295e-05      114.622       18743.1     0.000117738     1368.11        189    
[art]:TriggerResults:TriggerResultInserter    1.1706e-05    2.01441e-05   0.000154316   1.8067e-05    1.22493e-05      189    
end_path:out1:RootOutput                       2.463e-06    5.70866e-06   2.7611e-05     5.087e-06    2.65018e-06      189    
end_path:out1:RootOutput(write)               0.000784847    0.330304       40.2814     0.00600605      2.9466         188    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 14-Nov-2024 22:58:06 UTC  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)           200029968   IonAndScint.ISCalcAlg
        (per event)           700470739   PDFastSim.photon
        (per event)           240459937   PDFastSim.scinttime
        (per event)           809992814   elecDrift
        (per event)           480316838   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 20035.8 MB
  Peak resident set size usage (VmHWM): 17697.9 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 189 passed = 189 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        189        189          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 22307.542907 Real = 22775.166842

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 20035.8 VmHWM = 17697.9

terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
  what():  ---- FatalRootError BEGIN
  Fatal Root Error: TBufferFile::AutoExpand
  Request to expand to a negative size, likely due to an integer overflow: 0x80000022 for a max of 0x7ffffffe.
  ROOT severity: 6000
---- FatalRootError END

=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 908840
-rw-r--r-- 1 dunegli fnalgrid 928978911 Nov 14 22:58 RootOutput-868a-4586-9526-c4ae.root
-rw-r--r-- 1 dunegli fnalgrid    734289 Nov 14 22:58 000044_reco_data_2024-11-14T_092338Z_reco_2024-11-14T_163542Z.log
-rw-r--r-- 1 dunegli fnalgrid      7749 Nov 14 22:58 jobscript.log
-rw-r--r-- 1 dunegli fnalgrid       274 Nov 14 16:35 TFileService-9624-9d2b-53f4-db4b.root
-rw-r--r-- 1 dunegli fnalgrid       104 Nov 14 16:35 all-input-dids.txt
-rw-r--r-- 1 dunegli fnalgrid         0 Nov 14 22:58 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-14T_163542Z.root.ext.json
-rw-r--r-- 1 dunegli fnalgrid         0 Nov 14 22:58 000044_reco_data_2024-11-14T_092338Z_reco_data_2024-11-14T_163542Z.root.json
justIN time: 2024-11-17 14:39:55 UTC       justIN version: 01.01.09