
Jobsub ID 97805.0@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID: 97805.0@justin-prod-sched02.dune.hep.ac.uk
Workflow ID: 4103
Stage ID: 1
User name: amoor@fnal.gov
HTCondor Group: group_dune
Requested:
  Processors: 1
  RSS bytes: 8388608000 (8000 MiB)
  Wall seconds limit: 80000 (22 hours)
Submitted time: 2024-11-15 12:20:31
Site: US_UChicago
Entry: Engage_US_MWT2_uiuc_condce_mcore
Last heartbeat: 2024-11-15 17:26:45
From worker node:
  Hostname: uct2-c606.mwt2.org
  cpuinfo: AMD EPYC 7302 16-Core Processor
  OS release: Scientific Linux release 7.9 (Nitrogen)
  Processors: 1
  RSS bytes: 8388608000 (8000 MiB)
  Wall seconds limit: 86400 (24 hours)
  Inner Apptainer?: True
Job state: outputting_failed
Allocator name: justin-allocator-pro.dune.hep.ac.uk
Started: 2024-11-15 12:22:40
Input files: usertests:000194_reco_data_2024-11-14T_092713Z.root
Jobscript:
  Exit code: 1
  Real time: 0m (0s)
  CPU time: 0m (0s = 0%)
Outputting started:
Output files:
Finished: 2024-11-15 17:26:45

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72:  1127 Aborted                 (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG-i generatePrimaries:  larg4Main:largeant@BeginModule  15-Nov-2024 06:57:39 CST run: 20000031 subRun: 0 event: 721 MCTruthEventAction.cc:112
Generating 1 particles
%MSG
%MSG-i ParticleListActionService:  larg4Main:largeant@BeginModule  15-Nov-2024 07:01:20 CST run: 20000031 subRun: 0 event: 721
Not Stored Process summary:
	compt : 4308273
	annihil : 199212
	Pair : 104
	phot : 1311068
	Brem : 1202872
	conv : 199100
	Ion : 483457
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 07:01:20 CST run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 07:01:20 CST run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  15-Nov-2024 07:01:20 CST run: 20000031 subRun: 0 event: 721 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  15-Nov-2024 07:01:21 CST run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 647198573
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  15-Nov-2024 07:01:38 CST run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'elecDrift': 184186539
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 07:04:11 CST run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.photon': 47091852
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  15-Nov-2024 07:04:11 CST run: 20000031 subRun: 0 event: 721
Random seed for this event, engine 'PDFastSim.scinttime': 773342013
%MSG
IonAndScint endJob.
15-Nov-2024 11:25:52 CST  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/c4/b3/000194_reco_data_2024-11-14T_092713Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00979751      25.0037       16075.8      0.339948       598.253        721    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        8.7234e-05    0.000274407   0.00705037    0.00015976    0.000531629      721    
simulate:rns:RandomNumberSaver                1.8324e-05    3.23542e-05   0.000267914   2.9887e-05    1.48571e-05      721    
simulate:largeant:larg4Main                   0.00580403     0.838919       222.292      0.294766       8.37304        721    
simulate:IonAndScint:IonAndScint              0.000108755    0.0260417      17.2865     0.000155022    0.643313        721    
simulate:elecDrift:SimDriftElectrons          3.4675e-05     0.232324       152.648     5.0355e-05      5.68092        721    
simulate:PDFastSim:PDFastSimPAR               4.6448e-05      23.8156       15683.5     7.1545e-05      583.673        721    
[art]:TriggerResults:TriggerResultInserter     7.795e-06     1.231e-05    5.5525e-05     1.105e-05    4.33722e-06      721    
end_path:out1:RootOutput                       2.615e-06    4.45905e-06   0.000227067    3.587e-06    9.07503e-06      721    
end_path:out1:RootOutput(write)               0.000716799    0.0899794      4.65825     0.00557205     0.289367        720    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 15-Nov-2024 11:25:52 CST  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)           647198573   IonAndScint.ISCalcAlg
        (per event)            47091852   PDFastSim.photon
        (per event)           773342013   PDFastSim.scinttime
        (per event)           184186539   elecDrift
        (per event)           777949297   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 28036.6 MB
  Peak resident set size usage (VmHWM): 21637.2 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 721 passed = 721 failed = 0

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport        721        721          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 17646.067766 Real = 18066.501629

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 28036.6 VmHWM = 21637.2

terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
  what():  ---- FatalRootError BEGIN
  Fatal Root Error: TBufferFile::AutoExpand
  Request to expand to a negative size, likely due to an integer overflow: 0x8000002e for a max of 0x7ffffffe.
  ROOT severity: 6000
---- FatalRootError END

=== End last 100 lines of lar log file ===
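
Note on the FatalRootError above: the two hexadecimal values in the TBufferFile::AutoExpand message can be checked with a little arithmetic. The sketch below is illustrative only. It assumes the requested size is held in a signed 32-bit integer, which the log's "negative size" wording suggests but does not state outright; the helper name `as_int32` is hypothetical and is not part of ROOT or art.

```python
import struct

# Values copied from the FatalRootError message in the log.
requested = 0x8000002E  # size the buffer was asked to expand to
maximum = 0x7FFFFFFE    # maximum size reported by the message


def as_int32(value):
    """Reinterpret the low 32 bits of value as a signed 32-bit integer.

    Hypothetical helper for illustration; assumes a 32-bit signed size field.
    """
    return struct.unpack("<i", struct.pack("<I", value & 0xFFFFFFFF))[0]


overshoot = requested - maximum

print(f"requested (unsigned): {requested} bytes")
print(f"requested (as int32): {as_int32(requested)}")
print(f"maximum:              {maximum} bytes")
print(f"overshoot:            {overshoot} bytes")
```

Under that assumption, the request is 48 bytes above the reported maximum of just under 2 GiB, and it reads as negative once interpreted as a signed 32-bit value, which is consistent with the "negative size" wording in the message.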
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 1116684
-rw-r--r--. 1 dune osgvo 1140843484 Nov 15 11:25 RootOutput-c24e-563a-5a22-7850.root
-rw-r--r--. 1 dune osgvo    2612135 Nov 15 11:25 000194_reco_data_2024-11-14T_092713Z_reco_2024-11-15T_122249Z.log
-rw-r--r--. 1 dune osgvo       7751 Nov 15 11:26 jobscript.log
-rw-r--r--. 1 dune osgvo        274 Nov 15 06:23 TFileService-b31b-d227-8083-e20a.root
-rw-r--r--. 1 dune osgvo        104 Nov 15 06:22 all-input-dids.txt
-rw-r--r--. 1 dune osgvo          0 Nov 15 11:25 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-15T_122249Z.root.ext.json
-rw-r--r--. 1 dune osgvo          0 Nov 15 11:26 000194_reco_data_2024-11-14T_092713Z_reco_data_2024-11-15T_122249Z.root.json
justIN time: 2024-11-17 08:30:01 UTC       justIN version: 01.01.09