justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 245179.1@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID245179.1@justin-prod-sched01.dune.hep.ac.uk
Workflow ID2869
Stage ID1
User nameamoor@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2024-08-20 00:40:14
SiteUS_Colorado
EntryCMSHTPC_T3_US_Colorado_heposg01-colorado
Last heartbeat2024-08-20 02:58:42
From worker nodeHostnamelnxfarm338.colorado.edu
cpuinfoIntel(R) Core(TM) i9-7920X CPU @ 2.90GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit171000 (47 hours)
Inner Apptainer?True
Job statejobscript_error
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-08-20 00:41:45
Input filesusertests:000431_reco_data_2024-08-16T_162313Z.root
JobscriptExit code1
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-08-20 02:58:42
Saved logsjustin-logs:245179.1-justin-prod-sched01.dune.hep.ac.uk.logs.tgz
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

AFM g4 jobscript.
Input PFN = root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/bb/8a/000431_reco_data_2024-08-16T_162313Z.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
/cvmfs/larsoft.opensciencegrid.org/products/xrootd/v5_4_3b/Linux64bit+3.10-2.17-e20-p3913-prof/lib/libXrdPosixPreload.so
../justin-jobscript: line 72:  1280 Aborted                 (core dumped) lar -c $FCL_FILE $events_option -o $outFile "$pfn" > ${fname}_reco_${now}.log 2>&1
=== Start last 100 lines of lar log file ===
%MSG
%MSG-i ParticleListActionService:  larg4Main:largeant@BeginModule  19-Aug-2024 19:01:28 MDT run: 20000031 subRun: 0 event: 414
Not Stored Process summary:
	compt : 2498213
	annihil : 121950
	Pair : 76
	phot : 815479
	Brem : 748589
	conv : 121872
	Ion : 324609
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  19-Aug-2024 19:01:28 MDT run: 20000031 subRun: 0 event: 414 ParticleListAction.cc:701
MCTruth Handles Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  19-Aug-2024 19:01:28 MDT run: 20000031 subRun: 0 event: 414 ParticleListAction.cc:708
mclistHandle Size: 1
%MSG
%MSG-i endOfEventAction:  larg4Main:largeant@BeginModule  19-Aug-2024 19:01:28 MDT run: 20000031 subRun: 0 event: 414 ParticleListAction.cc:711
Found 1 particles
%MSG
%MSG-i NuRandomService:  IonAndScint:IonAndScint@BeginModule  19-Aug-2024 19:01:29 MDT run: 20000031 subRun: 0 event: 414
Random seed for this event, engine 'IonAndScint.ISCalcAlg': 848089
%MSG
IonAndScint Module Producer
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneUInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneVInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCPlaneZInner
SimEnergyDeposit input module: largeant, instance name: LArG4DetectorServicevolTPCActiveOuter
%MSG-i NuRandomService:  SimDriftElectrons:elecDrift@BeginModule  19-Aug-2024 19:01:36 MDT run: 20000031 subRun: 0 event: 414
Random seed for this event, engine 'elecDrift': 246648247
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  19-Aug-2024 19:02:39 MDT run: 20000031 subRun: 0 event: 414
Random seed for this event, engine 'PDFastSim.photon': 338914067
%MSG
%MSG-i NuRandomService:  PDFastSimPAR:PDFastSim@BeginModule  19-Aug-2024 19:02:39 MDT run: 20000031 subRun: 0 event: 414
Random seed for this event, engine 'PDFastSim.scinttime': 450103317
%MSG
Plugin version SecClnt v5.4.3 is incompatible with secztn v5.6.8 (must be <= 5.4.x) in sec.protocol libXrdSecztn-5.so
IonAndScint endJob.
19-Aug-2024 20:58:18 MDT  Closed input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/usertests/bb/8a/000431_reco_data_2024-08-16T_162313Z.root"

================================================================================================================================
TimeTracker printout (sec)                        Min           Avg           Max         Median          RMS         nEvts   
================================================================================================================================
Full event                                    0.00560314      19.3905       7013.57       0.24999       344.253        414    
--------------------------------------------------------------------------------------------------------------------------------
source:RootInput(read)                        8.5897e-05    0.000602084    0.025892     0.000137626   0.00324496       414    
simulate:rns:RandomNumberSaver                1.6864e-05    2.88606e-05   0.000186181   2.77165e-05   1.01467e-05      414    
simulate:largeant:larg4Main                   0.00493417     0.627214       89.1316      0.227776       4.45708        414    
simulate:IonAndScint:IonAndScint              7.8773e-05     0.0188173      7.07456     0.00011213     0.347244        414    
simulate:elecDrift:SimDriftElectrons          2.9237e-05     0.172139       63.4814     4.75095e-05     3.11598        414    
simulate:PDFastSim:PDFastSimPAR               4.3939e-05      18.569        6853.88     7.11045e-05     336.432        414    
simulate:muonfilter:LArG4ParticleFilter       1.2605e-05    8.03388e-05   0.00772706    2.0591e-05    0.000476444      414    
[art]:TriggerResults:TriggerResultInserter     8.12e-06     1.12002e-05   4.8064e-05     1.01e-05     3.35513e-06      414    
end_path:out1:RootOutput                       5.708e-06    8.73433e-06   0.00015957     8.047e-06    7.54886e-06      414    
end_path:out1:RootOutput(write)                2.254e-06    0.00209805     0.373434      2.86e-06      0.0258923       413    
================================================================================================================================
%MSG-i NuRandomService:  RootOutput:out1@EndJob 19-Aug-2024 20:58:18 MDT  ModuleEndJob

Summary of seeds computed by the NuRandomService
Random policy: 'perEvent'
  algorithm version: EventTimestamp_v1
   Configured value          Last value   ModuleLabel.InstanceName
        (per event)              848089   IonAndScint.ISCalcAlg
        (per event)           338914067   PDFastSim.photon
        (per event)           450103317   PDFastSim.scinttime
        (per event)           246648247   elecDrift
        (per event)           831144890   largeant

%MSG

====================================================================================================
MemoryTracker summary (base-10 MB units used)

  Peak virtual memory usage (VmPeak)  : 18479.8 MB
  Peak resident set size usage (VmHWM): 15164.8 MB
====================================================================================================

TrigReport ---------- Event summary -------------
TrigReport Events total = 414 passed = 5 failed = 409

TrigReport ---------- Modules in End-path ----------
TrigReport        Run    Success      Error Name
TrigReport          5          5          0 out1

TimeReport ---------- Time summary [sec] -------
TimeReport CPU = 8044.843503 Real = 8128.630658

MemReport  ---------- Memory summary [base-10 MB] ------
MemReport  VmPeak = 18479.8 VmHWM = 15164.8

terminate called after throwing an instance of 'cet::coded_exception<art::errors::ErrorCodes, &art::ExceptionDetail::translate[abi:cxx11]>'
  what():  ---- FatalRootError BEGIN
  Fatal Root Error: TBufferFile::Expand
  Requested size (2147483647) is too large (max is 2147483646).
  ROOT severity: 6000
---- FatalRootError END

=== End last 100 lines of lar log file ===
lar exit code 0
Traceback (most recent call last):
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 434, in <module>
    main()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 373, in main
    mddict = expSpecificMetadata.getmetadata()
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 344, in getmetadata
    jobt = self.get_job(proc)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/duneutil/v09_75_03d00/bin/extractor_prod.py", line 69, in get_job
    raise RuntimeError('sam_metadata_dumper returned nonzero exit status {}.'.format(rc))
RuntimeError: sam_metadata_dumper returned nonzero exit status 1.
extractor_prod.py exit code 1
Error reading metadata from file: Expecting value: line 1 column 1 (char 0)
pdjson2metadata exit code 1
.:
total 109220
-rw-r--r-- 1 dunepilot gridusers 110292815 Aug 19 20:58 RootOutput-6bbe-b1d0-a2c8-7104.root
-rw-r--r-- 1 dunepilot gridusers   1528772 Aug 19 20:58 000431_reco_data_2024-08-16T_162313Z_reco_2024-08-20T_004150Z.log
-rw-r--r-- 1 dunepilot gridusers      7770 Aug 19 20:58 jobscript.log
-rw-r--r-- 1 dunepilot gridusers       274 Aug 19 18:41 TFileService-7ec6-17a1-c589-cbf4.root
-rw-r--r-- 1 dunepilot gridusers       104 Aug 19 18:41 all-input-dids.txt
-rw-r--r-- 1 dunepilot gridusers         0 Aug 19 20:58 000431_reco_data_2024-08-16T_162313Z_reco_data_2024-08-20T_004150Z.root.ext.json
-rw-r--r-- 1 dunepilot gridusers         0 Aug 19 20:58 000431_reco_data_2024-08-16T_162313Z_reco_data_2024-08-20T_004150Z.root.json
justIN time: 2024-11-17 08:13:19 UTC       justIN version: 01.01.09