21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 425431.157@justin-prod-sched01.dune.hep.ac.uk
Jobsub ID | 425431.157@justin-prod-sched01.dune.hep.ac.uk | |
Workflow ID | 8046 | |
Stage ID | 1 | |
User name | higuera@fnal.gov | |
HTCondor Group | group_dune.prod_mcsim | |
Requested | Processors | 1 |
GPU | No | |
RSS bytes | 4194304000 (4000 MiB) | |
Wall seconds limit | 80000 (22 hours) | |
Submitted time | 2025-07-01 07:13:54 | |
Site | UK_Durham | |
Entry | DUNE_UK_SGridDurham_ce3 | |
Last heartbeat | 2025-07-01 08:10:34 | |
From worker node | Hostname | n263.dur.scotgrid.ac.uk |
cpuinfo | AMD EPYC 9534 64-Core Processor | |
OS release | Scientific Linux release 7.9 (Nitrogen) | |
Processors | 1 | |
RSS bytes | 4194304000 (4000 MiB) | |
Wall seconds limit | 171000 (47 hours) | |
GPU | ||
Inner Apptainer? | True | |
Job state | jobscript_error | |
Allocator name | justin-allocator-pro.dune.hep.ac.uk | |
Started | 2025-07-01 07:41:48 | |
Input files | fardet-vd:prodmarley_nue_cc_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250204T110729Z_gen_001736_supernova_g4stage1_g4stage2_detsim_reco.root | |
Jobscript | Exit code | 139 |
Real time | 0m (0s) | |
CPU time | 0m (0s = 0%) | |
Max RSS bytes | 0 (0 MiB) | |
Outputting started | ||
Output files | ||
Finished | 2025-07-01 08:10:34 | |
Saved logs | justin-logs:425431.157-justin-prod-sched01.dune.hep.ac.uk.logs.tgz | |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
29Z_gen_001736_supernova_g4stage1_g4stage2_detsim_reco_20250701T074153Z_reco2.root -n -1 AcdDigitReader::ctor: LogLevel: 1 DuneToolManager::getPrivate: ERROR: Tool name is blank StandardRawDigitExtractService::ctor: Retrieved digit read tool digitReader StandardRawDigitExtractService::ctor: StandardRawDigitExtractService: StandardRawDigitExtractService::ctor: LogLevel: 1 StandardRawDigitExtractService::ctor: DigitReadTool: digitReader StandardRawDigitExtractService::ctor: PedestalOption: 1 StandardRawDigitExtractService::ctor: FlagStuckOff: 0 StandardRawDigitExtractService::ctor: FlagStuckOn: 0 StandardRawDigitPrepService::ctor: Fetching extract service. StandardRawDigitPrepService::ctor: Fetching channel status provider. StandardRawDigitPrepService::ctor: Channel status provider: @0xb91b7c0 StandardRawDigitPrepService::ctor: Extract service: @0xbf02150 StandardRawDigitPrepService::ctor: Fetching deconvolution service. StandardRawDigitPrepService::ctor: Deconvolution service: @0xb91c090 StandardRawDigitPrepService::ctor: Fetching ROI building service. StandardRawDigitPrepService::ctor: ROI building service: @0xb91d250 StandardRawDigitPrepService::ctor: Fetching wire building service. StandardRawDigitPrepService::ctor: Wire building service: @0x3619e50 StandardRawDigitPrepService::ctor: StandardRawDigitPrepService: StandardRawDigitPrepService::ctor: LogLevel: 1 StandardRawDigitPrepService::ctor: SkipBad: 1 StandardRawDigitPrepService::ctor: SkipNoisy: 0 StandardRawDigitPrepService::ctor: ChannelStatusOnline: 0 StandardRawDigitPrepService::ctor: DoMitigation: 0 StandardRawDigitPrepService::ctor: DoEarlySignalFinding: 0 StandardRawDigitPrepService::ctor: DoNoiseRemoval: 0 StandardRawDigitPrepService::ctor: DoDeconvolution: 1 StandardRawDigitPrepService::ctor: DoPedestalAdjustment: 0 StandardRawDigitPrepService::ctor: DoROI: 1 StandardRawDigitPrepService::ctor: DoWires: 1 StandardRawDigitPrepService::ctor: DoDump: 0 StandardRawDigitPrepService::ctor: DoIntermediateStates: 0 StandardRawDigitPrepService::ctor: No display tools. Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_U (Potential memory leak). Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_V (Potential memory leak). Warning in <TFile::Append>: Replacing existing TH1: FieldResponse_Y (Potential memory leak). Reading model from /cvmfs/dune.opensciencegrid.org/products/dune/dune_pardata/v01_84_00/CnnModels/cnn_ndkemtrk_pitch_5_wire_44_drift_48_down_6_mean_notes_AtmAndNdk.nnet Layers 12 Layer 0 Convolution2D LayerConv2D 48x1x5x5 border_mode valid Layer 1 Activation Activation type relu Layer 2 Dropout Layer 3 Flatten Layer 4 Dense weights 84480 bias 128 Layer 5 Activation Activation type tanh Layer 6 Dropout Layer 7 Dense weights 128 bias 32 Layer 8 Activation Activation type tanh Layer 9 Dropout Layer 10 Dense weights 32 bias 4 Layer 11 Activation Activation type sigmoid ************************************************************************************************************************************* Unique Ptrs that are added to the event ************************************************************************************************************************************* * Data Product Name: InitialTrack * Instance Name: * Type: std::vector<recob::Track, std::allocator<recob::Track> >* * * Data Product Name: ShowerPCA * Instance Name: * Type: std::vector<recob::PCAxis, std::allocator<recob::PCAxis> >* * * Data Product Name: shower * Instance Name: * Type: std::vector<recob::Shower, std::allocator<recob::Shower> >* * * Association Name: PFParticlePCAxisAssn * Instance Name: * Type: art::Assns<recob::PFParticle, recob::PCAxis, void>* * * Association Name: ShowerPCAxisAssn * Instance Name: * Type: art::Assns<recob::Shower, recob::PCAxis, void>* * * Association Name: ShowerTrackAssn * Instance Name: * Type: art::Assns<recob::Shower, recob::Track, void>* * * Association Name: ShowerTrackHitAssn * Instance Name: * Type: art::Assns<recob::Track, recob::Hit, void>* * * Association Name: clusterAssociationsbase * Instance Name: * Type: art::Assns<recob::Shower, recob::Cluster, void>* * * Association Name: hitAssociationsbase * Instance Name: * Type: art::Assns<recob::Shower, recob::Hit, void>* * * Association Name: pfShowerAssociationsbase * Instance Name: * Type: art::Assns<recob::Shower, recob::PFParticle, void>* * * Association Name: spShowerAssociationsbase * Instance Name: * Type: art::Assns<recob::Shower, recob::SpacePoint, void>* * ************************************************************************************************************************************* 01-Jul-2025 08:43:13 BST Initiating request to open input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-vd/65/85/prodmarley_nue_cc_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250204T110729Z_gen_001736_supernova_g4stage1_g4stage2_detsim_reco.root" 01-Jul-2025 08:43:16 BST Opened input file "root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/fardet-vd/65/85/prodmarley_nue_cc_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250204T110729Z_gen_001736_supernova_g4stage1_g4stage2_detsim_reco.root" Begin processing the 1st record. run: 5043 subRun: 0 event: 17351 at 01-Jul-2025 08:43:32 BST Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_1_U_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_1_V_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_1_W_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_2_U_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_2_V_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_SNSignalTag_DUNEFD_VD_2_W_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_1_U_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_1_V_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_1_W_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_2_U_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_2_V_v04_12_00.pt' Loaded the TorchScript model '/cvmfs/dune.osgstorage.org/pnfs/fnal.gov/usr/dune/persistent/stash//PandoraNetworkData/PandoraNet_Vertex_DUNEFD_VD_LowE_2_W_v04_12_00.pt' 01-Jul-2025 09:08:29 BST Opened output file with pattern "prodmarley_nue_cc_flat_radiological_decay0_dunevd10kt_1x8x14_3view_30deg_20250204T110729Z_gen_001736_supernova_g4stage1_g4stage2_detsim_reco_20250701T074153Z_reco2.root" Malformed TimeTracker database. The TimeEvent table is empty, but the TimeModule table is not. This can happen if an exception has been thrown from a module while processing the first event. Any saved database file is suspect and should not be used. ==================================================================================================== MemoryTracker summary (base-10 MB units used) Peak virtual memory usage (VmPeak) : 3854.91 MB Peak resident set size usage (VmHWM): 2327.64 MB ==================================================================================================== %MSG-s ArtException: PostEndJob 01-Jul-2025 09:08:31 BST ModuleEndJob ---- EventProcessorFailure BEGIN EventProcessor: an exception occurred during current event processing ---- ScheduleExecutionFailure BEGIN Path: ProcessingStopped. ---- FileReadError BEGIN ---- FatalRootError BEGIN Fatal Root Error: TNetXNGFile::ReadBuffer [ERROR] Server responded with an error: [3012] Failed to open file (Pool unavailable [1010]) ROOT severity: 3000 ---- FatalRootError END The above exception was thrown while processing module LineCluster/linecluster run: 5043 subRun: 0 event: 17351 ---- FileReadError END Exception going through path reco ---- ScheduleExecutionFailure END ---- EventProcessorFailure END ---- FatalRootError BEGIN Fatal Root Error: TNetXNGFile::TNetXNGFile The remote file is not open ROOT severity: 3000 ---- FatalRootError END ---- FatalRootError BEGIN Fatal Root Error: TNetXNGFile::Close [ERROR] Server responded with an error: [3012] Failed to open file (Pool unavailable [1010]) ROOT severity: 3000 ---- FatalRootError END ---- FatalRootError BEGIN Fatal Root Error: TNetXNGFile::Close [ERROR] Server responded with an error: [3012] Failed to open file (Pool unavailable [1010]) ROOT severity: 3000 ---- FatalRootError END %MSG ../justin-jobscript: line 74: 1139 Segmentation fault (core dumped) lar -c reco2_supernova_dunevd10kt_1x8x14_3view_30deg_prod2024.fcl $FILE -o ${reco2_name}.root -n -1