justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 97058.143@justin-prod-sched02.dune.hep.ac.uk

Jobsub ID97058.143@justin-prod-sched02.dune.hep.ac.uk
Workflow ID4105
Stage ID1
User namelavaut@fnal.gov
HTCondor Groupgroup_dune
RequestedProcessors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2024-11-14 15:01:02
SiteNL_SURFsara
EntryDUNE_SurfSARA_arc01
Last heartbeat2024-11-14 15:04:23
From worker nodeHostnamewn-la-13.gina.surfsara.nl
cpuinfoAMD EPYC 9754 128-Core Processor
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit129600 (36 hours)
Inner Apptainer?True
Job statejobscript_error
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2024-11-14 15:03:48
Input fileshd-protodune-det-reco:np04hd_raw_run029186_0657_dataflow1_datawriter_0_20240915T064213_reco_stage1_reco_stage2_20240916T143525_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0642_dataflow2_datawriter_0_20240915T061521_reco_stage1_reco_stage2_20240916T151802_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0480_dataflow3_datawriter_0_20240915T013017_reco_stage1_reco_stage2_20240916T144551_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0397_dataflow0_datawriter_0_20240914T230330_reco_stage1_reco_stage2_20240916T162641_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0063_dataflow7_datawriter_0_20240914T132746_reco_stage1_reco_stage2_20240916T142420_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0733_dataflow0_datawriter_0_20240915T091852_reco_stage1_reco_stage2_20240916T152233_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0537_dataflow3_datawriter_0_20240915T030827_reco_stage1_reco_stage2_20240916T152148_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0196_dataflow5_datawriter_0_20240914T171611_reco_stage1_reco_stage2_20240916T132038_keepup.root
hd-protodune-det-reco:np04hd_raw_run029186_0140_dataflow6_datawriter_0_20240914T154158_reco_stage1_reco_stage2_20240916T150605_keepup.root
JobscriptExit code90
Real time0m (0s)
CPU time0m (0s = 0%)
Outputting started 
Output files
Finished2024-11-14 15:04:23
Saved logsjustin-logs:97058.143-justin-prod-sched02.dune.hep.ac.uk.logs.tgz
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

get_file receives:
Failed finding one file: Failed to allocate the chosen file (hd-protodune-det-reco:np04hd_raw_run029186_0716_dataflow7_datawriter_0_20240915T084625_reco_stage1_reco_stage2_20240916T150939_keepup.root): already allocated???get-file fails with HTTP code 500 from allocator!
Could not get file
Input PFN = root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/hd-protodune-det-reco/0a/6c/np04hd_raw_run029186_0140_dataflow6_datawriter_0_20240914T154158_reco_stage1_reco_stage2_20240916T150605_keepup.root
Setting up larsoft UPS area... /cvmfs/larsoft.opensciencegrid.org
Setting up DUNE UPS area... /cvmfs/dune.opensciencegrid.org/products/dune/
PRODUCTS: /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
DUNESW_DIR: 
DUNESW_DIR: /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/dunesw/v10_00_02d00
PROTODUNEANA_DIR: /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/protoduneana/v10_00_02d00
DUNEPROTOTYPES_DIR: /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/duneprototypes/v10_00_02d00

MRB_PROJECT=larsoft
MRB_PROJECT_VERSION=v10_00_02d00
MRB_QUALS=prof:e26
MRB_TOP=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f
MRB_SOURCE=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/srcs
MRB_BUILDDIR=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/build_slf7.x86_64
MRB_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/localProducts_larsoft_v10_00_02d00_prof_e26

PRODUCTS=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/localProducts_larsoft_v10_00_02d00_prof_e26:/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f:/cvmfs/dune.opensciencegrid.org/products/dune:/cvmfs/larsoft.opensciencegrid.org/products:/cvmfs/larsoft.opensciencegrid.org/packages:/cvmfs/fermilab.opensciencegrid.org/products/common/db/
CETPKG_INSTALL=/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/localProducts_larsoft_v10_00_02d00_prof_e26

 VERSION ACTIVE =  VERSION ACTIVE =  VERSION ACTIVE = 
duneprototypes    v10_00_02d00    -f Linux64bit+3.10-2.17 -q e26:prof        -z /cvmfs/dune.opensciencegrid.org/products/dune
dunesw            v10_00_02d00    -f Linux64bit+3.10-2.17 -q e26:prof        -z /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/localProducts_larsoft_v10_00_02d00_prof_e26
local product directory is /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/b45eb38553651f97b6c3d6135486f05ed5eb726f/localProducts_larsoft_v10_00_02d00_prof_e26
----------- this block should be empty ------------------
---------------------------------------------------------
lar exit code 90
.:
total 24
-rw-r--r--. 1 dune003 dune 3120 Nov 14 16:04 jobscript.log
-rw-r--r--. 1 dune003 dune 1847 Nov 14 16:03 file.list
-rw-r--r--. 1 dune003 dune 1390 Nov 14 16:03 all-input-dids.txt
-rw-r--r--. 1 dune003 dune 1251 Nov 14 16:03 did.list
-rw-r--r--. 1 dune003 dune  304 Nov 14 16:04 np04hd_raw_run029186_0140_dataflow6_datawriter_0_20240914T154158_reco_stage1_reco_stage2_20240916T150605_keepup_singleHit2024-11-14T_150400Z.log
-rw-r--r--. 1 dune003 dune  202 Nov 14 16:04 justin-processed-pfns.txt
justIN time: 2024-11-17 06:29:16 UTC       justIN version: 01.01.09