Workflow 6464, Stage 1
Priority | 50 |
Processors | 1 |
Wall seconds | 80000 |
Image | /cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest |
RSS bytes | 4193255424 (3999 MiB) |
Max distance for inputs | 100.0 |
Enabled input RSEs | CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled output RSEs | CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled sites | BR_CBPF, CA_SFU, CA_Victoria, CERN, CH_UNIBE-LHEP, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IN_TIFR, IT_CNAF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Glasgow, UK_Imperial, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Caltech, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_MIT, US_Nebraska, US_NotreDame, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin |
Scope | usertests |
Events for this stage |
Output patterns
| Destination | Pattern | Lifetime | For next stage | RSE expression |
---|
1 | Rucio usertests:calcuttj_ehn1_np04_6379_merged-w6464s1p1 | *root | 604800 | False | |
Environment variables
Name | Value |
---|
DATASET | usertests:calcuttj_g4bl_prod_full_1_042125-w6379s1p1 |
LIMIT | 10 |
MERGE_DIR | /cvmfs/fifeuser2.opensciencegrid.org/sw/dune/04714e6ef575ca47529605518ed919ebaf29bea8 |
File states
Total files | Finding | Unallocated | Allocated | Outputting | Processed | Not found | Failed |
---|
1000 | 0 | 0 | 0 | 0 | 999 | 0 | 1 |
Job states
Total | Submitted | Started | Processing | Outputting | Finished | Notused | Aborted | Stalled | Jobscript error | Outputting failed | None processed |
---|
1762 | 0 | 0 | 0 | 0 | 1429 | 0 | 0 | 270 | 49 | 13 | 1 |
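As a quick consistency check on the row above, the non-zero job state counts should add up to the reported total. A minimal sketch in bash, with the counts copied by hand from the table; the variable names are illustrative and not part of justIN:

```shell
#!/bin/bash
# Counts copied from the Job states row above (zero-valued states omitted).
total=1762
finished=1429
stalled=270
jobscript_error=49
outputting_failed=13
none_processed=1

# Sum the non-zero states and compare against the Total column.
sum=$(( finished + stalled + jobscript_error + outputting_failed + none_processed ))
echo "sum of states: $sum (reported total: $total)"
```

Run as written, this prints a sum of 1762, which agrees with the Total column.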
RSEs used
Name | Inputs | Outputs |
---|
MONTECARLO | 1222 | 0 |
DUNE_CERN_EOS | 0 | 433 |
RAL_ECHO | 0 | 247 |
RAL-PP | 0 | 156 |
DUNE_US_FNAL_DISK_STAGE | 0 | 114 |
QMUL | 0 | 22 |
DUNE_US_BNL_SDCC | 0 | 10 |
DUNE_UK_LANCASTER_CEPH | 0 | 8 |
DUNE_FR_CCIN2P3_DISK | 0 | 4 |
DUNE_IT_INFN_CNAF | 0 | 3 |
DUNE_UK_MANCHESTER_CEPH | 0 | 1 |
DUNE_UK_GLASGOW | 0 | 1 |
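The Outputs column can be totalled to cross-check against the File states table. The sketch below sums the third pipe-separated field of the rows above with awk. That the total should equal the Processed file count is an assumption here (it holds if each processed input yields exactly one uploaded output file).

```shell
#!/bin/bash
# Rows copied from the "RSEs used" table above: name | inputs | outputs |
outputs_total=$(awk -F'|' '{ sum += $3 } END { print sum }' <<'EOF'
MONTECARLO | 1222 | 0 |
DUNE_CERN_EOS | 0 | 433 |
RAL_ECHO | 0 | 247 |
RAL-PP | 0 | 156 |
DUNE_US_FNAL_DISK_STAGE | 0 | 114 |
QMUL | 0 | 22 |
DUNE_US_BNL_SDCC | 0 | 10 |
DUNE_UK_LANCASTER_CEPH | 0 | 8 |
DUNE_FR_CCIN2P3_DISK | 0 | 4 |
DUNE_IT_INFN_CNAF | 0 | 3 |
DUNE_UK_MANCHESTER_CEPH | 0 | 1 |
DUNE_UK_GLASGOW | 0 | 1 |
EOF
)
echo "total outputs: $outputs_total"
```

The total comes to 999, the same as the Processed count in File states.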
File reset events, by site
Site | Allocated | Outputting |
---|
US_FNAL-FermiGrid | 22 | 13 |
CERN | 21 | 27 |
UK_RAL-Tier1 | 13 | 12 |
UK_RAL-PPD | 11 | 3 |
US_NotreDame | 8 | 0 |
UK_QMUL | 4 | 5 |
FR_CCIN2P3 | 3 | 0 |
UK_Edinburgh | 3 | 2 |
UK_Glasgow | 2 | 1 |
UK_Sheffield | 2 | 0 |
UK_Durham | 2 | 4 |
US_UChicago | 1 | 0 |
UK_Lancaster | 1 | 1 |
ES_PIC | 1 | 0 |
UK_Liverpool | 1 | 1 |
UK_Imperial | 0 | 2 |
US_Wisconsin | 0 | 1 |
UK_Oxford | 0 | 1 |
Jobscript
#!/bin/bash
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
if [ -z "${DATASET}" ]; then
echo "ERROR MUST SUPPLY DATASET"
exit 1
fi
if [ -z "${MERGE_DIR}" ]; then
echo "ERROR MUST SUPPLY MERGE_DIR"
exit 1
fi
if [ -z "${JUSTIN_PROCESSORS}" ]; then
JUSTIN_PROCESSORS=1
fi
echo "Justin processors: ${JUSTIN_PROCESSORS}"
export TF_NUM_THREADS=${JUSTIN_PROCESSORS}
export OPENBLAS_NUM_THREADS=${JUSTIN_PROCESSORS}
export JULIA_NUM_THREADS=${JUSTIN_PROCESSORS}
export MKL_NUM_THREADS=${JUSTIN_PROCESSORS}
export NUMEXPR_NUM_THREADS=${JUSTIN_PROCESSORS}
export OMP_NUM_THREADS=${JUSTIN_PROCESSORS}
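# Ask justIN for the next file. Its PFN field is the MC number, kept for justIN bookkeeping.
DID_PFN_RSE=$("$JUSTIN_PATH/justin-get-file")
pfn_exit=$?
# Also stop if nothing was returned, since the --iter arithmetic below needs a PFN.
if [ $pfn_exit -ne 0 ] || [ -z "$DID_PFN_RSE" ]; then
echo "Error in justin-get-file or no file returned. Exiting safely"
exit 0
fi
echo "did_pfn_rse $DID_PFN_RSE"
pfn=$(echo "$DID_PFN_RSE" | cut -f2 -d' ')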
setup root v6_28_12 -q e26:p3915:prof
export METACAT_SERVER_URL=https://metacat.fnal.gov:9443/dune_meta_prod/app
export METACAT_AUTH_SERVER_URL=https://metacat.fnal.gov:8143/auth/dune
setup metacat
setup rucio
LIMIT=${LIMIT:-100}
# The part of the jobsub ID before '@' has the form <run>.<subrun>.
subrun=$(echo "$JUSTIN_JOBSUB_ID" | cut -f1 -d@ | cut -f2 -d.)
run=$(echo "$JUSTIN_JOBSUB_ID" | cut -f1 -d@ | cut -f1 -d.)
echo "subrun: $subrun run: $run"
python "$MERGE_DIR/merge_g4bl.py" \
--dataset "${DATASET}" \
-o inherit \
--limit "${LIMIT}" \
--iter $(( 10#$pfn - 1 )) --run "$run" \
--subrun "$subrun"
#--namespace ehn1-beam-np04
if [ $? -ne 0 ]
then
echo "Exiting with error"
exit 1
else
echo "$pfn" > justin-processed-pfns.txt
fi
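The parsing steps in the jobscript can be tried in isolation. The sketch below uses made-up values: the job ID `12345.7@example-sched01.example.org` and the zero-padded counter `000042` are assumptions for illustration, not values taken from this workflow. The `10#` prefix in the jobscript suggests the PFN is expected to carry leading zeros, but that is not confirmed by the page itself.

```shell
#!/bin/bash
# Hypothetical inputs, not taken from a real job.
JUSTIN_JOBSUB_ID="12345.7@example-sched01.example.org"
pfn="000042"

# Same cut pipeline as the jobscript: keep the part before '@',
# then split "<run>.<subrun>" on the dot.
run=$(echo "$JUSTIN_JOBSUB_ID" | cut -f1 -d@ | cut -f1 -d.)
subrun=$(echo "$JUSTIN_JOBSUB_ID" | cut -f1 -d@ | cut -f2 -d.)

# 10# forces base 10. Without it, bash treats a leading zero as octal,
# so 000042 would be read as 34 and 000008 would be an error.
iter=$(( 10#$pfn - 1 ))

echo "run=$run subrun=$subrun iter=$iter"
```

With these inputs the script prints `run=12345 subrun=7 iter=41`.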