justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Workflow 2294, Stage 1

Priority50
Processors1
Wall seconds80000
RSS bytes2096103424 (1999 MiB)
Max distance for inputs30.0
Enabled input RSEs CERN_PDUNE_EOS, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_LANCASTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MANCHESTER, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled output RSEs CERN_PDUNE_EOS, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_LANCASTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MANCHESTER, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled sites CA_SFU, CA_Victoria, CERN, CH_UNIBE-LHEP, CZ_FZU, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IN_TIFR, IT_CNAF, NL_NIKHEF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Imperial, UK_Lancaster, UK_Liverpool, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-Tier1, UK_Sheffield, US_BNL, US_Caltech, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_MIT, US_Nebraska, US_NotreDame, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin
Scopeusertests
Events for this stage

Output patterns

 DestinationPatternLifetimeFor next stage
1https://fndcadoor.fnal.gov:2880/dune/scratch/users/calcuttj/justin/hitchecker_full2/02294/1*root

Environment variables

NameValue
PROTODUNEANA_DIR/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/c1679e63d9b3d97496fee9162ef0b3e2b52ffff9

File states

Total filesFindingUnallocatedAllocatedOutputtingProcessedNot foundFailed
709000070900

Job states

TotalSubmittedStartedProcessingOutputtingFinishedNotusedAbortedStalledJobscript errorOutputting failedNone processed
10600000966030350128
Files processed00100100200200300300400400500500600600Jun-12 12:00Jun-12 13:00Jun-12 14:00Jun-12 15:00Jun-12 16:00Files processedBin start timesNumber per binUK_BrunelIT_CNAFCERNES_PICES_CIEMATUK_ImperialUK_SheffieldCZ_FZUNL_SURFsaraUK_LiverpoolUK_LancasterNL_NIKHEFUK_Manchester
Replicas per RSE237471.3476056870184181.61961306701713181404.9559297041646336.3000076862206111288.42119332412136285.9751110419404777279.9406125224207201.1943758234713235307.63198320393803157.508430889379133333.5927162763984139.4777591389352625359.1424704597163130.786369788306610375.33566339261796128.80335573787045Replicas per RSENIKHEF (33%)RAL-PP (25%)DUNE_CERN_EOS (15%)DUNE_ES_PIC (10%)RAL_ECHO (4%)DUNE_FR_CCIN2P3_DISK (4%)PRAGUE (3%)QMUL (1%)

RSEs used

NameInputsOutputs
NIKHEF2440
RAL-PP1910
DUNE_CERN_EOS1160
DUNE_ES_PIC780
RAL_ECHO370
DUNE_FR_CCIN2P3_DISK350
PRAGUE260
QMUL100

Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)

File reset events, by site

SiteAllocatedOutputting
NL_NIKHEF42
CERN12
UK_Sheffield10
UK_Imperial11
UK_Brunel10
IT_CNAF13
UK_Liverpool04
UK_Lancaster03
NL_SURFsara01
ES_CIEMAT01
CZ_FZU01
UK_RAL-Tier101

Jobscript

#!/bin/bash
#

source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup metacat
export METACAT_SERVER_URL=https://metacat.fnal.gov:9443/dune_meta_prod/app
export METACAT_AUTH_SERVER_URL=https://metacat.fnal.gov:8143/auth/dune

if [ -n "${PROTODUNEANA_DIR}" ]; then
  stat ${PROTODUNEANA_DIR}
  if [ $? -ne 0 ]; then
    echo "failed to stat dunesw dir"
  fi

  export PRODUCTS=$PROTODUNEANA_DIR:$PRODUCTS
fi


#Setup recent lar software suite
DUNE_VERSION=${DUNE_VERSION:-v09_90_01d00}
setup dunesw \
   "${DUNE_VERSION}" \
   -q "${DUNE_QUALIFIER:-e26:prof}"

if [ $? -ne 0 ]; then
  echo "Failed to setup dunesw $DUNE_VERSION $DUNE_QUALIFIER"
  exit 1
fi

echo "FCL dump:"
FCL=${FCL:-run_pdhd_hit_checker.fcl}
fhicl-dump $FCL

echo "pduneana loc:"
ups active | grep protoduneana

if [ -z ${JUSTIN_PROCESSORS} ]; then
  JUSTIN_PROCESSORS=1
fi

echo "Justin processors: ${JUSTIN_PROCESSORS}"

export TF_NUM_THREADS=${JUSTIN_PROCESSORS}   
export OPENBLAS_NUM_THREADS=${JUSTIN_PROCESSORS} 
export JULIA_NUM_THREADS=${JUSTIN_PROCESSORS} 
export MKL_NUM_THREADS=${JUSTIN_PROCESSORS} 
export NUMEXPR_NUM_THREADS=${JUSTIN_PROCESSORS} 
export OMP_NUM_THREADS=${JUSTIN_PROCESSORS}  

echo "printing env"
env

echo "Will use justin-get-file"
#
DID_PFN_RSE=`$JUSTIN_PATH/justin-get-file`
##Check that any file was returned
if [ "${DID_PFN_RSE}" == "" ] ; then
  echo "Could not get file"
  exit 0
fi

pfn=`echo ${DID_PFN_RSE} | cut -f2 -d' '`
did=`echo ${DID_PFN_RSE} | cut -f1 -d' '`
echo "pfn: ${pfn}"
echo "did: ${did}"
now=$(date -u +"%Y%m%dT%H%M%SZ")

nevents=${NEVENTS:--1}

outname=`echo $did | cut -f2 -d':' | sed -e "s/.root/_${now}_hit_checker.root/"`
#-c standard_reco_calibration_protodunehd_keepup.fcl \
echo "Running reco"
touch reco.log
starttime=`date +"%s"`.0
lar \
    -c $FCL \
    -n ${nevents} \
    -T $outname \
    ${pfn} #>reco.log 2>&1
larExit=$?
endtime=`date +"%s"`.0

if [ $larExit -ne 0 ]; then
  echo "Error in reco"
  cat reco.log
  exit $larExit
fi
echo "$pfn" > justin-processed-pfns.txt
justIN time: 2024-11-17 03:10:44 UTC       justIN version: 01.01.09