justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Workflow 7090, Stage 1

Priority50
Processors1
Wall seconds80000
Image/cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest
RSS bytes4194304000 (4000 MiB)
Max distance for inputs30.0
Enabled input RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled output RSEs CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC
Enabled sites BR_CBPF, CA_SFU, CA_Victoria, CERN, CH_UNIBE-LHEP, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IN_TIFR, IT_CNAF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Lancaster, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Caltech, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_MIT, US_Nebraska, US_NotreDame, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin
Scopeusertests
Events for this stage

Output patterns

 DestinationPatternLifetimeFor next stageRSE expression
1https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/07090/1graph_output*

Environment variables

NameValue
CODE_TAR_DIR_LOCAL/cvmfs/fifeuser4.opensciencegrid.org/sw/dune/c976ce192292f2f9835b2b96cb18eba74fad6274
DUNE_QUALIFIERe26:prof
DUNE_VERSIONv10_04_06d00
FCL_FILE/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d610a8273859e200589d2f4ba622fbb5cc70b37/atm-training-extract.fcl
FCL_SECONDARY/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/47c256ce8bbe924516790667ff895b1f82b9ef4b/eid.fcl
NUM_EVENTS200
XML_MASTER/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d610a8273859e200589d2f4ba622fbb5cc70b37/PandoraSettings_Master_Training_CSV.xml
XML_NEUTRINO/cvmfs/fifeuser1.opensciencegrid.org/sw/dune/6d610a8273859e200589d2f4ba622fbb5cc70b37/PandoraSettings_Neutrino_Training_CSV.xml

File states

Total filesFindingUnallocatedAllocatedOutputtingProcessedNot foundFailed
300000002977023

Job states

TotalSubmittedStartedProcessingOutputtingFinishedNotusedAbortedStalledJobscript errorOutputting failedNone processed
3766000035820331313800
Files processed00200200400400600600800800100010001200120014001400160016001800180020002000May-21 16:00May-21 17:00May-21 18:00May-21 19:00May-21 20:00May-21 21:00Files processedBin start timesNumber per binUK_RAL-PPDCERNFR_CCIN2P3ES_PICUK_OxfordUK_RAL-Tier1UK_QMULUK_ManchesterUK_BristolUK_Durham
Replicas per RSE3000479.0594453312355198.288429176694483000326.51454419282413324.70499444406926337280.19107705607263200.44007001239174332292.94346871104005174.76107394681816311311.2990079057914154.19859847551865306333.3571295638751139.5937892007159265356.4278619242881131.372302439645694371.6686152956152129.0301089411466450377.8506385170879128.72193844292264Replicas per RSEFNAL_DCACHE (38%)PRAGUE (38%)RAL_ECHO (4%)NIKHEF (4%)SURFSARA (4%)QMUL (3%)RAL-PP (3%)DUNE_FR_CCIN2P3_DISK (1%)DUNE_ES_PIC (0%)

RSEs used

NameInputsOutputs
PRAGUE15200
RAL_ECHO3340
QMUL3050
NIKHEF3030
SURFSARA2900
RAL-PP2690
DUNE_FR_CCIN2P3_DISK890
DUNE_ES_PIC500
None020839

Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)

File reset events, by site

SiteAllocatedOutputting
CERN200
UK_Manchester110
UK_RAL-PPD50
ES_PIC50
UK_QMUL20
UK_Oxford10
UK_RAL-Tier110

Jobscript

#!/bin/bash
:<<'EOF'
This jobscript generates CaloHitList-based graph data 
from input reco2 ROOT files using your custom LArSoft setup.

Required environment variables:
  - FCL_FILE
  - CODE_TAR_DIR_LOCAL
  - DUNE_VERSION
  - DUNE_QUALIFIER
  - XML_MASTER
  - XML_NEUTRINO
  - NUM_EVENTS (optional)
EOF

# === Setup FCL and version info ===
FCL_FILE=${FCL_FILE:-atm-training-extract.fcl}
DUNE_VERSION=${DUNE_VERSION:-v10_04_06d00}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}

# === Number of events option ===
if [ -n "$NUM_EVENTS" ]; then
  events_option="-n $NUM_EVENTS"
fi

# === Get a file from justIN ===
did_pfn_rse=$($JUSTIN_PATH/justin-get-file)
if [ -z "$did_pfn_rse" ]; then
  echo "No file assigned. Exiting jobscript."
  exit 0
fi

# === Track input DID for MetaCat ===
echo "$did_pfn_rse" | cut -f1 -d' ' >> all-input-dids.txt

# === Parse PFN from DID ===
pfn=$(echo "$did_pfn_rse" | cut -d' ' -f2)
echo "Input PFN = $pfn"

# === Setup DUNE software ===
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"

# === Mirror CODE_TAR_DIR_LOCAL ===
INPUT_TAR_DIR_LOCAL="$CODE_TAR_DIR_LOCAL"
echo "INPUT_TAR_DIR_LOCAL = $INPUT_TAR_DIR_LOCAL"

# === Setup custom code ===
if [ -n "$CODE_TAR_DIR_LOCAL" ]; then
  echo "Using local products from $CODE_TAR_DIR_LOCAL"
  source "$CODE_TAR_DIR_LOCAL/larsoft_graph_V1_2025/localProducts_larsoft_v10_04_06_e26_prof/setup-grid"
  mrbslp
fi

# === Generate common timestamp and random suffix for output renaming ===
timestamp=$(date -u +"%Y-%m-%dT_%H%M%SZ")
rand_suffix=$((1 + RANDOM % 10))

# === Output file naming ===
fname=$(basename "$pfn" .root)
outFile="${fname}_graph_${timestamp}.root"
logFile="${fname}_graph_${timestamp}.log"

# === Set FW search path ===
XML_DIR_MASTER=$(dirname "$XML_MASTER")
XML_DIR_NEUTRINO=$(dirname "$XML_NEUTRINO")
export FW_SEARCH_PATH="$XML_DIR_MASTER:$XML_DIR_NEUTRINO:$FW_SEARCH_PATH"

# === Run lar (primary) ===
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so
echo "Running LArSoft with FCL: $FCL_FILE"
lar -c "$FCL_FILE" $events_option -o "$outFile" "$pfn" > "$logFile" 2>&1
larExit=$?

# === Run lar (secondary) if needed ===
if [ -n "$FCL_SECONDARY" ]; then
  secondary_out="secondary_${outFile}"
  secondary_log="secondary_${logFile}"
  echo "Running LArSoft with secondary FCL: $FCL_SECONDARY"
  lar -c "$FCL_SECONDARY" $events_option "$pfn" > "$secondary_log" 2>&1
fi

if [ -f "$secondary_log" ]; then
  echo '=== Start last 100 lines of secondary lar log file ==='
  tail -100 "$secondary_log"
  echo '=== End last 100 lines of secondary lar log file ==='
fi





# === Rename .data and .root files with timestamp and suffix ===
if [ $larExit -eq 0 ]; then
  for f in *.data; do
    if [ -f "$f" ]; then
      newname="graph_output_${timestamp}_${rand_suffix}_$f"
      mv -f "$f" "$newname"
      echo "Renamed $f -> $newname"
    fi
  done

  for f in *eid.root; do
    if [ -f "$f" ]; then
      newname="graph_output_${timestamp}_${rand_suffix}_$f"
      mv -f "$f" "$newname"
      echo "Renamed $f -> $newname"
    fi
  done
fi

# === Show lar log tail ===
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$logFile"
echo '=== End last 100 lines of lar log file ==='

# === Mark processed ===
if [ $larExit -eq 0 ]; then
  echo "$pfn" > justin-processed-pfns.txt
  jobscriptExit=0
else
  jobscriptExit=1
fi

# === Package logs ===
tar zcf "${JUSTIN_JOBSUB_ID//[@]/_}.logs.tgz" *.log

# === Display output summary ===
echo "=== Generated output files ==="
ls -1 *.* 2>/dev/null | grep -v 'all-input-dids.txt' || echo "No output files found."


exit $jobscriptExit
justIN time: 2025-05-22 14:16:16 UTC       justIN version: 01.03.01