Workflow 7111, Stage 1
Priority | 50 |
Processors | 1 |
Wall seconds | 80000 |
Image | /cvmfs/singularity.opensciencegrid.org/fermilab/fnal-wn-sl7:latest |
RSS bytes | 4194304000 (4000 MiB) |
Max distance for inputs | 0.0 |
Enabled input RSEs |
CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, MONTECARLO, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled output RSEs |
CERN_PDUNE_EOS, DUNE_CA_SFU, DUNE_CERN_EOS, DUNE_ES_PIC, DUNE_FR_CCIN2P3_DISK, DUNE_IN_TIFR, DUNE_IT_INFN_CNAF, DUNE_UK_GLASGOW, DUNE_UK_LANCASTER_CEPH, DUNE_UK_MANCHESTER_CEPH, DUNE_US_BNL_SDCC, DUNE_US_FNAL_DISK_STAGE, FNAL_DCACHE, FNAL_DCACHE_STAGING, FNAL_DCACHE_TEST, NIKHEF, PRAGUE, QMUL, RAL-PP, RAL_ECHO, SURFSARA, T3_US_NERSC |
Enabled sites |
BR_CBPF, CA_SFU, CA_Victoria, CERN, CH_UNIBE-LHEP, ES_CIEMAT, ES_PIC, FR_CCIN2P3, IN_TIFR, IT_CNAF, NL_SURFsara, UK_Bristol, UK_Brunel, UK_Durham, UK_Edinburgh, UK_Lancaster, UK_Manchester, UK_Oxford, UK_QMUL, UK_RAL-PPD, UK_RAL-Tier1, UK_Sheffield, US_Caltech, US_Colorado, US_FNAL-FermiGrid, US_FNAL-T1, US_Michigan, US_MIT, US_Nebraska, US_NotreDame, US_PuertoRico, US_SU-ITS, US_Swan, US_UChicago, US_UConn-HPC, US_UCSD, US_Wisconsin |
Scope | usertests |
Events for this stage |
Output patterns
| Destination | Pattern | Lifetime | For next stage | RSE expression |
---|
1 | https://fndcadoor.fnal.gov:2880/dune/scratch/users/ichong/07111/1 | *.root | | | |
Environment variables
Name | Value |
---|
DUNE_QUALIFIER | e26:prof |
DUNE_VERSION | v10_04_06d00 |
FCL_FILE | /cvmfs/fifeuser4.opensciencegrid.org/sw/dune/91a556f0a297ef798b177e3897f3d0e81a7988f7/anatree_new_prod.fcl |
NUM_EVENTS | 200 |
File states
Total files | Finding | Unallocated | Allocated | Outputting | Processed | Not found | Failed |
---|
|
3000 | 0 | 1948 | 0 | 0 | 1052 | 0 | 0 |
Job states
Total | Submitted | Started | Processing | Outputting | Finished | Notused | Aborted | Stalled | Jobscript error | Outputting failed | None processed |
---|
5568 | 0 | 0 | 0 | 0 | 5556 | 0 | 0 | 12 | 0 | 0 | 0 |
RSEs used
Name | Inputs | Outputs |
---|
RAL_ECHO | 347 | 0 |
QMUL | 306 | 0 |
RAL-PP | 265 | 0 |
DUNE_FR_CCIN2P3_DISK | 94 | 0 |
DUNE_ES_PIC | 52 | 0 |
None | 0 | 1052 |
Stats of processed input files as CSV or JSON, and of uploaded output files as CSV or JSON (up to 10000 files included)
File reset events, by site
Site | Allocated | Outputting |
---|
UK_RAL-Tier1 | 10 | 0 |
ES_PIC | 2 | 0 |
Jobscript
#!/bin/bash
:<<'EOF'
To use this jobscript to process 5 files from the dataset fardet-hd__fd_mc_2023a_reco2__full-reconstructed__v09_81_00d02__standard_reco2_dune10kt_nu_1x2x6__prodgenie_nu_dune10kt_1x2x6__out1__validation
data and put the output in the $USER namespace (MetaCat) and saves the output in /scratch
Use this command to create the workflow:
justin simple-workflow \
--mql \
"files from fardet-hd:fardet-hd__fd_mc_2023a_reco2__full-reconstructed__v09_81_00d02__standard_reco2_dune10kt_nu_1x2x6__prodgenie_nu_dune10kt_1x2x6__out1__validation limit 5 ordered" \
--jobscript submit_ana.jobscript --rss-mb 4000 \
--output-pattern '*_ana_*.root:$FNALURL/$USERF'
EOF
# === Config ===
FCL_FILE=${FCL_FILE:-standard_ana_dune10kt_1x2x6.fcl}
DUNE_VERSION=${DUNE_VERSION:-v09_81_00d02}
DUNE_QUALIFIER=${DUNE_QUALIFIER:-e26:prof}
# === Number of events
if [ "$NUM_EVENTS" != "" ]; then
events_option="-n $NUM_EVENTS"
fi
# === Get input PFN
did_pfn_rse=$($JUSTIN_PATH/justin-get-file)
if [ "$did_pfn_rse" = "" ]; then
echo "Nothing to process - exit jobscript"
exit 0
fi
echo "$did_pfn_rse" | cut -f1 -d' ' >> all-input-dids.txt
pfn=$(echo "$did_pfn_rse" | cut -f2 -d' ')
echo "Input PFN = $pfn"
# === Environment
source /cvmfs/dune.opensciencegrid.org/products/dune/setup_dune.sh
setup dunesw "$DUNE_VERSION" -q "$DUNE_QUALIFIER"
now=$(date -u +"%Y-%m-%dT_%H%M%SZ")
Ffname=$(basename "$pfn")
fname="${Ffname%%.*}"
logFile="${fname}_ana_${now}.log"
campaign="justIN.w${JUSTIN_WORKFLOW_ID}s${JUSTIN_STAGE_ID}"
# === Run lar without -o (uses default output: out.root)
(
export LD_PRELOAD=${XROOTD_LIB}/libXrdPosixPreload.so
echo "$LD_PRELOAD"
lar -c "$FCL_FILE" $events_option "$pfn" > "$logFile" 2>&1
)
# === Rename out.root if job succeeded
larExit=$?
echo "lar exit code $larExit"
if [ $larExit -eq 0 ]; then
rand=$(od -An -N2 -i /dev/urandom | tr -d ' ')
new_outFile="${fname}_ana_${now}_r${rand}.root"
mv ana_tree_hd.root "$new_outFile"
echo "$pfn" > justin-processed-pfns.txt
jobscriptExit=0
else
jobscriptExit=1
fi
echo '=== Start last 100 lines of lar log file ==='
tail -100 "$logFile"
echo '=== End last 100 lines of lar log file ==='
# === Archive logs
tar zcf "$(echo "$JUSTIN_JOBSUB_ID.logs.tgz" | sed 's/@/_/g')" *.log
exit $jobscriptExit