21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 221469.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 221469.0@justin-prod-sched02.dune.hep.ac.uk | |
Workflow Testing | Yes | |
Workflow ID | 1 | |
Stage ID | 1 | |
User name | amcnab@fnal.gov | |
HTCondor Group | group_dune.prod_mcsim | |
Requested | Processors | 1 |
Requested | GPU | No |
Requested | RSS bytes | 1073741824 (1024 MiB) |
Requested | Wall seconds limit | 3600 (1 hours) |
Submitted time | 2025-06-14 21:47:07 | |
Site | US_SU-ITS | |
Entry | Glow_US_Syracuse_condor-ce3 | |
Last heartbeat | 2025-06-14 22:48:12 | |
From worker node | Hostname | CRUSH-OSG-C7-10-5-198-173 |
From worker node | cpuinfo | Intel(R) Xeon(R) CPU E5-2698 v3 @ 2.30GHz |
From worker node | OS release | Scientific Linux release 7.9 (Nitrogen) |
From worker node | Processors | 1 |
From worker node | RSS bytes | 1073741824 (1024 MiB) |
From worker node | Wall seconds limit | 171000 (47 hours) |
From worker node | GPU | |
From worker node | Inner Apptainer? | False |
Job state | outputting_failed | |
Allocator name | justin-allocator-pro.dune.hep.ac.uk | |
Started | 2025-06-14 21:48:18 | |
Input files | ||
Jobscript | Exit code | 0 |
Jobscript | Real time | 0m (0s) |
Jobscript | CPU time | 0m (0s = 0%) |
Jobscript | Max RSS bytes | 0 (0 MiB) |
Outputting started | 2025-06-14 22:23:38 | |
Output files | ||
Finished | 2025-06-14 22:48:12 | |
List job events | Wrapper job log
Jobscript log (last 10,000 characters)
------------
US_SU-ITS SURFSARA davs root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt
'xrdcp --force --nopbar --verbose root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt downloaded.txt' returns 0
{
    "created_timestamp": null,
    "creator": "dunepro",
    "fid": "hL4uU50aTTSBo1qM",
    "metadata": {},
    "name": "awt-1749937707-pjmrDDU8zS",
    "namespace": "testpro",
    "retired": false,
    "retired_by": null,
    "retired_timestamp": null,
    "size": 0,
    "updated_by": null,
    "updated_timestamp": null
}
metacat file declare returns 0
GFAL_CONFIG_DIR:
GFAL_PLUGIN_DIR:
justin-rucio-upload attempt 1
DEBUG:root:Num. of files that upload client is processing: 1
DEBUG:dogpile.cache.region:No value present for key: "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:NeedRegenerationException
DEBUG:dogpile.lock:no value, waiting for create lock
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x2ab8b7edc940> acquired
DEBUG:dogpile.cache.region:No value present for key: "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Calling creation function for not-yet-present value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/?expression=SURFSARA HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/SURFSARA HTTP/1.1" 503 299
2025-06-14 22:21:19,303 WARNING Waiting 0.25s due to reason: server returned 503
WARNING:baseclient:Waiting 0.25s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/SURFSARA HTTP/1.1" 504 247
2025-06-14 22:22:26,163 WARNING Waiting 0.5s due to reason: server returned 504
WARNING:baseclient:Waiting 0.5s due to reason: server returned 504
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (3): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/SURFSARA HTTP/1.1" 503 299
2025-06-14 22:22:44,863 WARNING Waiting 1.0s due to reason: server returned 503
WARNING:baseclient:Waiting 1.0s due to reason: server returned 503
--- Upload try 1/1 ---
Rucio upload 1/1 fails: An unknown exception occurred.
Details: no error information passed (http status code: 503)
--- Exit with 99
'justin-rucio-upload --rse SURFSARA --protocol davs --scope testpro --dataset awt-uploads-202523 awt-1749937707-pjmrDDU8zS --timeout 1200' returns 99
---------------------------------------------------------------------
US_SU-ITS T3_US_NERSC davs root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt
'xrdcp --force --nopbar --verbose root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt downloaded.txt' returns 0
{
    "created_timestamp": null,
    "creator": "dunepro",
    "fid": "xm8qS5UmROeMjq8y",
    "metadata": {},
    "name": "awt-1749937707-lkBRZ1ALV8",
    "namespace": "testpro",
    "retired": false,
    "retired_by": null,
    "retired_timestamp": null,
    "size": 0,
    "updated_by": null,
    "updated_timestamp": null
}
metacat file declare returns 0
GFAL_CONFIG_DIR:
GFAL_PLUGIN_DIR:
justin-rucio-upload attempt 1
DEBUG:root:Num. of files that upload client is processing: 1
DEBUG:dogpile.cache.region:No value present for key: "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:NeedRegenerationException
DEBUG:dogpile.lock:no value, waiting for create lock
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x2b85aa957940> acquired
DEBUG:dogpile.cache.region:No value present for key: "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Calling creation function for not-yet-present value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/?expression=T3_US_NERSC HTTP/1.1" 503 299
2025-06-14 22:23:05,189 WARNING Waiting 0.25s due to reason: server returned 503
WARNING:baseclient:Waiting 0.25s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/?expression=T3_US_NERSC HTTP/1.1" 503 299
2025-06-14 22:23:20,979 WARNING Waiting 0.5s due to reason: server returned 503
WARNING:baseclient:Waiting 0.5s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (3): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/?expression=T3_US_NERSC HTTP/1.1" 503 299
2025-06-14 22:23:36,761 WARNING Waiting 1.0s due to reason: server returned 503
WARNING:baseclient:Waiting 1.0s due to reason: server returned 503
--- Upload try 1/1 ---
Rucio upload 1/1 fails: An unknown exception occurred.
Details: no error information passed (http status code: 503)
--- Exit with 99
'justin-rucio-upload --rse T3_US_NERSC --protocol davs --scope testpro --dataset awt-uploads-202523 awt-1749937707-lkBRZ1ALV8 --timeout 1200' returns 99
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544/CN=174993769883
issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544
identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544
type : RFC compliant proxy
strength : 2048 bits
path : /srv/home/awt-proxy.pem
timeleft : 167:24:40
key usage : Digital Signature, Key Encipherment, Key Agreement
=== VO dune extension information ===
VO : dune
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk
issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms2.fnal.gov
attribute : /dune/Role=Production/Capability=NULL
attribute : /dune/Role=NULL/Capability=NULL
timeleft : 149:25:24
uri : voms2.fnal.gov:15042
===== Results =====
Download/upload commands:
xrdcp --force --nopbar --verbose $read_pfn downloaded.txt
echo '{"namespace":"testpro","name":"FILENAME","size":0}' >tmp.json
metacat file declare --json -f tmp.json "dune:all"
justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset awt-uploads-202523 --timeout 1200 FILENAME
Use the wrapper job link on the page for the job on the justIN Dashboard to find the full log file, with errors from these commands
Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol
==awt== US_SU-ITS DUNE_CA_SFU 0 0 root://lcg-dunese1.sfu.computecanada.ca:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_CERN_EOS 0 0 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_ES_PIC 0 99 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_FR_CCIN2P3_DISK 0 99 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_IT_INFN_CNAF 51 99 root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_UK_GLASGOW 0 99 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_UK_LANCASTER_CEPH 0 99 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_UK_MANCHESTER_CEPH 0 99 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_US_BNL_SDCC 0 99 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS DUNE_US_FNAL_DISK_STAGE 0 99 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS NIKHEF 0 99 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS PRAGUE 0 98 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS QMUL 0 99 root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS RAL-PP 0 99 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS RAL_ECHO 0 99 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS SURFSARA 0 99 root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_SU-ITS T3_US_NERSC 0 99 root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
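
The `==awt==` result records in the log follow the field order given by the "Each line:" description ($JUSTIN_SITE_NAME, $rse_name, $download_retval, $upload_retval, $read_pfn, $write_protocol). The sketch below is one possible way to extract those records and list the storage elements with a non-zero return value. It is not part of justIN: the names `AwtResult`, `parse_awt_results` and `failed` are hypothetical, and it assumes the fields contain no embedded whitespace, as is the case in this log.

```python
import re
from typing import List, NamedTuple


class AwtResult(NamedTuple):
    """One ==awt== record (hypothetical helper type, not part of justIN)."""
    site: str
    rse: str
    download_retval: int
    upload_retval: int
    read_pfn: str
    write_protocol: str


# Fields are whitespace-separated, so the pattern works whether the records
# sit on separate lines or have been run together on one line.
AWT_PATTERN = re.compile(
    r"==awt==\s+(\S+)\s+(\S+)\s+(\d+)\s+(\d+)\s+(\S+)\s+(\S+)"
)


def parse_awt_results(log_text: str) -> List[AwtResult]:
    """Return every ==awt== record found in the jobscript log text."""
    return [
        AwtResult(site, rse, int(dl), int(ul), pfn, proto)
        for site, rse, dl, ul, pfn, proto in AWT_PATTERN.findall(log_text)
    ]


def failed(results: List[AwtResult]) -> List[AwtResult]:
    """Keep only records where the download or the upload returned non-zero."""
    return [r for r in results if r.download_retval != 0 or r.upload_retval != 0]


# Three records copied from the log above.
sample = (
    "==awt== US_SU-ITS DUNE_CA_SFU 0 0 "
    "root://lcg-dunese1.sfu.computecanada.ca:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs\n"
    "==awt== US_SU-ITS DUNE_IT_INFN_CNAF 51 99 "
    "root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs\n"
    "==awt== US_SU-ITS PRAGUE 0 98 "
    "root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs\n"
)

results = parse_awt_results(sample)
for r in failed(results):
    print(r.rse, "download:", r.download_retval, "upload:", r.upload_retval)
```

Applied to the full log, the same filter would leave only DUNE_CA_SFU and DUNE_CERN_EOS out of the failure list, since those are the only two records with both return values at 0.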