21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 220567.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 220567.0@justin-prod-sched02.dune.hep.ac.uk
Workflow Testing | Yes
Workflow ID | 1
Stage ID | 1
User name | amcnab@fnal.gov
HTCondor Group | group_dune.prod_mcsim
Requested | Processors | 1
Requested | GPU | No
Requested | RSS bytes | 1073741824 (1024 MiB)
Requested | Wall seconds limit | 3600 (1 hour)
Submitted time | 2025-06-14 03:35:29
Site | CA_SFU
Entry | DUNE_CA_SFU_lcg-ce3
Last heartbeat | 2025-06-14 05:20:08
From worker node | Hostname | cdr2245.int.cedar.computecanada.ca
From worker node | cpuinfo | Intel(R) Xeon(R) Platinum 8260 CPU @ 2.40GHz
From worker node | OS release | Scientific Linux release 7.9 (Nitrogen)
From worker node | Processors | 1
From worker node | RSS bytes | 1073741824 (1024 MiB)
From worker node | Wall seconds limit | 84598 (23 hours)
From worker node | GPU |
From worker node | Inner Apptainer? | True
Job state | stalled
Allocator name | justin-allocator-pro.dune.hep.ac.uk
Started | 2025-06-14 04:25:45
Input files |
Outputting started | 2025-06-14 05:20:08
Output files |
Finished | 2025-06-14 06:10:08
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
al.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /rses/T3_US_NERSC/attr/ HTTP/1.1" 200 139
DEBUG:root:wan domain is used for the upload
DEBUG:root:Registering file
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x153855980190> acquired
DEBUG:dogpile.lock:Calling creation function for previously expired value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /accounts/dunepro/scopes/ HTTP/1.1" 503 299
2025-06-13 22:18:41,216 WARNING Waiting 0.25s due to reason: server returned 503
WARNING:baseclient:Waiting 0.25s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (5): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /accounts/dunepro/scopes/ HTTP/1.1" 200 799
DEBUG:root:Trying to create dataset: testpro:awt-uploads-202523
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202523 HTTP/1.1" 409 104
INFO:root:Dataset testpro:awt-uploads-202523 already exists - no rule will be created
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-1749875149-8d6UgWG6gr/meta?plugin=DID_COLUMN HTTP/1.1" 404 129
DEBUG:root:File DID does not exist
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas HTTP/1.1" 201 7
INFO:root:Successfully added replica in Rucio catalogue at T3_US_NERSC
DEBUG:root:gfal.NoRename: connecting to storage
DEBUG:root:Checking if davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr exists
DEBUG:root:gfal.NoRename: checking if file exists davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:gfal.NoRename: closing protocol connection
DEBUG:root:[{'hostname': 'dtn14.nersc.gov', 'scheme': 'root', 'port': 1094, 'prefix': '//global/cfs/cdirs/m3249/dune/RSE', 'impl': 'rucio.rse.protocols.gfal.NoRename', 'domains': {'lan': {'read': 10, 'write': 10, 'delete': 10}, 'wan': {'read': 10, 'write': 10, 'delete': 10, 'third_party_copy_read': 0, 'third_party_copy_write': 0}}, 'extended_attributes': None}, {'hostname': 'dtn14.nersc.gov', 'scheme': 'davs', 'port': 1094, 'prefix': '/global/cfs/cdirs/m3249/dune/RSE', 'impl': 'rucio.rse.protocols.gfal.NoRename', 'domains': {'lan': {'read': 1, 'write': 1, 'delete': 1}, 'wan': {'read': 1, 'write': 1, 'delete': 1, 'third_party_copy_read': 1, 'third_party_copy_write': 1}}, 'extended_attributes': None}]
INFO:root:Trying upload with davs to T3_US_NERSC
DEBUG:root:Processing upload with the domain: wan
DEBUG:root:gfal.NoRename: connecting to storage
DEBUG:root:The PFN created from the LFN: davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:gfal.NoRename: checking if file exists davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:gfal.NoRename: checking if file exists davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:put: Attempt 1
DEBUG:root:gfal.NoRename: uploading file from awt-1749875149-8d6UgWG6gr to davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
INFO:root:Successful upload of temporary file. davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:skip_upload_stat=False
DEBUG:root:stat: pfn=davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:gfal.NoRename: getting stats of file davs://dtn14.nersc.gov:1094/global/cfs/cdirs/m3249/dune/RSE/testpro/19/69/awt-1749875149-8d6UgWG6gr
DEBUG:root:Filesize: Expected=26 Found=26
DEBUG:root:Checksum: Expected=5f380703 Found=5f380703
DEBUG:root:gfal.NoRename: closing protocol connection
DEBUG:root:Upload done.
INFO:root:Successfully uploaded file awt-1749875149-8d6UgWG6gr
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v37_1_0_post1/NULL/lib/python3.9/site-packages/urllib3/connectionpool.py:1061: InsecureRequestWarning: Unverified HTTPS request is being made to host 'dune-rucio.fnal.gov'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings
  warnings.warn(
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /traces/ HTTP/1.1" 404 207
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "PUT /replicas HTTP/1.1" 200 0
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202523/dids HTTP/1.1" 201 7
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas/list HTTP/1.1" 503 299
2025-06-13 22:19:02,728 WARNING Waiting 0.25s due to reason: server returned 503
WARNING:baseclient:Waiting 0.25s due to reason: server returned 503
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas/list HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202523/files HTTP/1.1" 200 None
--- Upload try 1/1 --- Rucio upload 1/1 returns 0 --- Replica check try 1/1 --- Dataset awt-uploads-202523 check try 1/1 --- Upload, replicas, and datasets checks passed
'justin-rucio-upload --rse T3_US_NERSC --protocol davs --scope testpro --dataset awt-uploads-202523 awt-1749875149-8d6UgWG6gr --timeout 1200' returns 0
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544/CN=174987514562
issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544
identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1595776544
type : RFC compliant proxy
strength : 2048 bits
path : /home/awt-proxy.pem
timeleft : 167:05:38
key usage : Digital Signature, Key Encipherment, Key Agreement
=== VO dune extension information ===
VO : dune
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk
issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms2.fnal.gov
attribute : /dune/Role=Production/Capability=NULL
attribute : /dune/Role=NULL/Capability=NULL
timeleft : 166:28:55
uri : voms2.fnal.gov:15042
===== Results =====
Download/upload commands:
xrdcp --force --nopbar --verbose $read_pfn downloaded.txt
echo '{"namespace":"testpro","name":"FILENAME","size":0}' >tmp.json
metacat file declare --json -f tmp.json "dune:all"
justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset awt-uploads-202523 --timeout 1200 FILENAME
Use the wrapper job link on the page for the job on the justIN Dashboard to find the full log file, with errors from these commands
Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol
==awt== CA_SFU DUNE_CA_SFU 0 0 root://lcg-dunese1.sfu.computecanada.ca:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_CERN_EOS 0 0 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_ES_PIC 0 0 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_FR_CCIN2P3_DISK 0 0 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_IT_INFN_CNAF 51 99 root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_GLASGOW 0 0 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_LANCASTER_CEPH 0 0 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_UK_MANCHESTER_CEPH 0 0 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_US_BNL_SDCC 0 0 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU DUNE_US_FNAL_DISK_STAGE 0 1 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU NIKHEF 0 99 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU PRAGUE 0 0 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU QMUL 0 0 root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU RAL-PP 0 97 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU RAL_ECHO 0 99 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU SURFSARA 0 99 root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== CA_SFU T3_US_NERSC 0 0 root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
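The `==awt==` result records above follow the field order given in the log (`$JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol`), so they can be summarised with a short script. The following is a minimal Python sketch, not part of justIN: the names `AwtResult`, `parse_awt_results` and `failed_results` are hypothetical, and it assumes that a nonzero return value indicates a failed download or upload (the log does not define the individual codes). It splits on whitespace rather than on line breaks, so it should work whether or not the log tail has kept its newlines.

```python
from typing import List, NamedTuple


class AwtResult(NamedTuple):
    """One ==awt== record, in the field order stated in the log."""
    site: str
    rse: str
    download_retval: int
    upload_retval: int
    read_pfn: str
    write_protocol: str


def parse_awt_results(log_text: str) -> List[AwtResult]:
    """Collect every ==awt== record found in the log text."""
    tokens = log_text.split()
    results = []
    for i, token in enumerate(tokens):
        if token != "==awt==":
            continue
        fields = tokens[i + 1:i + 7]
        if len(fields) < 6:
            continue  # record cut off at the end of the log tail
        site, rse, download, upload, pfn, protocol = fields
        try:
            results.append(
                AwtResult(site, rse, int(download), int(upload), pfn, protocol)
            )
        except ValueError:
            continue  # return values were not integers; skip the record
    return results


def failed_results(results: List[AwtResult]) -> List[AwtResult]:
    """Assumption: any nonzero return value means the transfer failed."""
    return [r for r in results if r.download_retval != 0 or r.upload_retval != 0]


# Two records copied from the log above.
SAMPLE_LOG = (
    "==awt== CA_SFU DUNE_IT_INFN_CNAF 51 99 "
    "root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/"
    "awt-download-2023-03-07-01.txt davs "
    "==awt== CA_SFU T3_US_NERSC 0 0 "
    "root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/"
    "awt-download-2023-03-07-01.txt davs"
)

if __name__ == "__main__":
    for r in failed_results(parse_awt_results(SAMPLE_LOG)):
        print(f"{r.site} {r.rse} download={r.download_retval} upload={r.upload_retval}")
```

Applied to the full set of records above, this would flag DUNE_IT_INFN_CNAF, DUNE_US_FNAL_DISK_STAGE, NIKHEF, RAL-PP, RAL_ECHO and SURFSARA as having at least one nonzero return value.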