Jobsub ID 103936.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 103936.0@justin-prod-sched02.dune.hep.ac.uk |
Workflow Testing | Yes |
Workflow ID | 1 |
Stage ID | 1 |
User name | amcnab@fnal.gov |
HTCondor Group | group_dune.prod_mcsim |
Requested | Processors | 1 |
RSS bytes | 1073741824 (1024 MiB) |
Wall seconds limit | 3600 (1 hours) |
Submitted time | 2024-11-19 10:39:52 |
Site | US_FNAL-T1 |
Entry | CMSHTPC_T1_US_FNAL_condce_opp1_whole |
Last heartbeat | 2024-11-19 11:07:08 |
From worker node | Hostname | dunegli-36874-0-cmswn2289.fnal.gov |
cpuinfo | Intel(R) Xeon(R) CPU E5-2670 v3 @ 2.30GHz |
OS release | Scientific Linux release 7.9 (Nitrogen) |
Processors | 1 |
RSS bytes | 1073741824 (1024 MiB) |
Wall seconds limit | 171000 (47 hours) |
Inner Apptainer? | True |
Job state | outputting_failed |
Allocator name | justin-allocator-pro.dune.hep.ac.uk |
Started | 2024-11-19 10:57:56 |
Input files | |
Jobscript | Exit code | 0 |
Real time | 9m (540s) |
CPU time | 0m (16s = 2%) |
Outputting started | 2024-11-19 11:06:57 |
Output files | |
Finished | 2024-11-19 11:07:08 |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
s most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
INFO:root:Dataset testpro:awt-uploads-202447 already exists - no rule will be created
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-1732013878-eooos6YW9R/meta?plugin=DID_COLUMN HTTP/1.1" 404 129
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:charset_normalizer:Encoding detection: ascii is most likely the one.
DEBUG:root:File DID does not exist
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas HTTP/1.1" 201 7
INFO:root:Successfully added replica in Rucio catalogue at SURFSARA
DEBUG:rucio.rse.protocols.protocol:PFN2LFN function will not be fetched from the policy package
DEBUG:root:gfal.Default: connecting to storage
DEBUG:root:gfal.Default: checking if file exists None
DEBUG:root:Checking if root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R exists
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:[{'hostname': 'webdav.grid.surfsara.nl', 'scheme': 'davs', 'port': 2880, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 2, 'write': 1, 'delete': 1}, 'wan': {'read': 2, 'write': 1, 'delete': 1, 'third_party_copy_read': 1, 'third_party_copy_write': 1}}, 'extended_attributes': None}, {'hostname': 'penguin12.grid.surfsara.nl', 'scheme': 'root', 'port': 21094, 'prefix': '/pnfs/grid.sara.nl/data/dune/disk/RSE', 'impl': 'rucio.rse.protocols.gfal.Default', 'domains': {'lan': {'read': 1, 'write': 1, 'delete': 2}, 'wan': {'read': 1, 'write': 1, 'delete': 2, 'third_party_copy_read': 10, 'third_party_copy_write': 10}}, 'extended_attributes': None}]
INFO:root:Trying upload with root to SURFSARA
DEBUG:root:Processing upload with the domain: wan
DEBUG:root:gfal.Default: connecting to storage
DEBUG:root:The PFN created from the LFN: root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R
DEBUG:root:gfal.Default: checking if file exists root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload
DEBUG:root:put: Attempt 1
DEBUG:root:gfal.Default: uploading file from awt-1732013878-eooos6YW9R to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload
INFO:root:Successful upload of temporary file. root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload
DEBUG:root:skip_upload_stat=False
DEBUG:root:stat: pfn=root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload
DEBUG:root:gfal.Default: getting stats of file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload
DEBUG:root:Filesize: Expected=26 Found=26
DEBUG:root:Checksum: Expected=62df074f Found=62df074f
DEBUG:root:Renaming file root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R
DEBUG:root:gfal.Default: renaming file from root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R.rucio.upload to root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/23/3b/awt-1732013878-eooos6YW9R
DEBUG:root:gfal.Default: closing protocol connection
DEBUG:root:Upload done.
INFO:root:Successfully uploaded file awt-1732013878-eooos6YW9R
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
/cvmfs/dune.opensciencegrid.org/products/dune/rucio/v35_4_0/NULL/lib/python3.9/site-packages/urllib3/connectionpool.py:1061: InsecureRequestWarning: Unverified HTTPS request is being made to host 'dune-rucio.fnal.gov'. Adding certificate verification is strongly advised. See: https://urllib3.readthedocs.io/en/1.26.x/advanced-usage.html#ssl-warnings
warnings.warn(
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /traces/ HTTP/1.1" 404 207
DEBUG:dogpile.lock:value creation lock <dogpile.cache.region.CacheRegion._LockWrapper object at 0x15319f31dac0> acquired
DEBUG:dogpile.lock:Calling creation function for previously expired value
DEBUG:dogpile.cache.region:Cache value generated in 0.000 seconds for key(s): "host_to_choose_choice['https://dune-rucio.fnal.gov']"
DEBUG:dogpile.lock:Released creation lock
DEBUG:urllib3.connectionpool:Resetting dropped connection: dune-rucio.fnal.gov
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "PUT /replicas HTTP/1.1" 200 0
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /dids/testpro/awt-uploads-202447/dids HTTP/1.1" 201 7
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "POST /replicas/list HTTP/1.1" 200 None
DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443
DEBUG:urllib3.connectionpool:https://dune-rucio.fnal.gov:443 "GET /dids/testpro/awt-uploads-202447/files HTTP/1.1" 200 None
--- Upload try 1/1
--- Rucio upload 1/1 returns 0
--- Replica check try 1/1
--- Dataset awt-uploads-202447 check try 1/1
--- Upload, replicas, and datasets checks passed
'justin-rucio-upload --rse SURFSARA --protocol davs --scope testpro --dataset awt-uploads-202447 awt-1732013878-eooos6YW9R --timeout 1200' returns 0
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1833661745/CN=173201387637
issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1833661745
identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1833661745
type : RFC compliant proxy
strength : 2048 bits
path : /home/awt-proxy.pem
timeleft : 167:51:00
key usage : Digital Signature, Key Encipherment, Key Agreement
=== VO dune extension information ===
VO : dune
subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk
issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms1.fnal.gov
attribute : /dune/Role=Production/Capability=NULL
attribute : /dune/Role=NULL/Capability=NULL
timeleft : 160:00:07
uri : voms1.fnal.gov:15042
===== Results =====
Download/upload commands:
xrdcp --force --nopbar --verbose $read_pfn downloaded.txt
justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset --timeout 1200 FILENAME
Use the wrapper job link on the page for the job on the justIN Dashboard to find the full log file, with errors from these commands
Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol
==awt== US_FNAL-T1 DUNE_CA_SFU 0 0 root://lcg-dunese1.sfu.computecanada.ca:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_CERN_EOS 0 0 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_ES_PIC 0 0 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_FR_CCIN2P3_DISK 0 0 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_IT_INFN_CNAF 0 0 root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_GLASGOW 0 0 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_LANCASTER_CEPH 0 0 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_UK_MANCHESTER_CEPH 0 0 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_US_BNL_SDCC 0 0 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 DUNE_US_FNAL_DISK_STAGE 0 0 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 NIKHEF 0 0 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 PRAGUE 0 0 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 QMUL 0 0 root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 RAL-PP 0 0 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 RAL_ECHO 0 0 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs
==awt== US_FNAL-T1 SURFSARA 0 0 root://penguin12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs