21 July 2025: This instance at RAL is read-only. Please do not try submitting new workflows for now.
Jobsub ID 204415.0@justin-prod-sched02.dune.hep.ac.uk
Jobsub ID | 204415.0@justin-prod-sched02.dune.hep.ac.uk | |
Workflow Testing | Yes | |
Workflow ID | 1 | |
Stage ID | 1 | |
User name | amcnab@fnal.gov | |
HTCondor Group | group_dune.prod_mcsim | |
Requested | Processors | 1 |
GPU | No | |
RSS bytes | 1073741824 (1024 MiB) | |
Wall seconds limit | 3600 (1 hours) | |
Submitted time | 2025-05-15 06:13:25 | |
Site | NL_NIKHEF | |
Entry | VIRGO_NL_NIKHEF_klomp | |
Last heartbeat | 2025-05-15 06:18:13 | |
From worker node | Hostname | wn-lot-033.farm.nikhef.nl |
cpuinfo | AMD EPYC 7702P 64-Core Processor | |
OS release | Scientific Linux release 7.9 (Nitrogen) | |
Processors | 1 | |
RSS bytes | 1073741824 (1024 MiB) | |
Wall seconds limit | 129600 (36 hours) | |
GPU | ||
Inner Apptainer? | True | |
Job state | outputting_failed | |
Allocator name | justin-allocator-pro.dune.hep.ac.uk | |
Started | 2025-05-15 06:14:44 | |
Input files | ||
Jobscript | Exit code | 0 |
Real time | 3m (194s) | |
CPU time | 0m (18s = 9%) | |
Max RSS bytes | 40296448 (38 MiB) | |
Outputting started | 2025-05-15 06:17:59 | |
Output files | ||
Finished | 2025-05-15 06:18:13 | |
List job events Wrapper job log |
Jobscript log (last 10,000 characters)
.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) --- Upload try 1/1 --- Rucio upload 1/1 fails: Cannot connect to the Rucio server. --- Exit with 99 'justin-rucio-upload --rse RAL_ECHO --protocol davs --scope testpro --dataset awt-uploads-202519 awt-1747289689-2RgUslcKpS --timeout 1200' returns 99 --------------------------------------------------------------------- NL_NIKHEF SURFSARA davs root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt Run: [FATAL] Auth failed: No protocols left to try (source) 'xrdcp --force --nopbar --verbose root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt downloaded.txt' returns 52 Token not found metacat file declare returns 1 GFAL_CONFIG_DIR: GFAL_PLUGIN_DIR: justin-rucio-upload attempt 1 DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:54,793 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:55,182 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) DEBUG:urllib3.connectionpool:Starting new HTTPS connection (3): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:55,561 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) --- Upload try 1/1 --- Rucio upload 1/1 fails: Cannot connect to the Rucio server. --- Exit with 99 'justin-rucio-upload --rse SURFSARA --protocol davs --scope testpro --dataset awt-uploads-202519 awt-1747289689-FNGsBc3JxL --timeout 1200' returns 99 --------------------------------------------------------------------- NL_NIKHEF T3_US_NERSC davs root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt Run: [FATAL] Auth failed: No protocols left to try (source) 'xrdcp --force --nopbar --verbose root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt downloaded.txt' returns 52 Token not found metacat file declare returns 1 GFAL_CONFIG_DIR: GFAL_PLUGIN_DIR: justin-rucio-upload attempt 1 DEBUG:urllib3.connectionpool:Starting new HTTPS connection (1): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:58,710 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) DEBUG:urllib3.connectionpool:Starting new HTTPS connection (2): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:59,085 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) DEBUG:urllib3.connectionpool:Starting new HTTPS connection (3): dune-rucio.fnal.gov:443 [31;1m2025-05-15 08:17:59,463 ERROR ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)')))[0m ERROR:baseclient:ConnectionError: HTTPSConnectionPool(host='dune-rucio.fnal.gov', port=443): Max retries exceeded with url: /auth/x509_proxy (Caused by SSLError(SSLError(1, '[SSL: SSLV3_ALERT_CERTIFICATE_EXPIRED] sslv3 alert certificate expired (_ssl.c:1129)'))) --- Upload try 1/1 --- Rucio upload 1/1 fails: Cannot connect to the Rucio server. --- Exit with 99 'justin-rucio-upload --rse T3_US_NERSC --protocol davs --scope testpro --dataset awt-uploads-202519 awt-1747289689-1Y5jNwQM37 --timeout 1200' returns 99 subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1528843534/CN=174728968489 issuer : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1528843534 identity : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk/CN=1528843534 type : RFC compliant proxy strength : 2048 bits path : /home/awt-proxy.pem timeleft : 167:56:45 key usage : Digital Signature, Key Encipherment, Key Agreement === VO dune extension information === VO : dune subject : /C=UK/O=eScience/OU=Manchester/L=HEP/CN=justin-jobs-production.dune.hep.ac.uk issuer : /DC=org/DC=incommon/C=US/ST=Illinois/O=Fermi Research Alliance/CN=voms1.fnal.gov attribute : /dune/Role=Production/Capability=NULL attribute : /dune/Role=NULL/Capability=NULL timeleft : 0:00:00 uri : voms1.fnal.gov:15042 ===== Results ===== Download/upload commands: xrdcp --force --nopbar --verbose $read_pfn downloaded.txt echo '{"namespace":"testpro","name":"FILENAME","size":0}' >tmp.json metacat file declare --json -f tmp.json "dune:all" justin-rucio-upload --rse $rse_name --protocol $write_protocol --scope testpro --dataset awt-uploads-202519 --timeout 1200 FILENAME Use the wrapper job link on the page for the job on the justIN Dashboard to find the full log file, with errors from these commands Each line: $JUSTIN_SITE_NAME $rse_name $download_retval $upload_retval $read_pfn $write_protocol ==awt== NL_NIKHEF DUNE_CA_SFU 0 99 root://lcg-dunese1.sfu.computecanada.ca:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_CERN_EOS 0 99 root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_ES_PIC 52 99 root://xrootd.pic.es:1094/pnfs/pic.es/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_FR_CCIN2P3_DISK 52 99 root://ccxrootdegee.in2p3.fr:1094/pnfs/in2p3.fr/data/dune/disk/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_IT_INFN_CNAF 51 99 root://xrootd-archive.cr.cnaf.infn.it:1096//dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_UK_GLASGOW 52 99 root://cephc02.gla.scotgrid.ac.uk:1094//cephfs/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_UK_LANCASTER_CEPH 52 99 root://xgate.hec.lancs.ac.uk:1094//cephfs/grid/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_UK_MANCHESTER_CEPH 52 99 root://meitner.tier2.hep.manchester.ac.uk:1094//cephfs/experiments/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_US_BNL_SDCC 52 99 root://dcdndoor.sdcc.bnl.gov:1094//pnfs/sdcc.bnl.gov/data/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF DUNE_US_FNAL_DISK_STAGE 52 99 root://fndca1.fnal.gov:1094/pnfs/fnal.gov/usr/dune/persistent/staging/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF NIKHEF 52 99 root://dune.dcache.nikhef.nl:1094/pnfs/nikhef.nl/data/dune/generic/rucio/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF PRAGUE 54 99 root://golias100.farm.particle.cz:1094/dpm/farm.particle.cz/home/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF QMUL 52 99 root://xrootd1.esc.qmul.ac.uk:1094//dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF RAL-PP 52 99 root://mover.pp.rl.ac.uk:1094/pnfs/pp.rl.ac.uk/data/dune/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF RAL_ECHO 52 99 root://xrootd.echo.stfc.ac.uk:1094/dune:/protodune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF SURFSARA 52 99 root://otter12.grid.surfsara.nl:21094/pnfs/grid.sara.nl/data/dune/disk/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs ==awt== NL_NIKHEF T3_US_NERSC 52 99 root://dtn14.nersc.gov:1094//global/cfs/cdirs/m3249/dune/RSE/testpro/bb/7f/awt-download-2023-03-07-01.txt davs