justIN           Dashboard       Workflows       Jobs       AWT       Sites       Storages       Docs       Login

Jobsub ID 393982.2@justin-prod-sched01.dune.hep.ac.uk

Jobsub ID393982.2@justin-prod-sched01.dune.hep.ac.uk
Workflow ID6627
Stage ID1
User namehiguera@fnal.gov
HTCondor Groupgroup_dune.prod_mcsim
RequestedProcessors1
GPUNo
RSS bytes4194304000 (4000 MiB)
Wall seconds limit80000 (22 hours)
Submitted time2025-05-08 15:05:38
SiteUK_Edinburgh
EntryDUNE_UK_SGridECDF_ce1
Last heartbeat2025-05-08 15:09:11
From worker nodeHostnamenode2b23.ecdf.ed.ac.uk
cpuinfoIntel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz
OS releaseScientific Linux release 7.9 (Nitrogen)
Processors1
RSS bytes4194304000 (4000 MiB)
Wall seconds limit171000 (47 hours)
GPU
Inner Apptainer?True
Job statejobscript_error
Allocator namejustin-allocator-pro.dune.hep.ac.uk
Started2025-05-08 15:06:41
Input fileshd-protodune:np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5
JobscriptExit code2
Real time0m (0s)
CPU time0m (0s = 0%)
Max RSS bytes0 (0 MiB)
Outputting started 
Output files
Finished2025-05-08 15:09:11
Saved logsjustin-logs:393982.2-justin-prod-sched01.dune.hep.ac.uk.logs.tgz
List job events     Wrapper job log

Jobscript log (last 10,000 characters)

.10-2.17 -z /cvmfs/dune.opensciencegrid.org/products/dune -q e26:prof
DUNERECO_VERSION=v10_05_00d00
DUNEDETDATAFORMATS_INC=/cvmfs/dune.opensciencegrid.org/products/dune/dunedetdataformats/v4_4_5/include
SETUP_DUNESW=dunesw v10_05_00d00 -f Linux64bit+3.10-2.17 -z /cvmfs/dune.opensciencegrid.org/products/dune -q e26:prof
PROTODUNEANA_INC=/cvmfs/dune.opensciencegrid.org/products/dune/protoduneana/v10_05_00d00/include
DUNEDATAPREP_LIB=/cvmfs/dune.opensciencegrid.org/products/dune/dunedataprep/v10_05_00d00/slf7.x86_64.e26.prof/lib
DUNERECO_DIR=/cvmfs/dune.opensciencegrid.org/products/dune/dunereco/v10_05_00d00
Installing yaml
Collecting pyyaml
  Downloading PyYAML-6.0.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (737 kB)
Installing collected packages: pyyaml
Successfully installed pyyaml-6.0.2
WARNING: You are using pip version 21.2.4; however, version 25.1.1 is available.
You should consider upgrading via the '/cvmfs/larsoft.opensciencegrid.org/products/python/v3_9_15/Linux64bit+3.10-2.17/bin/python3 -m pip install --upgrade pip' command.
Done
Will use justin-get-file
input file: np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5
jobsub_id: 393982.2@justin-prod-sched01.dune.hep.ac.uk
Getting run subrun from hd-protodune:np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5
Traceback (most recent call last):
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connection.py", line 169, in _new_conn
    conn = connection.create_connection(
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/util/connection.py", line 96, in create_connection
    raise err
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/util/connection.py", line 86, in create_connection
    sock.connect(sa)
TimeoutError: [Errno 110] Connection timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connectionpool.py", line 699, in urlopen
    httplib_response = self._make_request(
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connectionpool.py", line 382, in _make_request
    self._validate_conn(conn)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connectionpool.py", line 1010, in _validate_conn
    conn.connect()
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connection.py", line 353, in connect
    conn = self._new_conn()
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connection.py", line 181, in _new_conn
    raise NewConnectionError(
urllib3.exceptions.NewConnectionError: <urllib3.connection.HTTPSConnection object at 0x14736a4c6550>: Failed to establish a new connection: [Errno 110] Connection timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/adapters.py", line 439, in send
    resp = conn.urlopen(
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/connectionpool.py", line 755, in urlopen
    retries = retries.increment(
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/urllib3/util/retry.py", line 574, in increment
    raise MaxRetryError(_pool, url, error or ResponseError(cause))
urllib3.exceptions.MaxRetryError: HTTPSConnectionPool(host='metacat.fnal.gov', port=9443): Max retries exceeded with url: /dune_meta_prod/app/data/file?with_metadata=yes&with_provenance=yes&with_datasets=no&name=np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5&namespace=hd-protodune (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x14736a4c6550>: Failed to establish a new connection: [Errno 110] Connection timed out'))

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/cvmfs/larsoft.opensciencegrid.org/products/python/v3_9_15/Linux64bit+3.10-2.17/lib/python3.9/runpy.py", line 197, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/cvmfs/larsoft.opensciencegrid.org/products/python/v3_9_15/Linux64bit+3.10-2.17/lib/python3.9/runpy.py", line 87, in _run_code
    exec(code, run_globals)
  File "/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/fa49d1d4aaf0d5ceb75154614adc48877f735a0c/beam_job_utils.py", line 369, in <module>
    routines[args.routine](args)
  File "/cvmfs/fifeuser3.opensciencegrid.org/sw/dune/fa49d1d4aaf0d5ceb75154614adc48877f735a0c/beam_job_utils.py", line 46, in get_run_subrun
    md = mc.get_file(did=args.i, with_metadata=True)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/metacat/v4_0_2/NULL/lib/python3.9/site-packages/metacat/webapi/webapi.py", line 1259, in get_file
    return self.get_json(url)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/metacat/v4_0_2/NULL/lib/python3.9/site-packages/metacat/webapi/webapi.py", line 206, in get_json
    return self.unpack_json_data(self.send_request("get", uri_suffix, headers=headers, stream=True))
  File "/cvmfs/dune.opensciencegrid.org/products/dune/metacat/v4_0_2/NULL/lib/python3.9/site-packages/metacat/webapi/webapi.py", line 154, in send_request
    self.LastResponse = response = self.retry_request(method, url, headers=headers, **args)
  File "/cvmfs/dune.opensciencegrid.org/products/dune/metacat/v4_0_2/NULL/lib/python3.9/site-packages/metacat/webapi/webapi.py", line 131, in retry_request
    response = requests.get(url, timeout=self.Timeout, **args)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/api.py", line 75, in get
    return request('get', url, params=params, **kwargs)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/api.py", line 61, in request
    return session.request(method=method, url=url, **kwargs)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/sessions.py", line 542, in request
    resp = self.send(prep, **send_kwargs)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/sessions.py", line 655, in send
    r = adapter.send(request, **kwargs)
  File "/cvmfs/fermilab.opensciencegrid.org/products/common/prd/python_future_six_request/v1_3_1/Linux64bit-3-10-2-17-python3-9/requests/adapters.py", line 516, in send
    raise ConnectionError(e, request=request)
requests.exceptions.ConnectionError: HTTPSConnectionPool(host='metacat.fnal.gov', port=9443): Max retries exceeded with url: /dune_meta_prod/app/data/file?with_metadata=yes&with_provenance=yes&with_datasets=no&name=np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5&namespace=hd-protodune (Caused by NewConnectionError('<urllib3.connection.HTTPSConnection object at 0x14736a4c6550>: Failed to establish a new connection: [Errno 110] Connection timed out'))
error in get_run_subrun

Found sps data for run : /cvmfs/fifeuser3.opensciencegrid.org/sw/dune/fa49d1d4aaf0d5ceb75154614adc48877f735a0c/spillrun031036.csv
nevents: -1
input flag: -i root://eospublic.cern.ch:1094//eos/experiment/neutplatform/protodune/dune/hd-protodune/0e/6f/np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5 --parent hd-protodune:np04hd_raw_run029918_1307_dataflow0_datawriter_0_20241012T194339.hdf5
[2025-05-08 16:08:54.112403 +0100][Debug  ][Utility           ] Initializing xrootd client version: v5.5.5
[2025-05-08 16:08:54.112610 +0100][Debug  ][Utility           ] Unable to process user config file: [ERROR] OS Error: no such file or directory
[2025-05-08 16:08:54.112744 +0100][Debug  ][PlugInMgr         ] Initializing plug-in manager...
[2025-05-08 16:08:54.112751 +0100][Debug  ][PlugInMgr         ] No default plug-in, loading plug-in configs...
[2025-05-08 16:08:54.112755 +0100][Debug  ][PlugInMgr         ] Processing plug-in definitions in /etc/xrootd/client.plugins.d...
[2025-05-08 16:08:54.112916 +0100][Debug  ][PlugInMgr         ] Trying to disable plug-in for '*'
[2025-05-08 16:08:54.112936 +0100][Debug  ][PlugInMgr         ] Processing plug-in definitions in /home/.xrootd/client.plugins.d...
[2025-05-08 16:08:54.112943 +0100][Debug  ][PlugInMgr         ] Unable to process directory /home/.xrootd/client.plugins.d: [ERROR] OS Error: no such file or directory
usage: beam_job_utils.py [-h] [-i I] [--yaml YAML] [-o O] [--json JSON]
                         [--run RUN] [--subrun SUBRUN]
                         [--exclude EXCLUDE [EXCLUDE ...]]
                         [--overrides OVERRIDES [OVERRIDES ...]]
                         [--parent PARENT] [--event EVENT] [--nevents NEVENTS]
                         [--past_fcls PAST_FCLS [PAST_FCLS ...]]
                         [--past_apps PAST_APPS [PAST_APPS ...]]
                         [--past_vers PAST_VERS [PAST_VERS ...]]
                         {get_run_subrun,make_metadata,run_job,get_artroot_nevents}
beam_job_utils.py: error: argument --run: expected one argument
Error running. Exiting with 2
justIN time: 2025-05-23 00:20:07 UTC       justIN version: 01.03.01