Thanks Todd.
I changed just EXECUTE value only on each node/master and
runned restart systemctl too. Now, I have a nice response
from condor_config_val. The value of the variable was
updated.
BUTTTTTT........... after new submits, jobs quickly
placed in a hold state and condor shows the following
reasons:
Condor_q -hold:
ID OWNER HELD_SINCE HOLD_REASON
589.0 me 10/26 10:42 Error from slot1@node2:
Failed to execute
[nfs_location]/execute/dir_2707175/condor_exec.exe' with
arguments 999000: (errno=13: 'Permission denied')
589.1 me 10/26 10:42 Error from slot2@node2:
Failed to execute
[nfs_location]/execute/dir_2707176/condor_exec.exe' with
arguments 999001: (errno=13: 'Permission denied')
Exports FILE on master:
[nfs_location]
node2(rw,sync,root_squash,no_subtree_check)
Fstab FILE on node2:
master:[nfs_location] [nfs_local_location] nfs
root_squash,rw,sync,hard,intr 0 0
Permissions on [nfs_location]:
drwxrwxrwt (1777)
Regards,
Ale
On 10/23/2021 7:03 AM, Alejandro AcuÃa wrote:
Hi all,
Linux help needed: I wonder if is possible to move
just EXECUTE path (generally placed in /var or
LOCAL_DIR) to a different location with better
hardware requirements (basically more disk space to
execution files).
If I change value of this variable at condor_config
(master config) and run condor_reconfig, the nodes
refresh their local config correctly but when I submit
a test job, it stay at held status forever.
Condor_q -analyze reasons: lost logs by references
to their original locations (which I never change
because I don't need that)
I prefer to change just execution place because if
I change more important global variables as e.g.
LOCAL_DIR, I am forced to configure more variables.
Is there any "elegant" way of move this location
post installation that reconfigure execution?
Thanks!!!
Alex
Hi Alex,
You can change the location of EXECUTE alone; no need to
change LOCAL_DIR.
Some advice:
You cannot change the location of EXECUTE on-the-fly with just
condor_reconfig. You will need to restart HTCondor on each
node where you change the location of EXECUTE, by either
restarting the HTCondor service (i.e. systemctrl restart
condor) or with "condor_restart -master". If you are running
HTCondor v9.x or above, then the upon restarting, the
condor_master will create the EXECUTE directory in the new
location for you with the proper ownership/permissions. If
you are running an older version of HTCondor, then you will
need to create the EXECUTE directory yourself with the same
ownership/permissions as the original EXECUTE directory.
It is advised you keep EXECUTE on a local filesystem If you
are trying to relocate EXECUTE onto a shared filesystem, such
as an NFS volume with root-squash enabled, you will need to
chown EXECUTE to permissions 1777. See details here:
https://opensciencegrid.atlassian.net/browse/HTCONDOR-73
Hope the above helps,
Todd