|
One of the nice things about HTCondor is that essentially any version can interoperate seamlessly with any other. Any ClassAd attribute that is not recognized by an older version is simply ignored. This makes
upgrades very simple no matter how far youâre jumping. In addition, when a running condor_master notices when its own binary has changed, and automatically triggers a restart of itself and the daemons for which itâs responsible. Michael V Pelletier Principal Technologist
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx>
On Behalf Of Vechinski, Douglas Iâm trying to find guidance on the process of upgrading a Condor pool. At the moment lets assume upgrading between feature releases of the same major version such
as 25.âx.ây
-> 25.âx+a.ây+b.
The RPMSs are downloaded to a local Iâm trying to find guidance on the process of upgrading a Condor pool. At the moment lets assume upgrading between feature releases of the same major version such as 25.x.y ->
25.x+a.y+b. The RPMSs are downloaded to a local repository and at some point every couple of months a general update process is applied to the machines for other packages as well (currently using RHEL 8 here). Letâs take 2 cases: A) the condor pool is idle,
i.e. no jobs are actively running, B) the condor pool currently has jobs running. Since things are upgraded via RPM packages I assume the executables simply get overwritten. Can condor_master be running on a machine and be upgraded or should it be stopped? Is
there any particular order (machine) that the Condor upgrades should occur? That is should the collector/negotiator be done first before any submit and/or execution machines or does it matter? Can an execute machine that is actively running jobs be updated
live. If so, are there potential consequences? The jobs that we typically have running do not have the ability to be checkpointed.
|