Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [HTCondor-users] DENY_WRITE and exclude execute node temporarily
- Date: Mon, 26 Aug 2019 09:43:54 +0200 (CEST)
- From: Xiaomei NIU <xiaomei.niu@xxxxxxxxxxx>
- Subject: Re: [HTCondor-users] DENY_WRITE and exclude execute node temporarily
Just a little update: to my previous question :
> Another question is: when the node is excluded, what will happens to the jobs running before this change? Will they finish properly?
Mytest showed the simple running job was finished properly.
Xiaomei
----- Original Message -----
From: "Xiaomei NIU" <xiaomei.niu@xxxxxxxxxxx>
To: "HTCondor-Users Mail List" <htcondor-users@xxxxxxxxxxx>
Sent: Friday, 23 August, 2019 11:02:08
Subject: Re: [HTCondor-users] DENY_WRITE and exclude execute node temporarily
Hello,
I test with DENY_ADVERTISE_STARTD, it works, good news!
When I pass through the doc of security section and this thread https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=HowToBanMachine,
I understand it as: not definded ADVERTISE_STARTD will fall back to DAEMON, and not definded DAEMON will fall back to WRITE, there is an example of exclude machine from sched by DENY_WRITE, that is why I chose to try DENY_WRITE. But may somethere, the daemon interactions, our local configigurations...
If it is possible to update this https://htcondor-wiki.cs.wisc.edu/index.cgi/wiki?p=HowToBanMachine, it could be helpful to the people like me :)
Thanks for your guys, have a nice day
Xiaomei
----- Original Message -----
From: "Jaime Frey" <jfrey@xxxxxxxxxxx>
To: "HTCondor-Users Mail List" <htcondor-users@xxxxxxxxxxx>
Sent: Thursday, 22 August, 2019 20:28:36
Subject: Re: [HTCondor-users] DENY_WRITE and exclude execute node temporarily
> On Aug 22, 2019, at 5:12 AM, Xiaomei NIU <xiaomei.niu@xxxxxxxxxxx> wrote:
>
> I am testing how to exclude some execut nodes from condor pool temporarily under condor 8.9. It is for maintenance on execute nodes.
> From the doc and FFAQ, I chose to teste with DENY_WRITE on the central manager where there is NEGOTIATOR, COLLECTOR...
>
> Here is my setting:
>
> cat /etc/condor-ce/config.d/99_exclude.config
> DENY_WRITE = $(DENY_WRITE), tbcondor05.in2p3.fr
>
> then i run condor_reconfig -full on this machine,
>
> But one day after the change, this machine is always available when I run condor_status tbcondor05
>
> I also tried with
> DENY_WRITE = $(DENY_WRITE), tbcondor05.in2p3.fr, condor_pool@$(UID_DOMAIN)/tbcondor05.in2p3.fr, root@$(UID_DOMAIN)/tbcondor05.in2p3.fr
>
> Same results.
>
> I didn't try HOSTDENY_WRITE, I think DENY_WRITE is the higher lever?
> This machine is allowed under: ALLOW_WRITE, COLLECTOR.ALLOW_ADVERTISE_MASTER COLLECTOR.ALLOW_ADVERTISE_STARTD
> But I suppose DEBY_WRITE has the higher priority?
>
>
> Another question is: when the node is excluded, what will happens to the jobs running before this change? Will they finish properly?
>
> Any help is welcome
Try setting DENY_ADVERTISE_STARTD and DENY_ADVERTISE_MASTER to the hostname of the machine you want to exclude.
ADVERTISE_STARTD and ADVERTISE_MASTER are the permission levels for an execute node to join a pool (that is, advertise to the collector). If those are unset in your configuration files, then the Condor daemons will use settings for DAEMON or WRITE level as a fallback.
Thanks and regards,
Jaime Frey
UW-Madison HTCondor Project
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
The archives can be found at:
https://lists.cs.wisc.edu/archive/htcondor-users/