|
As Zach McGrew said, you canât feed the contents of the offline.ads file to condor_power, so the errors youâre seeing there are expected. Instead, youâd provide each ad from running 'condor_status -offline -longâ.
Can you provide more detail on condor_rooster not working?
I suggest adding this line to your configuration, which will cause condor_rooster to write more details to its RoosterLog file:
ROOSTER_DEBUG = D_FULLDEBUG
With the more detailed logging, it will write these messages each time it looks for machines to wake up:
Cock-a-doodle-doo! (Time to look for machines to wake up.)
Got ### startd ads matching ROOSTER_UNHIBERNATE=...
Sending wakeup call to XXXX.
Youâll see that last line only if the count of startd ads is greater than 0.
- Jaime
On Mar 5, 2026, at 12:42âAM, Valerio Bellizzomi <valerio@xxxxxxxxxx> wrote:
On
Thu, 2026-02-26 at 19:25 +0100, Valerio Bellizzomi wrote:
On Thu, 2026-02-26 at 17:05 +0000, Zach McGrew wrote:
The `-i` tells condor_power to read a classad from stdin and not a file. Removing the `-i` lets you specify a file to read from instead. It's a neat trick where you can build your own tiny classad to wake up a machine similar to what
condor_rooster does. You can use something like this to wakeup machines on demand:
printf "MyAddress = \"<${the_ip}:9618>\"\nHardwareAddress = \"${hwaddr}\"\nSubnetMask = \"${subnet}\"\n" | condor_power -i
Presumably your offline.ads is set by "COLLECTOR_PERSISTENT_AD_LOG" in which case it's not a classad, but a little database like file that describes the slots that the collector was aware of but stopped talking to it for some reason or another. These should
be visible with "condor_status -offline" or "condor_status -absent" depending on how they got entered. The file itself is used to restore those slots into memory when the collector restarts (i.e. restarting the collector means you no longer forget about the
EPs that are powered off). You're not meant to pass this file as is to condor_power.
Yes exact, my central-manager config is as follows:
ABSENT_REQUIREMENTS = True
EXPIRE_INVALIDATED_ADS = True
COLLECTOR_PERSISTENT_AD_LOG = $(SPOOL)/offline.ads
VALID_SPOOL_FILES = $(SPOOL)/offline.ads
Condor_rooster is supposed to call condor_power to wake up a machine,
and in this case the documentation says that the default value is
condor_power -d -i:
https://urldefense.com/v3/__https://htcondor.readthedocs.io/en/latest/admin-manual/configuration-macros.html*condor-rooster-configuration-file-macros__;Iw!!Mak6IKo!MdklhxhrHPvivWYvDVmvgh8vG_ft_o95Vma7QvRCDoz6e8f0roYtSMGdSJLkPhgYgHYWdImPZ9SnTAOgPw$
ROOSTER_WAKEUP_CMDÂ
A string representing the command line invoked by condor_rooster
that is to wake up a machine. The command and any arguments should be
enclosed in double quote marks, the same as arguments syntax in an
HTCondor submit description file. The default value is
â$(BIN)/condor_power -d -iâ. The command is expected to read from its
standard input a ClassAd representing the offline machine.
But this configuration does not work for me.
Follow-up:
The
following command wakes up the ep:
condor_power
-d -s 255.255.255.255 -m b8:af:6f:84:5c:67
03/04/26
19:53:14 Can't find Name in classad for startd
03/04/26
19:53:14 Can't find CondorVersion in classad for startd
03/04/26
19:53:14 Can't find CondorPlatform in classad for startd
03/04/26
19:53:14 Can't find Machine in classad for startd
Packet
sent.
the
ep boots and the job that was waiting in idle state now is in
running
state.
however
I don't know how to automate this since rooster doesn't seem to
invoke
condor_power at intervals of 300 sec like it is specified in my
configuration.
Looking
at the condor_power code, "error in class ad" is actually
E_CLASSAD
= -9, while the errno = -1 indicates something else.
-Zach
________________________________________
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Valerio Bellizzomi <valerio@xxxxxxxxxx>
Sent: Thursday, February 26, 2026 6:01 AM
To: htcondor-users@xxxxxxxxxxx
Subject: Re: [HTCondor-users] Error with condor_power
On Thu, 2026-02-26 at 13:20 +0000, Pelletier, Michael V via HTCondor-
users wrote:
Double-check the offline.ads file for that â107 1â string â looks like itâs just carping about a syntax error.
Michael V Pelletier
Principal Technologist
C: +1 339.293.9149
michael.v.pelletier@xxxxxxx
Thank you, the file is generated automatically by the collector I
think, and I have attempted to edit the file removing the initial
numbers, but still the same error.
From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx>On Behalf Of Valerio Bellizzomi
Sent: Thursday, February 26, 2026 6:19 AM
To: HTCondor-Users List <htcondor-users@xxxxxxxxxxx>
Subject: [HTCondor-users] Error with condor_power
Hello, has anyone had the same error? # condor_power -d -i /var/spool/condor/offline.âads 02/26/26 12:â16:â41 failed to create classad; bad expr = '107 1 CreationTimestamp 1772031128' condor_power: error in class-ad (errno = -1). _______________________________________________
Hello,
has anyone had the same error?
# condor_power -d -i /var/spool/condor/offline.ads
02/26/26 12:16:41 failed to create classad; bad expr = '107 1
CreationTimestamp 1772031128'
condor_power: error in class-ad (errno = -1).
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
The archives can be found at: https://urldefense.com/v3/__https://urldefense.us/v2/url?u=https-3A__www-2Dauth.cs.wisc.edu_lists_htcondor-2Dusers_&d=DwICAg&c=MASr1KIcYm9UGIT-jfIzwQg1YBeAkaJoBtxV_4o83uQ&r=4PJgb1eyyvhzSV4fRwSECGK3jb50YP8vZUAedXybzgaNykar_o0SxKOUPkRHE0WG&m=d5NrZQlaDxHYRhD0JmcYYR_2Y71kE5GdT4M9LJHJGSIX5qUM9ckUxOSvtqr4zyH4&s=LL4IY0Fx3TIK2_DP31KVvC3KT1v0J-aCqRyiIkqDa0w&e=__;!!Mak6IKo!MdklhxhrHPvivWYvDVmvgh8vG_ft_o95Vma7QvRCDoz6e8f0roYtSMGdSJLkPhgYgHYWdImPZ9Qk6e26dQ$
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/
_______________________________________________
HTCondor-users mailing list
To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
The archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/
_______________________________________________
HTCondor-users
mailing list
To
unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx with a
subject:
Unsubscribe
The
archives can be found at: https://www-auth.cs.wisc.edu/lists/htcondor-users/
|