Re: [HTCondor-users] reason for suspended jobs
- Date: Tue, 24 Apr 2018 10:14:45 -0300 (BRT)
- From: Carlos Adean <carlosadean@xxxxxxxxxxxx>
- Subject: Re: [HTCondor-users] reason for suspended jobs
Hello,
----- Original Message -----
> From: "Ben Cotton" <bcotton@xxxxxxxxxxxxxxxxx>
> To: "HTCondor-Users Mail List" <htcondor-users@xxxxxxxxxxx>
> Sent: Friday, April 20, 2018 18:46:19
> Subject: Re: [HTCondor-users] reason for suspended jobs
>
> Hi Carlos,
>
> What are the values of SUSPEND and WANT_SUSPEND on your execute
> nodes?
>
CAN_RUN_WHOLE_MACHINE = SlotID == $(WHOLE_MACHINE_SLOT)
SINGLE_CORE_SLOTS_CLAIMED = ($(WHOLE_MACHINE_SLOT_STATE) =?= 'Claimed') < (Slot1_State =?= 'Claimed' ) + (Slot2_State =?= 'Claimed' ) + (Slot3_State =?= 'Claimed' ) + (Slot4_State =?= 'Claimed' ) + (Slot5_State =?= 'Claimed' ) + (Slot6_State =?= 'Claimed' ) + (Slot7_State =?= 'Claimed' ) + (Slot8_State =?= 'Claimed' ) + (Slot9_State =?= 'Claimed' ) + (Slot10_State =?= 'Claimed' ) + (Slot11_State =?= 'Claimed' ) + (Slot12_State =?= 'Claimed' ) + (Slot13_State =?= 'Claimed' ) + (Slot14_State =?= 'Claimed' ) + (Slot15_State =?= 'Claimed' ) + (Slot16_State =?= 'Claimed' ) + (Slot17_State =?= 'Claimed' ) + (Slot18_State =?= 'Claimed' ) + (Slot19_State =?= 'Claimed' ) + (Slot20_State =?= 'Claimed' ) + (Slot21_State =?= 'Claimed' ) + (Slot22_State =?= 'Claimed' ) + (Slot23_State =?= 'Claimed' ) + (Slot24_State =?= 'Claimed' )
SUSPEND = ($(SUSPEND)) || ( MY.CAN_RUN_WHOLE_MACHINE && ($(SINGLE_CORE_SLOTS_CLAIMED)) )
WANT_SUSPEND = ($(WANT_SUSPEND)) || ($(SUSPEND))
WHOLE_MACHINE_SLOT = ($(DETECTED_CORES)+1)
How do I interpret the options above?
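
For reference, a literal reading of the expressions above can be sketched in Python. This is only a rough model, not HTCondor's own evaluation: it assumes that 'Claimed' is meant as the string Claimed, that `=?=` behaves like a plain equality test that is false for a missing value, and that true/false count as 1/0 in the `<` comparison. The function names, the 24-core machine, and the `previous_suspend` parameter (standing in for whatever `$(SUSPEND)` expanded to before this line) are hypothetical.

```python
def single_core_slots_claimed(slot_states, whole_machine_state):
    # Mirrors SINGLE_CORE_SLOTS_CLAIMED:
    # (whole-machine slot claimed, as 0/1) < (number of listed slots claimed)
    whole_claimed = int(whole_machine_state == "Claimed")
    listed_claimed = sum(int(state == "Claimed") for state in slot_states)
    return whole_claimed < listed_claimed


def suspend(slot_id, whole_machine_slot, slot_states, whole_machine_state,
            previous_suspend=False):
    # Mirrors SUSPEND: the earlier SUSPEND value, OR
    # (this slot is the whole-machine slot AND single-core slots are claimed)
    can_run_whole_machine = slot_id == whole_machine_slot
    return previous_suspend or (
        can_run_whole_machine
        and single_core_slots_claimed(slot_states, whole_machine_state)
    )


def want_suspend(previous_want_suspend, suspend_value):
    # Mirrors WANT_SUSPEND: the earlier WANT_SUSPEND value OR SUSPEND
    return previous_want_suspend or suspend_value


# Assumed machine: 24 detected cores, so WHOLE_MACHINE_SLOT = 24 + 1 = 25.
detected_cores = 24
whole_machine_slot = detected_cores + 1

# Slot1 and Slot2 claimed, Slot3..Slot24 unclaimed, whole-machine slot claimed.
states = ["Claimed", "Claimed"] + ["Unclaimed"] * 22

print(suspend(25, whole_machine_slot, states, "Claimed"))  # True
print(suspend(1, whole_machine_slot, states, "Claimed"))   # False
print(suspend(25, whole_machine_slot, ["Unclaimed"] * 24, "Claimed"))  # False
```

Under these assumptions, only the slot whose SlotID equals WHOLE_MACHINE_SLOT can be suspended by the added clause, and only while the count of claimed Slot1..Slot24 entries exceeds the 0/1 value for the whole-machine slot.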
Thanks,
--
Carlos Adean
IT Team
linea.gov.br
skype: carlosadean