
Re: [HTCondor-users] Problem with ALICE jobs on AL9



On Fri, 2024-06-14 at 23:03 +0000, Maarten Litmaath wrote:
> 
> Hi Greg,
> ALICE jobs run fine on any HTCondor version used elsewhere.
> The use of cgroups v2 can be configured per site and is off by
> default.
> 
> 
> 
> 
> From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf
> of Greg Thain via HTCondor-users <htcondor-users@xxxxxxxxxxx>
> Sent: Friday, June 14, 2024 7:05 PM
> To: htcondor-users@xxxxxxxxxxx <htcondor-users@xxxxxxxxxxx>
> Cc: Gregory Thain <gthain@xxxxxxxxxxx>
> Subject: Re: [HTCondor-users] Problem with ALICE jobs on AL9
> 
> On 6/14/24 02:42, Alexandr Mikula wrote:
> > Hi list,
> > we are in the middle of migrating our infrastructure from CentOS 7
> > to Alma Linux 9.
> > Most of the infra is on CC7 with condor 9.* and we have one testing
> > WN cluster with AL9 on condor 23.
> > So far the ATLAS workload works fine on the new cluster, but the
> > ALICE jobs land on the WN and fail right away without producing
> > any output.
> 
> Hi Alexandr:
> 
> We've worked directly with ALICE researchers, and I know that their
> jobs will run on Alma 9 systems with cgroup v2 on HTCondor 23.7. I
> believe their jobs are cgroup v2 aware, so they may only work on
> cgroup v2 systems.
> 
> 
> -greg
> 
> _______________________________________________
> HTCondor-users mailing list
> To unsubscribe, send a message to htcondor-users-request@xxxxxxxxxxx
> with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/htcondor-users
> 
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/htcondor-users/
Hi Greg and Maarten,
It appears that the problem was caused by an HTCondor version mismatch
between the WN and the Collector/Negotiator (9.* vs 23.*).
Thanks for your time.
AM
-- 
Alexandr Mikula
Oddělení síťování a výpočetní techniky & Výpočetní středisko
Fyzikální ústav Akademie věd České republiky, v. v. i.
Institute of Physics of the Czech Academy of Sciences 
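
For anyone hitting the same symptom, here is a minimal sketch of how such a mismatch could be spotted. It assumes you have already collected each daemon's CondorVersion string, for example from something like `condor_status -any -af Name CondorVersion`; how you gather them is up to you. The host names and version strings in the example are hypothetical, and the helper names (parse_version, group_by_major, has_major_mismatch) are made up for illustration, not part of HTCondor.

```python
import re
from collections import defaultdict

# Picks the numeric version out of a CondorVersion string, which is
# assumed to look like "$CondorVersion: 23.7.2 2024-05-16 BuildID: ... $".
_VERSION_RE = re.compile(r"\$CondorVersion:\s*(\d+)\.(\d+)\.(\d+)")


def parse_version(condor_version):
    """Return (major, minor, patch) from a CondorVersion string."""
    match = _VERSION_RE.search(condor_version)
    if match is None:
        raise ValueError(f"unrecognised CondorVersion string: {condor_version!r}")
    return tuple(int(part) for part in match.groups())


def group_by_major(daemons):
    """Map each major version to the sorted names of daemons running it.

    `daemons` is a dict of daemon/host name -> CondorVersion string.
    """
    groups = defaultdict(list)
    for name, version_string in daemons.items():
        major = parse_version(version_string)[0]
        groups[major].append(name)
    return {major: sorted(names) for major, names in groups.items()}


def has_major_mismatch(daemons):
    """True if the pool contains daemons from more than one major series."""
    return len(group_by_major(daemons)) > 1


if __name__ == "__main__":
    # Hypothetical pool: names and version strings are invented.
    pool = {
        "cm.example.org": "$CondorVersion: 9.0.17 2022-10-04 BuildID: 1 $",
        "wn1.example.org": "$CondorVersion: 23.7.2 2024-05-16 BuildID: 2 $",
    }
    if has_major_mismatch(pool):
        for major, names in sorted(group_by_major(pool).items()):
            print(f"HTCondor {major}.*: {', '.join(names)}")
```

This only flags differing major series; whether a given pair of versions is actually compatible is something to check against the HTCondor release notes.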
