Mailing List Archives
Authenticated access
|
|
|
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Condor-users] "Over submitter resource limit (0) ... only considerstartd ranks"
- Date: Tue, 21 Feb 2006 09:16:51 +1000
- From: "DeVoil, Peter" <Peter.DeVoil@xxxxxxxxxxxxxx>
- Subject: Re: [Condor-users] "Over submitter resource limit (0) ... only considerstartd ranks"
(A followup for the archives.)
The problem disappeared after restarting the condor daemons on the
submitting machine.
P
> -----Original Message-----
> From: condor-users-bounces@xxxxxxxxxxx
> [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of DeVoil, Peter
> Sent: Tuesday, 14 February 2006 11:15 AM
> To: Condor-Users Mail List
> Subject: [Condor-users] "Over submitter resource limit (0)
> ... only considerstartd ranks"
>
> Hi,
>
> I have a problem with a 80-node windows pool.
>
> I have a "bulk user" that has submitted tens of thousands of
> jobs, and also ordinary users - hundreds of jobs.
>
> Since last week, only about ~50% of the pool has been active
> at any time. There are about 100 nice_user (bulk) jobs in the
> queue, but only 40 execute at any time - should be 80.
>
> I've read manual pages and can't find a setting that mentions
> this restriction. Any suggestions?
>
> Yours,
> pdev.
>
> There is a strange message in the negotiator log:
> 2/13 11:41:24 ---------- Started Negotiation Cycle ----------
> 2/13 11:41:24 Phase 1: Obtaining ads from collector ...
> 2/13 11:41:24 Getting all public ads ...
> 2/13 11:41:24 Sorting 134 ads ...
> 2/13 11:41:24 Getting startd private ads ...
> 2/13 11:41:25 Got ads: 134 public and 85 private
> 2/13 11:41:25 Public ads include 2 submitter, 85 startd
> 2/13 11:41:25 Phase 2: Performing accounting ...
> 2/13 11:41:25 Phase 3: Sorting submitter ads by priority ...
> 2/13 11:41:25 Phase 4.1: Negotiating with schedds ...
> 2/13 11:41:25 Negotiating with nice-user.Reds@* at
> <192.168.0.98:1868>
> 2/13 11:41:25 Over submitter resource limit (0) ... only consider
> startd ranks
> 2/13 11:41:36 Request 91073.00000:
> 2/13 11:41:36 Rejected 91073.0 nice-user.Reds@*
> <192.168.0.98:1868>: no match found
> 2/13 11:41:36 Request 91074.00000:
> 2/13 11:41:37 Rejected 91074.0 nice-user.Reds@*
> <192.168.0.98:1868>: no match found
> 2/13 11:41:37 Request 91075.00000:
> 2/13 11:41:37 Rejected 91075.0 nice-user.Reds@*
> <192.168.0.98:1868>: no match found
> .............
>
> I have reset the userpriorities to no avail. Any ideas?
>
> Yours,
> pdev.
>
> ********************************DISCLAIMER****************************
> The information contained in the above e-mail message or
> messages (which includes any attachments) is confidential and
> may be legally privileged. It is intended only for the use
> of the person or entity to which it is addressed. If you are
> not the addressee any form of disclosure, copying,
> modification, distribution or any action taken or omitted in
> reliance on the information is unauthorised. Opinions
> contained in the message(s) do not necessarily reflect the
> opinions of the Queensland Government and its authorities.
> If you received this communication in error, please notify
> the sender immediately and delete it from your computer
> system network.
>
> _______________________________________________
> Condor-users mailing list
> Condor-users@xxxxxxxxxxx
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
>