[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Reseting the ImageSize



Hi Dan,

thank you for the prompt response. I changed the ImageSize, in fact I tried, but the command didn't ended with code 0. Also, I messed up the whole system, so it was necessary restart the services to be able to see again the status of the schedd and jobs.

I'm not sure I understood what you meant about adjust advertised memory needed. The requirements to allow that jobs can run here, are:
Requirements = (OpSys == "LINUX" && Arch == "INTEL") && (Disk >= DiskUsage) && ((Memory * 1024) >= ImageSize) && (HasFileTransfer)

It's not clear to me what will happen if I use '(Memory > 1000)' or '(Memory > 10)', because the job ImageSize still will be greater than 2GB available per job (ImageSize = 2500000, in this case).

Thanks,
Alan.

On Tue, May 31, 2011 at 5:16 PM, Dan O'Donovan <odonovan@xxxxxxxxxxxxxxxxxxx> wrote:
Hi Alan,

>From the manual:

http://www.cs.wisc.edu/condor/manual/v7.3/condor_qedit.html
and
http://www.cs.wisc.edu/condor/manual/v7.3/7_3Running_Condor.html#SECTION008312000000000000000

It looks like you want to use:

condor_qedit Requirements 'ImageSize == 0'

but I'm not sure that this is a good idea.
Rather than messing with ImageSize, why not just adjust the advertised memory needed?

condor_qedit Requirements '(Memory > 1000)'

or similar..

Dan


On May 31, 2011, at 4:05 PM, Alan Malta wrote:

> Dear experts,
>
> there are several idle jobs in the condor queue, due to their memory requirements is greater than the available resources. So, I've been trying to reset the ImageSize of those jobs to finally get their running, but the problem here is that I can't run the command, as below:
>
> [root@osgce osg-1.2.4]# condor_qedit ImageSize 0
> Usage: condor_qedit [-debug] [-n schedd-name] [-pool pool-name] { cluster | cluster.proc | owner | -constraint constraint } attribute-name attribute-value ...
> or ...
> [root@osgce osg-1.2.4]# condor_qedit -n osgce.hepgrid.uerj.br ImageSize 0
> condor_qedit: No such schedd named osgce.hepgrid.uerj.br in local pool
> or ...
> [root@osgce osg-1.2.4]# condor_qedit -n osgce@xxxxxxxxxxxxxxxxxxxxx ImageSize 0
> Usage: condor_qedit [-debug] [-n schedd-name] [-pool pool-name] { cluster | cluster.proc | owner | -constraint constraint } attribute-name attribute-value ...
>
> It seems that I'm running the command with the wrong parameters, but I'm not sure about that. Running the condor_q command, I can see the following line showing the schedd_name:
> -- Submitter: osgce@xxxxxxxxxxxxxxxxxxxxx : <xx.xxx.xxx.xx:xxx> : osgce.hepgrid.uerj.br
>
> Also, the condor version and platform are:
> [root@osgce osg-1.2.4]# condor_version
> $CondorVersion: 7.2.4 Jun 15 2009 BuildID: 159529 $
> $CondorPlatform: X86_64-LINUX_RHEL5 $
>
> Any hints would be really appreciate,
> Alan.
>
> _______________________________________________
> Condor-users mailing list
> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
> subject: Unsubscribe
> You can also unsubscribe by visiting
> https://lists.cs.wisc.edu/mailman/listinfo/condor-users
>
> The archives can be found at:
> https://lists.cs.wisc.edu/archive/condor-users/

_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users

The archives can be found at:
https://lists.cs.wisc.edu/archive/condor-users/