
Re: [Condor-users] Virtual Condor on Novell OES Linux



Hi Dan

Sorry about the delay, I've been busy graduating again. After setting
RESERVED_SWAP = 0 (one of the Condor logs told me to do it), the env and loop
example programs now work.
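
For anyone checking the same setting, here is a minimal sketch in Python that reports the last value assigned to RESERVED_SWAP in a piece of Condor-style configuration text. The function name last_setting and the sample text are hypothetical, and the sketch assumes plain "NAME = value" lines with no macro expansion or line continuations.

```python
def last_setting(config_text, name):
    """Return the last value assigned to `name` in the given text, or None."""
    value = None
    for raw in config_text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, rest = line.partition("=")
        # Assumption: names compare case-insensitively and a later
        # definition replaces an earlier one.
        if sep and key.strip().upper() == name.upper():
            value = rest.strip()
    return value


# Hypothetical sample text, not taken from a real installation.
sample = """
# local overrides
RESERVED_SWAP = 5
RESERVED_SWAP = 0
"""
print(last_setting(sample, "RESERVED_SWAP"))  # prints 0
```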

Thanks

James

---

Dr James Osborne, BSc (hons), PhD
High End Computing Support Engineer
Information Services, Cardiff University, 50 Park Place, Cardiff CF10 3AT,
Wales UK
Tel: +44(0)29 2087 4657
Fax: +44(0)29 2087 4285
Email: osborneja1@xxxxxxxxxxxxx 
Web: http://www.cardiff.ac.uk/insrv/condor

>>> dan@xxxxxxxxxxxx 07/07/2006 23:47:24 >>>

If the job never even starts running, then it doesn't matter whether
condor_compile worked or not.  I think the next step is to understand
whether the job is failing to get matched to a machine, failing to
claim the machine, failing to execute, or something else.

To answer these questions, you can look in several places:

"user log"  (the file specified as the log in the submit file)

In some cases, all of your questions about a job can be answered by 
looking in the "user log", because it contains a record of many of the 
events that happen in the life of the job.  However, it does not 
contain all information, so some kinds of problems require looking in 
the Condor daemon logs:

NegotiatorLog  (does the negotiator say it successfully matched the job 
to a machine?)

StartLog (does the machine show that it is getting claimed by the 
schedd?)

ShadowLog (is a shadow starting up to manage the job?)

StarterLog (is the starter attempting to run the job?)
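
As a rough illustration of checking the daemon logs listed above, the following Python sketch collects every line that mentions a given string (for example a host address or job id) from each log. The function name grep_daemon_logs is hypothetical, and the log directory is an assumption you must supply, since its location depends on the installation.

```python
from pathlib import Path

# Daemon logs named in this message, in the order they are listed above.
DAEMON_LOGS = ["NegotiatorLog", "StartLog", "ShadowLog", "StarterLog"]


def grep_daemon_logs(log_dir, needle):
    """Return {log name: [matching lines]} for each listed log found in log_dir."""
    hits = {}
    for name in DAEMON_LOGS:
        path = Path(log_dir) / name
        if not path.is_file():
            # Not every machine runs every daemon, so a missing log is skipped.
            continue
        with path.open(errors="replace") as handle:
            hits[name] = [line.rstrip("\n") for line in handle if needle in line]
    return hits
```

Calling grep_daemon_logs(log_dir, "131.251.50.243") with your own log directory would show which of the logs mention that address. An empty list for a log that exists suggests the job never reached that stage.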

--Dan

On Jul 7, 2006, at 4:57 PM, James Osborne wrote:

> Hi Dan
>
> I have never looked at the NegotiatorLog before, but I think this line in the
> log indicates my virtual Linux node and our central manager are talking to
> each other...
>
> 7/6 15:15:43   Negotiating with condor@xxxxxxxxxxxxxx at <131.251.50.243:32797>
>
> The log contains lots of lines mentioning this machine.  Running condor_off
> works from the central manager to the virtual Linux node so I don't think it's
> a firewall issue, but I might be wrong.  It could be that condor_compile didn't
> work correctly because I installed the compiler after installing Condor - I had
> to use YaST to install GCC etc. post install since the basic build didn't
> include compiler tools.  Do you have a simple submit script that you could send
> that I could try to run that calls Unix date or something, a simple script that
> you know for a fact works on a platform like Red Hat?
>
> Thanks in advance
>
> James
>
>>>> dan@xxxxxxxxxxxx 06/07/2006 15:13:25 >>>
> Hi James,
>
> If you haven't already done so, you may want to look in the
> NegotiatorLog to see if it is successfully connecting to your schedd
> running inside VMWare.
>
> --Dan
>
> James Osborne wrote:
>
>> Hi All
>>
>> I have a virtual Condor execute node installed on Novell OES Linux in VMWare
>> running on my Windows machine.  All daemons are up, i.e. Master, Startd and
>> Schedd, however I cannot get the example jobs to run properly.  I have
>> compiled them using condor_compile, I believe successfully, but cannot get
>> them to run after submitting them.  I tried the env and loop examples.  I
>> submitted both whilst logged in as a condor user I made myself (that might be
>> the problem?).
>> The virtual machine registers with the central manager correctly and everything
>>
>> running condor_q -analyze gives...
>> ...
>> 1 match, match, but reject the job for unknown reasons
>> ...
>>
>> Does anybody have a better step-by-step or hints guide for getting Condor to
>> work on what is basically SUSE in VMWare?  The next step might be for me to
>> remove the current installation and install a personal Condor using
>> condor_configure and see if that can complete the jobs.  I installed a
>> version of Condor on Linux in Milan last week so I don't think I missed
>> anything.
>>
>> Thanks in advance
>>
>> James
>> _______________________________________________
>> Condor-users mailing list
>> To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx 
>> with a
>> subject: Unsubscribe
>> You can also unsubscribe by visiting
>> https://lists.cs.wisc.edu/mailman/listinfo/condor-users 
>>
>> The archives can be found at either
>> https://lists.cs.wisc.edu/archive/condor-users/ 
>> http://www.opencondor.org/spaces/viewmailarchive.action?key=CONDOR 
>>