On 11/11/06, Christopher Jon Jursa <cjursa@xxxxxxxxxxxx> wrote:
Hi,
I am attempting to run an MPI executable and submit it through
Condor. I am
using Fedore Core 5, mpich 1.2.4 and Condor 6.7.21.
However, I am getting an rsh error when Condor executes.
The errror message is at
http://gis.sis.pitt.edu/temp/chris/mpi2/mpiTest.error
The job submission file I am using is at
http://gis.sis.pitt.edu/temp/chris/mpi/mpi_job
Finally, the script file used in the job file is at
http://gis.sis.pitt.edu/temp/chris/mpi2/jobfile.sh
How is rsh used by Condor and MPI? How do I fix the rsh error? I
think I
have dropped all the firewalls. What are common reasons for this
error?
Cheers,
Chris
Sincerely,
Christopher Jon Jursa
Geoinformatics Laboratory
School of Information Sciences
University of Pittsburgh
web: http://gis.sis.pitt.edu
email: cjursa@xxxxxxxxxxxx
phone: 412-624-8858
You have to configure RSH, by default, it is disabled in Linux.
Check the /etc/xinetd.d/rsh and /etc/xinetd.d/rlogin and make sure
they say disable=no.
In /etc/hosts.allow you need an entry like this:
in.rshd: 192.168.1.0/255.255.255.0
in.rlogind: 192.168.1.0/255.255.255.0
Also, you have to add every mpi host in the /etc/hosts.equiv file so
you can freely use rsh between them.
Be carefull, rsh is a plain text comunication service, so if you are
not in a closed cluster, try using ssh instead rsh.
I hope this helps.
Bye!
--
Diego Bello Carreño
Estudiante Memorista de Ingeniería Civil Informática
UTFSM, Valparaíso, Chile
Usuario #294897 counter.li.org
_______________________________________________
Condor-users mailing list
To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx
with a
subject: Unsubscribe
You can also unsubscribe by visiting
https://lists.cs.wisc.edu/mailman/listinfo/condor-users
The archives can be found at either
https://lists.cs.wisc.edu/archive/condor-users/
http://www.opencondor.org/spaces/viewmailarchive.action?key=CONDOR