[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Condor-users] Problem with match: sending small msg failed



Is 24427 within the LOWPORT / HIGHPORT range on your startd machine?
 
Can you check using condor_config_val?
 
Is there also a firewall on the schedd machine? What are its settings?
 
I am a little confused at the following:
* You have 2 machines in pool
* 1 matches job, 1 doesn't
* The error mentions vm2@24043@mydomain
* vm2 implies a vm1 on the same machine and you would expect them to share
  configurations.
* Maybe this is a different use of vm2 (I haven't studied log files more than I have to)

cheers

JK
 
 


________________________________

	From: condor-users-bounces@xxxxxxxxxxx [mailto:condor-users-bounces@xxxxxxxxxxx] On Behalf Of Enol Fernández
	Sent: Thursday, November 22, 2007 5:03 PM
	To: Condor-Users Mail List
	Subject: [Condor-users] Problem with match: sending small msg failed
	
	
	Hello,
	
	I have a condor pool set up and I'm trying to get my job executed in an
	specific machine, but it never gets to run
	
	If I do a condor_q -ana I get the following output
	
	211.000:  Run analysis summary.  Of 2 machines, 
	      0 are rejected by your job's requirements
	      1 reject your job because of their own requirements
	      0 match but are serving users with a better priority in the pool
	      1 match but reject the job for unknown reasons 
	      0 match but will not currently preempt their existing job
	      0 are available to run your job
	        No successful match recorded.
	        Last failed match: Thu Nov 22 16:38:45 2007
	        Reason for last match failure: no match found 
	
	And then in the Negotiator log I can see the following message:
	
	11/22 16:19:50 SafeMsg: sending small msg failed. errno: 1
	11/22 16:19:50       Could not send MATCH_INFO/capability to
	vm2@24043@mydomain 
	11/22 16:19:50       Rejected 211.0 myuser@xxxxxxx <xxx.xxx.x.xx:24427>:
	no match found
	
	I have a firewall in the startd machine, but the ports between LOWPORT
	and HIGHPORT are open.
	
	What king of message is this "small msg"? 
	
	Thanks,
	Enol.
	
	--
	Enol Fernández
	Computer Architecture & Operting Systems Department
	Universitat Autònoma de Barcelona