Dear Zach: I added ' TOOL_DEBUG = D_ALL' and 'SUBMIT_DEBUG = D_ALL' in condor_config and the following is the result of 'condor_submit -debug **', any idea what part leads to this 'ERROR: Failed to connect to local queue manager; SECMAN:2007:Failed to end classad message' ? Please notice that this machine valtical00 is used as ' COLLECTOR, MASTER, NEGOTIATOR, SCHEDD, STARTD'. Enclose are the 2 configuration files and all the logs(old logs already removed). -bash-4.1$ condor_submit -debug valtical00.job 11/02/13 12:51:49 (fd:3) (pid:28613) config: using subsystem 'SUBMIT', local '' 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysMajorVersion: 6 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysShortName: SLCern 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysLongName: Scientific Linux CERN SLC release 6.4 (Carbon) 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysAndVer: SLCern6 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysLegacy: LINUX 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysName: SLCern 11/02/13 12:51:49 (fd:3) (pid:28613) OpSysVer: 604 11/02/13 12:51:49 (fd:3) (pid:28613) OpSys: LINUX 11/02/13 12:51:49 (fd:3) (pid:28613) Reading from /proc/cpuinfo 11/02/13 12:51:49 (fd:3) (pid:28613) Found: Physical-IDs:True; Core-IDs:True 11/02/13 12:51:49 (fd:3) (pid:28613) Analyzing 16 processors using IDs... 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #0 (PID:0, CID:0): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#1 : pid:0!=0 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#2 : pid:0!=0 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#3 : pid:0!=0 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#4 : pid:0!=1 or cid:0!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#5 : pid:0!=1 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#6 : pid:0!=1 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#7 : pid:0!=1 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#8 : pid:0==0 and cid:0==0 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#9 : pid:0!=0 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#10 : pid:0!=0 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#11 : pid:0!=0 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#12 : pid:0!=1 or cid:0!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#13 : pid:0!=1 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#14 : pid:0!=1 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#0 and P#15 : pid:0!=1 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 1 11/02/13 12:51:49 (fd:3) (pid:28613) P0: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P8: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #1 (PID:0, CID:1): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#2 : pid:0!=0 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#3 : pid:0!=0 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#4 : pid:0!=1 or cid:1!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#5 : pid:0!=1 or cid:1!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#6 : pid:0!=1 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#7 : pid:0!=1 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#8 : pid:0!=0 or cid:1!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#9 : pid:0==0 and cid:1==1 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#10 : pid:0!=0 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#11 : pid:0!=0 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#12 : pid:0!=1 or cid:1!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#13 : pid:0!=1 or cid:1!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#14 : pid:0!=1 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#1 and P#15 : pid:0!=1 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 2 11/02/13 12:51:49 (fd:3) (pid:28613) P1: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P9: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #2 (PID:0, CID:2): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#3 : pid:0!=0 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#4 : pid:0!=1 or cid:2!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#5 : pid:0!=1 or cid:2!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#6 : pid:0!=1 or cid:2!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#7 : pid:0!=1 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#8 : pid:0!=0 or cid:2!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#9 : pid:0!=0 or cid:2!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#10 : pid:0==0 and cid:2==2 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#11 : pid:0!=0 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#12 : pid:0!=1 or cid:2!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#13 : pid:0!=1 or cid:2!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#14 : pid:0!=1 or cid:2!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#2 and P#15 : pid:0!=1 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 3 11/02/13 12:51:49 (fd:3) (pid:28613) P2: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P10: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #3 (PID:0, CID:3): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#4 : pid:0!=1 or cid:3!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#5 : pid:0!=1 or cid:3!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#6 : pid:0!=1 or cid:3!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#7 : pid:0!=1 or cid:3!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#8 : pid:0!=0 or cid:3!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#9 : pid:0!=0 or cid:3!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#10 : pid:0!=0 or cid:3!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#11 : pid:0==0 and cid:3==3 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#12 : pid:0!=1 or cid:3!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#13 : pid:0!=1 or cid:3!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#14 : pid:0!=1 or cid:3!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#3 and P#15 : pid:0!=1 or cid:3!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 4 11/02/13 12:51:49 (fd:3) (pid:28613) P3: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P11: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #4 (PID:1, CID:0): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#5 : pid:1!=1 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#6 : pid:1!=1 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#7 : pid:1!=1 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#8 : pid:1!=0 or cid:0!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#9 : pid:1!=0 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#10 : pid:1!=0 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#11 : pid:1!=0 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#12 : pid:1==1 and cid:0==0 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#13 : pid:1!=1 or cid:0!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#14 : pid:1!=1 or cid:0!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#4 and P#15 : pid:1!=1 or cid:0!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 5 11/02/13 12:51:49 (fd:3) (pid:28613) P4: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P12: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #5 (PID:1, CID:1): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#6 : pid:1!=1 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#7 : pid:1!=1 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#8 : pid:1!=0 or cid:1!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#9 : pid:1!=0 or cid:1!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#10 : pid:1!=0 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#11 : pid:1!=0 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#12 : pid:1!=1 or cid:1!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#13 : pid:1==1 and cid:1==1 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#14 : pid:1!=1 or cid:1!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#5 and P#15 : pid:1!=1 or cid:1!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 6 11/02/13 12:51:49 (fd:3) (pid:28613) P5: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P13: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #6 (PID:1, CID:2): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#7 : pid:1!=1 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#8 : pid:1!=0 or cid:2!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#9 : pid:1!=0 or cid:2!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#10 : pid:1!=0 or cid:2!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#11 : pid:1!=0 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#12 : pid:1!=1 or cid:2!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#13 : pid:1!=1 or cid:2!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#14 : pid:1==1 and cid:2==2 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#6 and P#15 : pid:1!=1 or cid:2!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 7 11/02/13 12:51:49 (fd:3) (pid:28613) P6: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P14: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #7 (PID:1, CID:3): 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#8 : pid:1!=0 or cid:3!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#9 : pid:1!=0 or cid:3!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#10 : pid:1!=0 or cid:3!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#11 : pid:1!=0 or cid:3!=3 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#12 : pid:1!=1 or cid:3!=0 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#13 : pid:1!=1 or cid:3!=1 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#14 : pid:1!=1 or cid:3!=2 (match=No) 11/02/13 12:51:49 (fd:3) (pid:28613) Comparing P#7 and P#15 : pid:1==1 and cid:3==3 (match=2) 11/02/13 12:51:49 (fd:3) (pid:28613) ncpus = 8 11/02/13 12:51:49 (fd:3) (pid:28613) P7: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) P15: match->2 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #8 (PID:0, CID:0): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #9 (PID:0, CID:1): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #10 (PID:0, CID:2): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #11 (PID:0, CID:3): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #12 (PID:1, CID:0): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #13 (PID:1, CID:1): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #14 (PID:1, CID:2): 11/02/13 12:51:49 (fd:3) (pid:28613) Looking at processor #15 (PID:1, CID:3): 11/02/13 12:51:49 (fd:3) (pid:28613) Using IDs: 16 processors, 8 CPUs, 8 HTs 11/02/13 12:51:49 (fd:3) (pid:28613) Reading condor configuration from '/etc/condor/condor_config' 11/02/13 12:51:49 (fd:3) (pid:28613) condor_gethostname() claims we are valtical00.cern.ch 11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140 11/02/13 12:51:49 (fd:3) (pid:28613) ENABLE_IPV6 is undefined, using default value of False 11/02/13 12:51:49 (fd:3) (pid:28613) Considering valtical00.cern.ch (Ranked at 3) as possible local hostname versus valtical00.cern.ch/ (0) 11/02/13 12:51:49 (fd:3) (pid:28613) Identifying myself as: Short:: valtical00, Long: valtical00.cern.ch, IP: 137.138.40.140 11/02/13 12:51:49 (fd:3) (pid:28613) Trying to getting network interface informations (after reading config) 11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140 11/02/13 12:51:49 (fd:3) (pid:28613) condor_gethostname() claims we are valtical00.cern.ch 11/02/13 12:51:49 (fd:3) (pid:28613) NETWORK_INTERFACE=* matches lo 127.0.0.1, eth0 137.138.40.140, virbr0 192.168.122.1, choosing IP 137.138.40.140 11/02/13 12:51:49 (fd:3) (pid:28613) Considering valtical00.cern.ch (Ranked at 3) as possible local hostname versus valtical00.cern.ch/valtical00.cern.ch (0) 11/02/13 12:51:49 (fd:3) (pid:28613) Identifying myself as: Short:: valtical00, Long: valtical00.cern.ch, IP: 137.138.40.140 11/02/13 12:51:49 (fd:3) (pid:28613) CONDOR_FSYNC is undefined, using default value of True 11/02/13 12:51:49 (fd:3) (pid:28613) WARN_ON_UNUSED_SUBMIT_FILE_MACROS is undefined, using default value of True 11/02/13 12:51:49 (fd:3) (pid:28613) TOOL_LOG_KEEP_OPEN is undefined, using default value of True 11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_SKIP_FILECHECKS is undefined, using default value of False 11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_MAX_PROCS_IN_CLUSTER is undefined, using default value of 0 11/02/13 12:51:49 (fd:3) (pid:28613) KEYCACHE: created: 0x2684c70 11/02/13 12:51:49 (fd:3) (pid:28613) TIMEOUT_MULTIPLIER is undefined, using default value of 0 11/02/13 12:51:49 (fd:3) (pid:28613) SUBMIT_TIMEOUT_MULTIPLIER is undefined, using default value of 0 11/02/13 12:51:49 (fd:3) (pid:28613) *** TIMEOUT_MULTIPLIER :: 0 11/02/13 12:51:49 (fd:3) (pid:28613) New Daemon obj (schedd) name: "NULL", pool: "NULL", addr: "NULL" 11/02/13 12:51:49 (fd:3) (pid:28613) Neither name nor addr specified, using local values - name: "valtical00.cern.ch", full host: "valtical00.cern.ch" 11/02/13 12:51:49 (fd:3) (pid:28613) Finding classad for local daemon, SCHEDD_DAEMON_AD_FILE is "/var/lib/condor/spool/.schedd_classad" 11/02/13 12:51:49 (fd:4) (pid:28613) STRICT_CLASSAD_EVALUATION is undefined, using default value of False 11/02/13 12:51:49 (fd:3) (pid:28613) Found Name in ClassAd, using "valtical00.cern.ch" 11/02/13 12:51:49 (fd:3) (pid:28613) Found SCHEDDIpAddr in ClassAd, using "<137.138.40.140:39738>" 11/02/13 12:51:49 (fd:3) (pid:28613) Found CondorVersion in ClassAd, using "$CondorVersion: 7.8.8 Jun 17 2013 $" 11/02/13 12:51:49 (fd:3) (pid:28613) Found CondorPlatform in ClassAd, using "$CondorPlatform: X86_64-CentOS_6.4 $" 11/02/13 12:51:49 (fd:3) (pid:28613) Found Machine in ClassAd, using "valtical00.cern.ch" 11/02/13 12:51:49 (fd:3) (pid:28613) validate <137.138.40.140:39738> 11/02/13 12:51:49 (fd:3) (pid:28613) success 11/02/13 12:51:49 (fd:3) (pid:28613) Using port 39738 based on address "<137.138.40.140:39738>" Submitting job(s)11/02/13 12:51:49 (fd:4) (pid:28613) TIMEOUT_MULTIPLIER is undefined, using default value of 0 11/02/13 12:51:49 (fd:4) (pid:28613) SUBMIT_TIMEOUT_MULTIPLIER is undefined, using default value of 0 11/02/13 12:51:49 (fd:4) (pid:28613) *** TIMEOUT_MULTIPLIER :: 0 11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738> 11/02/13 12:51:49 (fd:4) (pid:28613) success 11/02/13 12:51:49 (fd:4) (pid:28613) New Daemon obj (schedd) name: "NULL", pool: "NULL", addr: "<137.138.40.140:39738>" 11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738> 11/02/13 12:51:49 (fd:4) (pid:28613) success 11/02/13 12:51:49 (fd:4) (pid:28613) Already have address, no info to locate 11/02/13 12:51:49 (fd:4) (pid:28613) validate <137.138.40.140:39738> 11/02/13 12:51:49 (fd:4) (pid:28613) success 11/02/13 12:51:49 (fd:4) (pid:28613) Using port 39738 based on address "<137.138.40.140:39738>" 11/02/13 12:51:49 (fd:4) (pid:28613) Guess address string for host = <137.138.40.140:39738>, port = 0 11/02/13 12:51:49 (fd:4) (pid:28613) it was sinful string. ip = 137.138.40.140, port = 39738 11/02/13 12:51:49 (fd:5) (pid:28613) OUT_LOWPORT is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) LOWPORT is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) CONNECT bound to <137.138.40.140:46124> fd=4 peer=<137.138.40.140:39738> 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: command 1112 QMGMT_WRITE_CMD to schedd at <137.138.40.140:39738> from TCP port 46124 (blocking). 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_SUBMIT_CLIENT_SESSION_DURATION is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_SUBMIT_DEFAULT_SESSION_DURATION is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_CLIENT_SESSION_DURATION is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_DEFAULT_SESSION_DURATION is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_CLIENT_SESSION_LEASE is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SEC_DEFAULT_SESSION_LEASE is undefined, using default value of 0 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: no cached key for {<137.138.40.140:39738>,<1112>}. 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: Security Policy: AuthMethods = "FS,KERBEROS,GSI" SessionDuration = "60" Authentication = "NEVER" Enact = "NO" Subsystem = "SUBMIT" Integrity = "NEVER" NewSession = "YES" CryptoMethods = "3DES,BLOWFISH" OutgoingNegotiation = "PREFERRED" Encryption = "NEVER" CurrentTime = time() SessionLease = 3600 ServerPid = 28613 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: negotiating security for command 1112. 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: sending DC_AUTHENTICATE command 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: sending following classad: Command = 1112 AuthMethods = "FS,KERBEROS,GSI" SessionDuration = "60" Authentication = "NEVER" Enact = "NO" Subsystem = "SUBMIT" Integrity = "NEVER" RemoteVersion = "$CondorVersion: 7.8.8 Jun 17 2013 $" NewSession = "YES" CryptoMethods = "3DES,BLOWFISH" OutgoingNegotiation = "PREFERRED" Encryption = "NEVER" CurrentTime = time() SessionLease = 3600 ServerPid = 28613 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 resetting 11/02/13 12:51:49 (fd:5) (pid:28613) condor_write(fd=4 schedd at <137.138.40.140:39738>,,size=370,timeout=0,flags=0) 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394]) 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394]) 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca7e0 adding fd 4 (socket:[1039394]) 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca6d0 resetting 11/02/13 12:51:49 (fd:5) (pid:28613) condor_read(fd=4 schedd at <137.138.40.140:39738>,,size=5,timeout=0,flags=0) 11/02/13 12:51:49 (fd:5) (pid:28613) selector 0x7fff9f3ca6d0 adding fd 4 (socket:[1039394]) 11/02/13 12:51:49 (fd:5) (pid:28613) condor_read(): Socket closed when trying to read 5 bytes from schedd at <137.138.40.140:39738> 11/02/13 12:51:49 (fd:5) (pid:28613) IO: EOF reading packet header 11/02/13 12:51:49 (fd:5) (pid:28613) Stream::get(int) failed to read padding 11/02/13 12:51:49 (fd:5) (pid:28613) SECMAN: no classad from server, failing 11/02/13 12:51:49 (fd:5) (pid:28613) CLOSE <137.138.40.140:46124> fd=4 11/02/13 12:51:49 (fd:4) (pid:28613) Destroying Daemon object: 11/02/13 12:51:49 (fd:4) (pid:28613) Type: 3 (schedd), Name: (null), Addr: <137.138.40.140:39738> 11/02/13 12:51:49 (fd:4) (pid:28613) FullHost: (null), Host: (null), Pool: (null), Port: 39738 11/02/13 12:51:49 (fd:4) (pid:28613) IsLocal: N, IdStr: schedd at <137.138.40.140:39738>, Error: (null) 11/02/13 12:51:49 (fd:4) (pid:28613) --- End of Daemon object info --- ERROR: Failed to connect to local queue manager SECMAN:2007:Failed to end classad message. Cheers,Gang ________________________________________ From: condor-users-bounces@xxxxxxxxxxx [condor-users-bounces@xxxxxxxxxxx] on behalf of Zachary Miller [zmiller@xxxxxxxxxxx] Sent: 31 August 2012 01:14 To: Condor-Users Mail List Subject: Re: [Condor-users] how to fix 'DC_AUTHENTICATE: Unable to reconcile!'? On Thu, Aug 30, 2012 at 10:05:39PM +0000, Gang Qin wrote: > Dear expert: > > Today I try to add a new machine as condor submitter, after adding 'SCHEDD' > to the DAEMON_LIST and restarting the condor service, condor_q and > condor_status could work. But when I try to submit a job, it fails with the > following error: > > ERROR: Failed to connect to local queue manager > SECMAN:2007:Failed to end classad message. > > And in SchedLog , I see the following error message: > > 08/30/12 23:54:33 (pid:12036) DC_AUTHENTICATE: Unable to reconcile! this means the security policy can't be agreed on by the client and server. you can more info by setting (in the condor_config) TOOL_DEBUG = D_ALL SUBMIT_DEBUG = D_ALL in the condor config file, and then running: condor_submit -debug <your submit file> if you want to send me the output of that (offlist is fine) i'll see if i can find the problem. cheers, -zach _______________________________________________ Condor-users mailing list To unsubscribe, send a message to condor-users-request@xxxxxxxxxxx with a subject: Unsubscribe You can also unsubscribe by visiting https://lists.cs.wisc.edu/mailman/listinfo/condor-users The archives can be found at: https://lists.cs.wisc.edu/archive/condor-users/
Attachment:
condor.tar.gz
Description: condor.tar.gz