[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] question related to condor_ssh_to_job and container



Dear all,

I am having some issue to use condor_ssh_to_job, resulting in an "ssh_exchange_identification: Connection closed by remote host".
I am using the condor version 9.0.14 for x86_64_CentOS7.

The situation is the following.

I am running HTCondor in an Apptainer container, and run a test job in an Apptainer container via this HTCondor setup.
My test job is running fine, I can retrieve the output, error and log files.

But when trying to access my job at runtime using condor_ssh_to_job, I got the error mentioned above.
From the StarterLog.slot1_1 file, I know that the sshd command is 
executed with the following configuration:
/usr/sbin/sshd -i -e -f 
/var/lib/condor/execute/dir_93473/.condor_ssh_to_job_1/sshd_config.
I have access to the directory /var/lib/condor/execute/dir_$PID during 
the container runtime, and can access all of its content,
except the file sshd_config located in the subdirectory 
.condor_ssh_to_job_1. Actually, I am not able to access this 
subdirectory at all,
and the content of /var/lib/condor/execute/dir_$PID I am able to access 
is as listed below:
.chirp.config
condor_exec.exe
_condor_stderr
_condor_stdout
.docker_sock
.job.ad
.machine.ad
tmp
var_tmp

The file sshd_config is the one of interest for me, would you know how it's possible to access it?
Thanks a lot in advance for your help!

Cheers,
Benoit