[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[HTCondor-users] question related to condor_ssh_to_job and container



Dear all,

I am having some issue to use condor_ssh_to_job, resulting in an "ssh_exchange_identification: Connection closed by remote host".

I am using the condor version 9.0.14 for x86_64_CentOS7.

The situation is the following.

I am running HTCondor in an Apptainer container, and run a test job in an Apptainer container via this HTCondor setup.

My test job is running fine, I can retrieve the output, error and log files.

But when trying to access my job at runtime using condor_ssh_to_job, I got the error mentioned above.

From the StarterLog.slot1_1 file, I know that the sshd command is executed with the following configuration: /usr/sbin/sshd -i -e -f /var/lib/condor/execute/dir_93473/.condor_ssh_to_job_1/sshd_config.

I have access to the directory /var/lib/condor/execute/dir_$PID during the container runtime, and can access all of its content, except the file sshd_config located in the subdirectory .condor_ssh_to_job_1. Actually, I am not able to access this subdirectory at all, and the content of /var/lib/condor/execute/dir_$PID I am able to access is as listed below:

.chirp.config
condor_exec.exe
_condor_stderr
_condor_stdout
.docker_sock
.job.ad
.machine.ad
tmp
var_tmp

The file sshd_config is the one of interest for me, would you know how it's possible to access it?

Thanks a lot in advance for your help!

Cheers,
Benoit