Hi again, Greg

So, we were able to move to HTCondor 9.10.0 and, as expected, adding docker_network_type = host to my submit file got my Docker container using the network from the host. One step forward! But, two steps back: Docker is also running MPI in the container and, when I use the host network, I get an error like this:
MPID_nem_init(320).................:
MPID_nem_tcp_init(173).............:
MPID_nem_tcp_get_business_card(420):
MPID_nem_tcp_init(379).............: gethostbyname failed, <<address>> (errno 1)
Have you seen something like this? Do I somehow need to pass the hostname from the host into the container? Based on some similar errors and searching, that’s what it seems like:
But I wanted to see if I’m missing something obvious.

Many thanks
Mike

--
Michael Fienen, Ph.D.
Upper Midwest Water Science Center
Madison, Wisconsin 53726

From: HTCondor-users <htcondor-users-bounces@xxxxxxxxxxx> on behalf of Greg Thain <gthain@xxxxxxxxxxx>
On 7/25/22 5:30 PM, Fienen, Michael N via HTCondor-users wrote:
This feature was added in HTCondor 8.9.9, so it won't be in a stable version before that. Given that, maybe the best way to get this option added is to redefine the HTCondor knob "DOCKER" to point to a script you write, which inserts --network=host into the arguments when the command being run is docker run ... This is a bit of work, but if you can't upgrade, I don't think there's a better way.

-greg
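
For anyone who ends up needing the wrapper approach, here is a rough sketch of what such a script might look like in Python. It is untested against a real pool and rests on several assumptions: the real docker binary lives at /usr/bin/docker (REAL_DOCKER is a hypothetical name), HTCondor passes the subcommand ("run") as the first argument with no global options in front of it, and HTCondor does not already pass a conflicting --network option. The DOCKER knob in the execute node's configuration would then point at wherever this script is installed.

```python
#!/usr/bin/env python3
"""Sketch of a wrapper script for the HTCondor DOCKER knob (assumptions noted below)."""
import os
import sys

# Assumption: path to the real docker client on the execute node.
REAL_DOCKER = "/usr/bin/docker"


def rewrite_args(args):
    """Return a copy of args with --network=host inserted after a leading 'run'.

    Assumes the subcommand is the first argument. Any other subcommand
    (ps, rm, inspect, ...) is passed through unchanged.
    """
    if args and args[0] == "run" and "--network=host" not in args:
        return [args[0], "--network=host"] + list(args[1:])
    return list(args)


def main():
    new_args = rewrite_args(sys.argv[1:])
    # Replace this process with the real docker client so that its exit
    # code and output reach HTCondor unchanged.
    os.execv(REAL_DOCKER, [REAL_DOCKER] + new_args)


# Only forward when called with arguments, as HTCondor would call it.
if __name__ == "__main__" and len(sys.argv) > 1:
    main()
```

The script needs to be executable by the user the HTCondor daemons run as. Using os.execv rather than a subprocess keeps the wrapper out of the way once docker has started, which seemed the safer choice for signal handling.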