Quantcast
Channel: Clusters and HPC Technology
Viewing all articles
Browse latest Browse all 927

Bizarre authenticity of host issue when running across multiple nodes with Intel MPI

$
0
0

I am attempting to run a job across three nodes.  I have configured passwordless ssh and it definitely works in between every node (each node can ssh to the other two without a password).  The known_hosts file is definitely correct and all 3 nodes have identical .ssh directories.  I have also tried adding the keys to ssh-agent, although I'm not sure if that was necessary either as I didn't specify a pass phrase when generating the id_rsa key (I know this is terrible security but it's temporary for the sake of testing).

I can run a job across nodes 1 and 2 simultaneously without any difficulty, however if I try to use node 3 as well (or just nodes 1 and 3, or nodes 2 and 3) then the terminal is spammed with, "The authenticity of host 'node3 (IP of node 3)' can't be established." and there's no way to enter "yes" (even though I shouldn't have to in the first place as node 3's key is already in the known_hosts file of nodes 1 and 2).

If I try to launch the job on node 3, then I receive the same messages in the terminal with the hostname/IP of nodes 1 and 2.  I am able to run the job solely on node 3.

Any help would be greatly appreciated as this has been a real headache.  Clearly there is something I have overlooked even though the configuration and hardware of these three nodes is almost identical.  I am using Intel MPI 5.0.0.028 and CentOS 6.6.  The nodes are communicating over an Infiniband interface.  Thanks for any input.


Viewing all articles
Browse latest Browse all 927

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>