Quantcast
Channel: Clusters and HPC Technology
Viewing all articles
Browse latest Browse all 927

How to get the exit code from mpiexec.hydra

$
0
0

When running a workload on multiple nodes with mpiexec.hydra, the entire run aborts when even one node fails/shutsdown. I want to detect if the failure is due to node disconnection or something else. Trying to print out the exit code with "-print-all-exitcodes" does't seem to work

Is there any other option?


Viewing all articles
Browse latest Browse all 927

Trending Articles



<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>