Hi,
When we compile with '-pg' option, the following message was received during execution.
hfi_userinit: assign_context command failed: Interrupted system call hfp_gen1_context_open: hfi_userinit: failed, trying again (1/3) rank 0 : Hello, World! rank 1 : Hello, World!
This causes code performing heavy numerical computations to hang.
The only related information we can find on this issue is from Intel OPA repo: https://github.com/intel/opa-psm2/issues/28
Here are our system information:
- Linux 3.10.0-1062.el7.x86_64
- Intel 2019 Update 5
- hfi1-firmware-0.9-84
We appreciate your insight on how to minimize the interrupted system calls.
Regards.