Hi,
I want to calculate a Jacobian matrix, which is a sum of 960 (to be simple) 3x3 matrices by distributing the calculations of these 3x3 matrices to a Xeon Phi card. The calculation of the 3x3 matrices uses a third party library whose subroutines use an interger vector not only for the storage of parameter values but also to write and read intermidiate results. It is therefore necessary for each task to have this integer vector protected from other tasks. Can this be obtained on the physical core level or even for each thread (each Xeon Phi has 60x4=240 threads.
If I got it right the MPI commands can be used on the dual 6C Xeon host to let 2x6 cores or 2x6x2 threads offload to the Xeon Phi card.
To be clear I would like to do this under Windows if possible.
Best regards
Anders S