Under LSF, how can I run multiple executables with different thread counts and still use the nodes efficiently?
Currently I have to do:
#BSUB -R span[ptile=7]
#BSUB -R affinity[core(4)]
mpirun -n 8 -env OMP_NUM_THREADS=2 ./hellope : -n 12 -env OMP_NUM_THREADS=1 ./hellope : -n 8 -env OMP_NUM_THREADS=4 ./hellope
This yields a total of 60 threads (8*2 + 12*1 + 8*4). On nodes with 28 cores each, that would fit on 3 nodes, but because one executable uses 4 threads, I have to tile for the maximum thread count so that processes don't overlap on cores. With ptile=7, the 28 ranks spread over 4 nodes, so I need an extra node even though I don't need all of its cores. Is there a way to pack this more tightly onto the nodes, or to place the tasks where I want them? I thought maybe something using I_MPI_JOB_RESPECT_PROCESS_PLACEMENT, and perhaps defining the affinity somewhere other than the job card. Any thoughts would be appreciated.
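To make the core-count arithmetic concrete, here is a rough sketch in bash. The rank and thread counts come from the mpirun line above and the 28 cores per node from my cluster; the variable names are just for illustration, and the "packed" figure assumes an ideal placement of one core per thread, which may not be achievable in practice.

```shell
#!/bin/bash
# Core-count arithmetic for the mpirun line in this post.
# Assumption: 28 cores per node, one core per OpenMP thread.
cores_per_node=28
ranks=(8 12 8)      # MPI ranks per executable group
threads=(2 1 4)     # OMP_NUM_THREADS per group

total_threads=0
total_ranks=0
max_threads=0
for i in "${!ranks[@]}"; do
  total_threads=$(( total_threads + ranks[i] * threads[i] ))
  total_ranks=$(( total_ranks + ranks[i] ))
  if (( threads[i] > max_threads )); then
    max_threads=${threads[i]}
  fi
done

# Ideal packing: one core per thread, rounded up to whole nodes.
packed_nodes=$(( (total_threads + cores_per_node - 1) / cores_per_node ))

# Uniform tiling: every rank reserves max_threads cores,
# which is what ptile=7 with core(4) amounts to.
ranks_per_node=$(( cores_per_node / max_threads ))
tiled_nodes=$(( (total_ranks + ranks_per_node - 1) / ranks_per_node ))

echo "threads=$total_threads packed_nodes=$packed_nodes tiled_nodes=$tiled_nodes"
```

The gap between packed_nodes and tiled_nodes is the extra node I am trying to avoid.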