Hi,
I wonder whether using 'tag' in MPI carries any performance or memory overhead.
It would depend on the implementation, but my guess is that if it uses something like a hash map keyed by tag and keeps (possibly) multiple buffers per bucket, the process memory size could blow up.
Can this happen?
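
To make the concern concrete, here is a toy Python model of the scheme I am imagining. It is only a sketch of my guess, not how any real MPI library is implemented: the `TagBuckets` class, the `simulate` helper, and all the sizes are made up for illustration. It buffers messages that arrive before a matching receive is posted, one queue per tag, and reports how many bytes are being held.

```python
from collections import defaultdict, deque


class TagBuckets:
    """Hypothetical model: one queue of buffered, not-yet-received messages per tag."""

    def __init__(self):
        # tag -> queue of payloads waiting for a matching receive
        self.buckets = defaultdict(deque)

    def deliver(self, tag, payload):
        # A message arrived with no receive posted yet, so it must be buffered.
        self.buckets[tag].append(payload)

    def receive(self, tag):
        # Return the oldest buffered message for this tag, or None if there is none.
        queue = self.buckets.get(tag)
        if not queue:
            return None
        payload = queue.popleft()
        if not queue:
            # Drop empty buckets so the map itself does not keep growing.
            del self.buckets[tag]
        return payload

    def buffered_bytes(self):
        # Total payload bytes currently held across all buckets.
        return sum(len(p) for q in self.buckets.values() for p in q)


def simulate(num_tags, msgs_per_tag, msg_size):
    # Deliver msgs_per_tag messages on each of num_tags tags without receiving any.
    box = TagBuckets()
    for tag in range(num_tags):
        for _ in range(msgs_per_tag):
            box.deliver(tag, bytes(msg_size))
    return box


box = simulate(num_tags=1000, msgs_per_tag=4, msg_size=1024)
print(len(box.buckets), box.buffered_bytes())
```

In this model the held memory is simply number of tags times unreceived messages per tag times message size, so what grows it is the amount of unreceived data rather than the tags as such. What I cannot tell is whether a real implementation adds a noticeable per-tag cost on top of that.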