Recently I got a NUMA system (32 sockets but not an SGI UV). I noticed my MPI job which is smaller than a NUMA size is broken into different nodes, although there are still empty nodes. The performance is suffering. My question is:
- lscpu and numastat both reported 16 nodes, is it considered by the scheduler?
- the PBSPro BigBook referred to Cpusets. Is it the same as cpuset (/dev/cpuset)?
- Should I establish 16 vnodes? if yes, what should I do?
(Running CentOS 7.3, from the rebuilded 14.1.0 srpm without modification)