Bootstrapping is fast on M1 Max, slow on HPC Knights Landing / Broadwell

It sounds like you are reporting that the code takes longer as you include more threads. Is this true? Have you plotted speedup versus number of threads and observed reasonable scaling? If not, then your threading strategy may need revision.

1 Like