Another thought: are you using BLAS at all? In my experience, Julia multi-threading does not play nice with BLAS multithreading, although I have never experienced it working that much worse than a single-thread performance (ref). I’m always sure to use BLAS.set_num_threads(1)
when I want thread-parallel and linear algebra (thanks again, @Elrod).
7 Likes