Parallelization from v1.1.1 to v1.4

I just want to know if I missed anything or if there’s a future fix planned. I have code written in v1.1.1 that uses a lot of the multi-threading parallelization listed here.

When v1.2 and v1.3 came out, there was an attempt to make everything thread-safe. However, this also made things slower. My code is about 3-5x slower as of v1.4.

I’m not sure that I can convert everything to the alternate parallelization system. I was hoping that one of these releases would cure the inefficiency, but I haven’t heard anything about it. Is anyone working on this? Can I get involved to fix this somehow? Are there alternative systems that people have started using instead?