Optimising code: Multiplying a list of matrices by a matrix

Apologies to enter this thread with another topic. I have a problem which might be close to this one.
I’d like to kindly ask if anyone has experienced or could look into it. It is performance of loops with larger matrices and multiplication. For loop in function and multiplication of larger matrices, slow speed in parallel
Thanks a lot.