`mul` dispatch to BLAS incomplete?

Which it sometimes isn’t, so that we have discussions. I’m at a loss, what to do.

Edit: it seems for high performance applications it then would be appropriate to depend on the low level BLAS wrappers.