Triangular Arrays in julia?

Even if there are only about half operations to do, that 1/2 is a constant factor: the cost is still increasing with the size of the matrix. So that doesn’t really make a difference from standard arrays, except for changing the threshold above which calling BLAS is beneficial.

For matrix multiplication, you can call Base.LinAlg.generic_matmatmul directly. I’m not sure for what operations and types BLAS is called, and for which pure Julia code is used instead.