How to improve efficiency of my linear algebra code (matrix equivalency)

The answer to my original question (2) can be found here: https://discourse.julialang.org/t/parallel-matrix-col-row-operations-give-incorrect-results/

There are also benchmarks for lu over finite fields.