You are still correct, although the example you used was wrong. `A * transpose(A)` dispatches to `syrk` instead of `gemm`.
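A small sketch of the difference, assuming this is about Julia's `LinearAlgebra` with a real `Float64` matrix (the matrix size and variable names are made up for illustration). It calls the two BLAS wrappers directly rather than inspecting the dispatch itself, so it only shows that both routines produce the same product, with `syrk` filling just one triangle:

```julia
using LinearAlgebra

A = rand(4, 3)                      # hypothetical test matrix

C_mul = A * transpose(A)            # the high-level product from the comment

# syrk computes alpha * A * transpose(A) but writes only one triangle ('U' = upper)
C_syrk = BLAS.syrk('U', 'N', 1.0, A)

# gemm computes the general product alpha * A * transpose(A) in full
C_gemm = BLAS.gemm('N', 'T', 1.0, A, A)

# Mirror the upper triangle of the syrk result to get the full symmetric matrix
C_syrk_full = Matrix(Symmetric(C_syrk, :U))

@assert C_syrk_full ≈ C_mul
@assert C_gemm ≈ C_mul
```

Since `syrk` only has to compute one triangle of the symmetric result, it does roughly half the arithmetic of `gemm`, which is presumably why the symmetric case is routed to it.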