Cannon's or SUMMA algorithm implementation?

Are there any examples of distributed matrix multiplication using Cannon’s algorithm or SUMMA in Julia?