- probably because you’re running inside of WSL and you’re having to deal with a slower abstraction layer.
- perhaps.
- nothing you’re doing in that code will take advantage of multithreading or GPUs.
FWIW, on an Ubuntu virtual machine, I’m getting slower performance than you are:
julia> @benchmark mm(random_image_cpu)
BenchmarkTools.Trial:
memory estimate: 16 bytes
allocs estimate: 1
--------------
minimum time: 221.761 μs (0.00% GC)
median time: 1.375 ms (0.00% GC)
mean time: 1.383 ms (0.00% GC)
maximum time: 2.863 ms (0.00% GC)
--------------
samples: 3555
evals/sample: 1
This is with 32GB of RAM, so it’s not a memory issue (the array is only about 23MB.)