Have you seen the code?
It runs 7 times and the measurement is the median.
It's really not cool to be so definite without checking.
Regarding the code, please review it and point me to any place you think I should add it, and I will.
Just out of curiosity: I understand that for an existing array it is important to broadcast / fuse the assignment with the .= operator. But I'd think that for a newly generated array it won't matter. Not that this is the reason they are missing; I was just not aware enough of that. I'm not very experienced with Julia.
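To make the question concrete, here is a minimal sketch of the two cases, assuming a simple elementwise expression. The function names update_inplace! and make_new are hypothetical and are not taken from the benchmark code being discussed. The first function writes into an existing array with .=, and the second creates a new array from the same dotted expression.

```julia
# Hypothetical helper names; not taken from the benchmark code under discussion.

# Existing array: `.=` fuses the whole right-hand side into one loop and
# stores the result directly in `y`, so no new output array is created.
function update_inplace!(y, x, a, b)
    y .= a .* x .+ b
    return y
end

# New array: the dotted right-hand side is still fused into a single loop,
# but a fresh output array is allocated to hold the result.
function make_new(x, a, b)
    return a .* x .+ b
end

x = rand(1_000)
y = similar(x)

# Warm-up calls so that compilation is not counted in the numbers below.
update_inplace!(y, x, 2.0, 1.0)
make_new(x, 2.0, 1.0)

println("in-place bytes:  ", @allocated update_inplace!(y, x, 2.0, 1.0))
println("new array bytes: ", @allocated make_new(x, 2.0, 1.0))
```

Under these assumptions, the difference between the two is the allocation of the output array. Whether that matters in a given benchmark depends on the array size and how often the line runs.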
I think you’re wrong.
First, those results don't make sense. There is no way 1 thread of OpenBLAS can match Intel MKL on 4 threads. I think both @ChrisRackauckas and I pointed this out when you first published them.
I also think your computer, which is a laptop, might be thermally limited and have a single memory channel, which means it easily becomes memory bound. My CPU has quad-channel memory, hence threading has more potential to improve results.
Second, I run @elapsed 7 times and take the median. If Julia uses the correct CPU timers when @elapsed is used, it should be a good measurement technique. This is probably what @btime is doing under the hood.
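For readers who want to see the measurement described above, here is a rough sketch of the median of 7 @elapsed runs. The helper median_elapsed, the warm-up call, and the matrix sizes are assumptions made for illustration; this is not the actual benchmark code.

```julia
using Statistics  # provides median

# Hypothetical helper: time `f` with @elapsed `runs` times, return the median.
function median_elapsed(f; runs = 7)
    f()  # warm-up call (an assumption here) so compilation time is excluded
    times = [@elapsed(f()) for _ in 1:runs]
    return median(times)
end

# Illustrative workload; the sizes are arbitrary.
A = rand(200, 200)
B = rand(200, 200)

t = median_elapsed(() -> A * B)
println("median of 7 runs: ", t, " s")
```

For comparison, @btime from BenchmarkTools collects many samples and prints the minimum time rather than the median, so the two approaches can report somewhat different numbers for the same code.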