Sure, but note of course that the Vega 64, like most consumer GPUs, is bad at Float64
:
This claims the Vega 64 has a max of 791 GFLOPS of Float64
, but I’ve measured more than that here. I think my card is clocked higher, but I haven’t fiddled with its speed, unlike the CPU.
Also worth pointing out that I originally had the 7980XE running at 3.8 GHz for all-core AVX-512, but it crashed while running the benchmark, so I lowered it to 3.7 GHz. If you want to test an overclock, MKL is the way to go .
If you want a GPU that’s good at Float64
, get the Nvidea Titan V. It has about 7450 GFLOPS of Float64
, so it should handily trounce the CPUs in the above benchmark. Or use the cloud, where you’ll probably get something like a V100, which has 7000 GFLOPS of Float64
.
With AMD, you’d need an MI series card, which you’d probably have a hard time finding on the cloud, and costs a lot more money than a Titan V for similar performance (while also being a data center card, making it awkward for personal use).
Also, for fun, my 10980XE again, which retails for half the price of the 7980XE:
It did exceed 2000 GFLOPS of
Float64
.
The gap between MKL and OpenBLAS is also much larger, which makes me realize I didn’t overclock the cache (uncore) to match the core, so I’m guessing MKL is much cache friendlier than OpenBLAS, and that they perhaps fair more similarly on closer-to-stock settings.
I should try overclocking the uncore a bit.