Sure, but note of course that the Vega 64, like most consumer GPUs, is bad at Float64:
This claims the Vega 64 has a max of 791 GFLOPS of Float64, but I’ve measured more than that here. I think my card is clocked higher, but I haven’t fiddled with its speed, unlike the CPU.
Also worth pointing out that I originally had the 7980XE running at 3.8 GHz for all-core AVX-512, but it crashed while running the benchmark, so I lowered it to 3.7 GHz. If you want to test an overclock, MKL is the way to go
.
If you want a GPU that’s good at Float64, get the Nvidea Titan V. It has about 7450 GFLOPS of Float64, so it should handily trounce the CPUs in the above benchmark. Or use the cloud, where you’ll probably get something like a V100, which has 7000 GFLOPS of Float64.
With AMD, you’d need an MI series card, which you’d probably have a hard time finding on the cloud, and costs a lot more money than a Titan V for similar performance (while also being a data center card, making it awkward for personal use).
Also, for fun, my 10980XE again, which retails for half the price of the 7980XE:
It did exceed 2000 GFLOPS of
Float64.
The gap between MKL and OpenBLAS is also much larger, which makes me realize I didn’t overclock the cache (uncore) to match the core, so I’m guessing MKL is much cache friendlier than OpenBLAS, and that they perhaps fair more similarly on closer-to-stock settings.
I should try overclocking the uncore a bit.