CUDA.jl - Better GPU but Worse Performance

gvijqb · June 27, 2022, 7:01pm

Gaming GPUs are best for FP16 and FP32. They are not performant for FP64 and are not designed for simulation workloads per se.

For FP64 you’d want to explore GPUs like Tesla V100 and A100s.

P2000 would be quite slow as well in comparison if you have large simulation workload.

Topic		Replies	Views
Why is my GPU kernel an order of magnitude slower than my CPU function? GPU question	8	239	June 4, 2025
Why is my kernel as slow in FP32 as in FP64 on A2000 Ada-based GPU? New to Julia gpu , cuda , float , kernel , cudajl	10	179	March 11, 2025
GPU compute & high precision general questions New to Julia gpu , cuda , opencl	19	3406	December 30, 2021
Performance comparison of Nvidia A100, V100, RTX2080Ti Performance gpu , cuda	17	5326	June 14, 2021
Why the Floating-Point Calculation Efficiency of CUDA.jl Does Not Reach the Official Theoretical Value GPU	1	111	February 2, 2025