Should be fixed by JuliaGPU/CUDA.jl pull request #2528, "CUBLAS: Don't use BLAS1 wrappers for strided arrays, only vectors." (by maleadt).