How to parallelize dual coordinate descent methods on GPU using CUDA.jl?

First, I ran the following on the CPU:

using CUDA, CUDA.CUSPARSE
using LinearAlgebra, SparseArrays

numvec = 10
lenvec = 5
xs = [sprandn(lenvec, 0.5) for i in 1:numvec]
w = randn(numvec)
d = Vector{Float64}(sum(w .* xs))

which gives

5-element Array{Float64,1}:
 -0.12287585509797144
  0.33370779975590414
  2.9808236024752204
 -1.055451750249576
 -2.7127608286874803
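For reference, the CPU sum above is equivalent to a single sparse matrix-vector product, which is the form that maps most naturally to the GPU (`hcat` stacks the sparse vectors into a `SparseMatrixCSC` whose columns are the `xs`):

```julia
using LinearAlgebra, SparseArrays

numvec, lenvec = 10, 5
xs = [sprandn(lenvec, 0.5) for i in 1:numvec]
w = randn(numvec)

# hcat builds a lenvec × numvec SparseMatrixCSC, so
# d = Σᵢ w[i] * xs[i] becomes one matrix-vector product.
X = hcat(xs...)
d = X * w

@assert d ≈ Vector{Float64}(sum(w .* xs))
```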

Next I tried:

x̃s = [CuSparseVector(x) for x in xs]
w̃ = CuVector(w)
d̃ = CuVector(zeros(lenvec))
for i in 1:numvec
    axpyi!(w̃[i], x̃s[i], d̃, 'O')
end
d̃

This gives the same result, but with a warning about scalar indexing: `w̃[i]` reads a single element of a GPU array back on the CPU in every iteration.
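The scalar-indexing warning can be silenced by keeping the coefficients on the host while the sparse vectors stay on the device (a minimal sketch continuing from the snippet above; `axpyi!` still issues one CUSPARSE call per vector, so this is sequential, just warning-free):

```julia
# continues from the previous snippet: x̃s, w̃, numvec, lenvec in scope
d̃ = CuVector(zeros(lenvec))
ws = Array(w̃)                      # copy coefficients to the host once
for i in 1:numvec
    axpyi!(ws[i], x̃s[i], d̃, 'O')  # ws[i] is a plain Float64, no GPU indexing
end
```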
Then I tried writing a kernel:

function addto!(d, w, xs::Vector{CuSparseVector{Float64}})
    # intended: one thread per sparse vector
    i = threadIdx().x
    axpyi!(w[i], xs[i], d, 'O')
    nothing
end

d̃ = CuVector(zeros(lenvec))
@cuda threads=10 addto!(d̃, w̃, x̃s)

but this fails with KernelError: kernel returns a value of type Union{}. How can this be parallelized properly?
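The Union{} return type presumably arises because the kernel body can never return normally: axpyi! is a host-side CUSPARSE wrapper, not device code, and a Vector of CuSparseVectors is not an isbits value that @cuda can pass to a kernel. One way around both problems is to reformulate the reduction as a single sparse matrix-vector product, which CUSPARSE parallelizes internally. A sketch, assuming your CUDA.jl version supports constructing a CuSparseMatrixCSR from a SparseMatrixCSC and multiplying it with a CuVector:

```julia
using CUDA, CUDA.CUSPARSE
using LinearAlgebra, SparseArrays

numvec, lenvec = 10, 5
xs = [sprandn(lenvec, 0.5) for i in 1:numvec]
w = randn(numvec)

# One sparse matrix holds all the vectors as columns;
# d = X * w replaces the whole axpyi! loop.
X̃ = CuSparseMatrixCSR(sparse(hcat(xs...)))
w̃ = CuVector(w)
d̃ = X̃ * w̃              # single CUSPARSE matrix-vector product on the GPU

@assert Array(d̃) ≈ Vector(hcat(xs...) * w)
```

This keeps all the parallelism inside one library call instead of trying to call host wrappers from inside a hand-written kernel.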