Search in CUDA vector

MohHizzani · February 18, 2023, 12:03pm

I have a vector of binary values, need to find all indices that are one, and randomly select one to update another vector at the same index selected.

jpsamaroo · February 22, 2023, 8:59pm

Please provide a minimum working example (MWE) of your code, as I requested in the #gpu channel in Slack.

For reference, the following code was provided in Slack:

# These are the binary vectors for candidate flips as columns in a matrix
candflips = abs.(sOld .- s′) 
# Iterate over columns
for j=1:size(candflips, 2)
        ff = findall(isone, candflips[:, j])
        if isempty(ff)
            # This's just some random elment when stuck at local minima
            Eoffset[j] += offset
        else
            idx = rand(ff, 1)[1]
            # Here implement the flip
            sNew[idx, j] = abs(one(eltype(sOld)) - sOld[idx, j])
        end
    end

MohHizzani · February 28, 2023, 12:44pm

This’s the CPU implementation of it

N, trials = 300, 10000
# These are the binary vectors for candidate flips as columns in a matrix
Eoffset = zeros(trials)
offset = 0.1
candflips = rand([0, 1], N, trials)
# Iterate over columns
for j=1:size(candflips, 2)
        ff = findall(isone, candflips[:, j])
        if isempty(ff)
            # This's just some random elment when stuck at local minima
            Eoffset[j] += offset
        else
            idx = rand(ff, 1)[1]
            # Here implement the flip
            sNew[idx, j] = abs(one(eltype(sOld)) - sOld[idx, j])
        end
end

jpsamaroo · March 1, 2023, 3:15pm

How are you initializing sNew and sOld?

MohHizzani · March 1, 2023, 3:55pm

N, trials = 300, 10000
sNew = rand([0, 1], N, trials)
sOld = rand([0, 1], N, trials)
# These are the binary vectors for candidate flips as columns in a matrix
Eoffset = zeros(trials)
offset = 0.1
candflips = rand([0, 1], N, trials)
# Iterate over columns
for j=1:size(candflips, 2)
        ff = findall(isone, candflips[:, j])
        if isempty(ff)
            # This's just some random elment when stuck at local minima
            Eoffset[j] += offset
        else
            idx = rand(ff, 1)[1]
            # Here implement the flip
            sNew[idx, j] = abs(one(eltype(sOld)) - sOld[idx, j])
        end
end

MohHizzani · March 23, 2023, 12:35pm

I solved with the following:

using CUDA
N, trials = 300, 10000
sNew = cu(rand([0, 1], N, trials))
sOld = cu(rand([0, 1], N, trials))
# These are the binary vectors for candidate flips as columns in a matrix
Eoffset = CUDA.zeros(trials)
offset = 0.1
candflips = cu(rand([0, 1], N, trials))
# find which trial (col in candflips) has a flip and who hasn't
hascandflips = sum(candflips; dims=1)' .> (zero(eltype(candflips)))
hasnotcandflips = sum(candflips; dims=1)' .== (zero(eltype(candflips)))
# use previous as a mask to update Eoffset vector
Eoffset .= hasnotcandflips .* (Eoffset .+ offset)
# generate rand matrix only with value (non-zero) is cand flip
# then select the max of these rand values and find the one that 
# equal and use it as a maks to update cusNew
maxes = CUDA.ones(1, trials)
mask = CUDA.rand(N, trials) .* candflips
maximum!(maxes, mask)
maskInv .= mask .== maxes
cusNew .= maskInv .* hascandflips' .* abs.(one(eltype(cusOld)) .- cusOld) .+ (one(eltype(maskInv)) .- (maskInv .* hascandflips')) .* cusOld

Topic		Replies	Views
GPU: Scalar indexing in kernel programming GPU cuda	2	256	June 5, 2023
CuArray local scope memory issue GPU	4	308	January 4, 2023
Elegant kernel for vectors in matrix GPU cuda	3	489	February 5, 2021
Add specific elements of a CUDA matrix GPU question , indexing , cuda , arithmetic	1	277	March 21, 2024
A reverse! for CuArrays and Nd arrays Performance cuda	6	817	July 8, 2019

Search in CUDA vector

Related topics