I’m trying to work with a dataset that has missing data using CUDA.jl. I’d like to skip the missing values in some statistics (mean, covariance, etc.). Is there a general solution to this that performs well on a GPU? I have written custom kernels to do this, but I think I may be missing something and there could be a cleaner way to do it.
I’d like to be able to write something short like:
using CUDA
using Statistics

a = rand(1000, 1000)               # random data
a[a .> 0.5] .= NaN                 # simulate missing data with NaN, since there is no missing support
a = cu(a)
mapslices(x -> mean(filter(!isnan, x)), a, dims=1)  # average of the non-missing data in each column
This works but is extremely slow on the GPU. Everything I have tried is either very slow or won’t compile for the GPU. Is there a reasonably fast way to do this without custom kernels?
I don’t have a GPU to test this right now, but reduce and mapreduce should be fast on CUDA arrays, so things like
reduce(a, dims=1, init=0f0) do acc, val   # a custom operator needs an explicit neutral element (init) on the GPU
    isnan(val) ? acc : val + acc
end
should be an efficient way to filter NaNs out of a sum. For more complex statistics, it’s worth checking whether they’re easy to get using, say, Transducers or OnlineStats. In principle both packages should work with reduce and thus be GPU compatible (I haven’t checked, though).
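For the mean in your original example, here is a sketch of the same idea (again untested on a GPU here; the variable names are just illustrative): take a NaN-filtered sum and divide by the count of non-NaN entries per column, both computed with mapreduce.

using CUDA

a = CUDA.rand(1000, 1000)
a = ifelse.(a .> 0.5f0, NaN32, a)   # simulate missing data without scalar indexing

nansum = mapreduce(x -> isnan(x) ? 0f0 : x, +, a, dims=1)    # per-column sum, ignoring NaN
nancnt = mapreduce(x -> isnan(x) ? 0f0 : 1f0, +, a, dims=1)  # per-column count of non-NaN values
colmean = nansum ./ nancnt          # NaN-ignoring column means (NaN where a column is all-NaN)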
You should set CUDA.allowscalar(false), which will probably reveal that mapslices has no GPU implementation for CuArray and is falling back to slow scalar indexing. Generally, missing isn’t supported either, since CuArray doesn’t support Union element types.
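Roughly like this (just to illustrate the effect):

using CUDA
CUDA.allowscalar(false)   # turn silent element-by-element (scalar) fallbacks into errors

a = CUDA.rand(1000, 1000)
a[1, 1]                   # now throws a scalar-indexing error instead of running slowly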