You won’t be able to run arbitrary code like that on the GPU. GPU kernels need to be relatively simple; for an idea of what they look like, have a look at this introductory tutorial from the CUDA.jl docs: Introduction · CUDA.jl
If you want more complicated operations to work, without the experience to write your own kernels, you’ll be better off relying on the array abstractions we provide. These are vectorized operations (think map, reduce, scan, sort) that have been implemented in a parallel manner, and use the GPU efficiently. If it’s possible to rephrase your problem in terms of such operations, that will be the easier way to use the GPU.
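As a rough illustration of what those abstractions look like in practice (variable names are made up, and this requires a CUDA-capable GPU):

```julia
using CUDA

# Illustrative data; any isbits element type works on the GPU.
a = CUDA.rand(1024)
b = CUDA.rand(1024)

# Broadcasting fuses into a single GPU kernel.
c = a .+ 2 .* b

# map/reduce-style operations run in parallel on the device.
total = mapreduce(x -> x^2, +, c)

# Scans and sorting are available too.
cs = accumulate(+, c)
sort!(c)
```

If your whole computation can be expressed as a pipeline of operations like these, you never have to think about threads, blocks, or kernel launches yourself.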
If that doesn’t work, you’ll need to look into writing your own kernel, but there are a lot of caveats: you cannot use dynamic dispatch, GC allocations, non-isbits types, etc.
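For reference, a hand-written kernel ends up looking something like this sketch of an element-wise addition (closely following the style of the introductory tutorial; names are illustrative):

```julia
using CUDA

# A minimal hand-written kernel: element-wise addition.
# Note the constraints: no allocations, no dynamic dispatch,
# only isbits arguments, and no return value.
function gpu_add!(y, x)
    i = (blockIdx().x - 1) * blockDim().x + threadIdx().x
    if i <= length(y)
        @inbounds y[i] += x[i]
    end
    return nothing
end

x = CUDA.ones(1024)
y = CUDA.zeros(1024)

# The caller picks the launch configuration explicitly.
threads = 256
blocks = cld(length(y), threads)
@cuda threads=threads blocks=blocks gpu_add!(y, x)
```

Anything inside `gpu_add!` has to compile down to straightforward GPU code, which is why the restrictions above apply.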