The relu function works elementwise, returning the input where it is non-negative and zero otherwise, i.e. relu(x) = ifelse.(x .> 0, x, 0).
Does the fact that it works elementwise mean I need to write a GPU kernel for it, or can I simply apply it to a CuArray?
Or perhaps I need to convert it to an in-place version, relu!(y, x) = begin y .= ifelse.(x .> 0, x, 0); return nothing end?
Also, if I am writing a package and have no idea whether the user has a CPU or a specific GPU, how can I write code that works independently of the hardware? The user may have data in a standard Array, a CuArray, a ROCArray, a oneArray, or an MtlArray… and she just calls the function (my function) and the computation is done on the appropriate hardware.
CUDA.jl has some useful docs on this. Writing it in terms of broadcasting is array programming, which runs on the GPU (when the input is a CuArray); if you can express the operation in terms of operations like that, then you don't need to write a kernel. So broadcasting is a good way to go (and it supports accelerators other than CUDA; GPUArrays.jl is the generic package, I believe). If you do end up needing to write kernels and want to do so in a generic way, then KernelAbstractions.jl is the package for that.
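For example, a minimal sketch of the broadcast-only approach (untested; the relu/relu! names and the zero(eltype(x)) default, which keeps the element type stable, are just illustrative):

```julia
# Works on Array, CuArray, ROCArray, oneArray, MtlArray, ... because
# each GPU back-end overloads broadcasting for its own array type.
relu(x::AbstractArray) = ifelse.(x .> 0, x, zero(eltype(x)))

# In-place variant: writes into y's existing storage.
function relu!(y::AbstractArray, x::AbstractArray)
    y .= ifelse.(x .> 0, x, zero(eltype(x)))
    return y
end
```

The same function then runs on whatever hardware owns the data, e.g.

```julia
using CUDA
x = CUDA.rand(Float32, 1024)  # a CuArray on the GPU
y = relu(x)                   # the broadcast executes on the GPU
```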
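And if you do end up needing a hand-written kernel, here is a sketch of the KernelAbstractions.jl route (assuming the v0.9-style API; relu_kernel! and relu_ka! are made-up names):

```julia
using KernelAbstractions

# One work-item per element; @Const marks x as read-only.
@kernel function relu_kernel!(y, @Const(x))
    i = @index(Global)
    @inbounds y[i] = ifelse(x[i] > 0, x[i], zero(x[i]))
end

# get_backend picks CPU(), CUDABackend(), ROCBackend(), ... from x itself,
# so the same code launches on whatever device holds the data.
function relu_ka!(y, x)
    backend = get_backend(x)
    relu_kernel!(backend)(y, x; ndrange = length(x))
    KernelAbstractions.synchronize(backend)
    return y
end
```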