Can't compile a GPU kernel function for =>

Stephan_Meyn · January 10, 2021, 1:42am

I’m trying to persuade CUDA.jl to compile a kernel function. After reducing the issue I have come to this sample code

using CUDA

x=Float32(2.0)
knots =CuArray{Float32,1}([1.2, 2.0, 3.0])
mapOp(v::Float32)::Int = v >= x ? 1 : 0
mapreduce(mapOp, + , knots)

When I run this I get this

ERROR: InvalidIRError: compiling kernel partial_mapreduce_grid(typeof(identity), typeof(+), Int64, CartesianIndices{1,Tuple{Base.OneTo{Int64}}}, CartesianIndices{1,Tuple{Base.OneTo{Int64}}}, Val{true}, CuDeviceArray{Int64,2,1}, Base.Broadcast.Broadcasted{CUDA.CuArrayStyle{1},Tuple{Base.OneTo{Int64}},typeof(mapOp),Tuple{CuDeviceArray{Float32,1,1}}}) resulted in invalid LLVM IR        
Reason: unsupported dynamic function invocation (call to >=)
Stacktrace:
 [1] mapOp at D:\steph\Google Drive\src\julia\bspline\broke2.jl:4

so it seems it can’t compile the >= function as it can’t determine the types that go into it. Hw do I address this?

stillyslalom · January 10, 2021, 1:55am

x is in the global scope, which prevents its type from being concretely inferred. You need to pass it as a function argument a la

julia> count_geq(arr, val) = count(>=(val), arr)
count_geq (generic function with 1 method)

julia> count_geq(cu([1.2, 2.0, 3.0]), 2.0f0)
2

edit: it’s almost as fast to just use count(knots .>= 2).

julia> using BenchmarkTools

julia> knots = 3*CUDA.rand(2^14);

julia> @btime CUDA.@sync count_geq($knots, 2.0f0)
  53.600 μs (77 allocations: 2.06 KiB)
5437

julia> @btime CUDA.@sync count($knots .>= 2)
  55.299 μs (100 allocations: 2.69 KiB)
5437

maleadt · January 11, 2021, 10:36am

If x were const, it would have worked too.

Topic		Replies	Views
Using mapreduce on GPU with CUDA.jl GPU question , cuda	1	721	December 25, 2023
Invalid LLVM IR error using CUDA GPU question , bug , cuda , error	3	242	August 17, 2024
Compiling kernel resulted in invalid LLVM IR Reason: unsupported dynamic function invocation GPU	11	4347	October 16, 2020
CUDA.jl unsupported call to an unknown function, unsupported dynamic function invocation GPU	3	1246	July 11, 2022
GPU Kernel Compilation Error -- Error in sharing functions in memory? New to Julia gpu	1	487	October 3, 2022

Can't compile a GPU kernel function for =>

Related topics