Factorial on GPU

jeremiedb · December 15, 2021, 5:11am

I wanted to perform a factorial calculation within a kernel for implementing a statistical measure (Poisson likelihood) on GPU,
It turns out like factorial isn’t supported on GPU:

using CUDA

function gpu_fac(y, x)
    for i = 1:length(y)
        @inbounds y[i] += factorial(x[i])
    end
    return nothing
end

N = 10
x = CUDA.fill(3, N)
y = CUDA.fill(1, N)

@cuda gpu_fac(y, x)

ERROR: LoadError: InvalidIRError: compiling kernel gpu_fac(CuDeviceVector{Int64, 1}, CuDeviceVector{Int64, 1}) resulted in invalid LLVM IR

Would it makes sense to have factorial support on the GPU?

jling · December 15, 2021, 6:07am

not really, for integers, there are less than 20 numbers that fits in the range (depending on if you pick Int32 or Int64). Besides, 20 scalar multiplication is definitely not worth going to GPU by any means.

(if you’re thinking using floating numbers, you’re also out of luck:

julia> reduce(*, 1:1.0:25) |> BigInt
15511210043330986055303168

julia> factorial(BigInt(25))
15511210043330985984000000

Topic		Replies	Views
ANN: Anyone need my 2.7x faster factorial (or gamma) function? Performance	22	2513	January 23, 2019
Why is my GPU kernel an order of magnitude slower than my CPU function? GPU question	8	227	June 4, 2025
Compiling kernel resulted in invalid LLVM IR Reason: unsupported dynamic function invocation GPU	11	4319	October 16, 2020
Laguerre polynomials on GPU GPU gpu , cuda , polynomials	27	1331	June 9, 2023
Factorial function of a real number of complex number Numerics	2	469	May 11, 2022

Factorial on GPU

Related topics