That’s very unlikely to work. You cannot dynamically allocate memory inside a GPU kernel (see also this recent post: Modifying a thread-local vector within CUDA Dynamic Parallelism - #2 by vchuravy).
What should work though is to allocate all CuArrays outside the kernel, then inside the kernel convert the relevant `view`s into your arrays into `SMatrix`/`SVector`s and do the solve on StaticArrays only. (I don't have access to a GPU atm to check.)
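A minimal sketch of what I mean, assuming a batch of small (here 3×3) systems; the names `batched_solve_kernel!`, `As`, `bs`, `xs` and the storage layout are just placeholders, and I haven't been able to run this on a GPU:

```julia
using CUDA, StaticArrays

function batched_solve_kernel!(xs, As, bs)
    i = (blockIdx().x - 1) * blockDim().x + threadIdx().x
    if i <= size(As, 3)
        # Build stack-allocated StaticArrays from views into global memory;
        # nothing is dynamically allocated inside the kernel.
        A = SMatrix{3,3}(@view As[:, :, i])
        b = SVector{3}(@view bs[:, i])
        x = A \ b  # solve entirely on StaticArrays
        # Write the result back element by element.
        for k in 1:3
            @inbounds xs[k, i] = x[k]
        end
    end
    return nothing
end

# Allocate all CuArrays outside the kernel.
n  = 1024
As = CUDA.rand(3, 3, n)
bs = CUDA.rand(3, n)
xs = CUDA.zeros(3, n)

@cuda threads=256 blocks=cld(n, 256) batched_solve_kernel!(xs, As, bs)
```

The point is that the `SMatrix`/`SVector` live in registers (or local memory), so the per-thread solve never touches the GPU allocator.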