Hi all,
I released CUDA.jl 3.3 on Friday, with several exciting new features. There’s a blog post summarizing those features, as well as some from CUDA.jl 3.1 and 3.2 (for which there wasn’t a blog post): CUDA.jl 3.3 ⋅ JuliaGPU
Key highlights:
- CuArray support for isbits union element types (useful for `nothing` and `missing`; see the sketch after this list)
- Ability to emit debug and location information for GPU code
- Support for CUDA’s semantic versioning (so you can use CUDA 11.3 on a driver for 11.0)
- High-level wrappers for the CUDA graph APIs
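For the union-type support, something like this should now work (a minimal sketch; the array contents are just for illustration):

using CUDA

a = CuArray([1f0, missing, 3f0])   # eltype is Union{Float32, Missing}
b = coalesce.(a, 0f0)              # broadcasting handles the union eltype on the GPU
sum(b)                             # 4.0f0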
How should the new graph APIs be applied to custom kernels?
Something like this?
@captured @cuda threads=Nth blocks=Nbl kernel(A)
And if I use a launch configuration, e.g.
ckernel = @cuda launch=false kernel(A)
config = launch_configuration(ckernel.fun)
threads = min(N, config.threads)
blocks = cld(N, threads)
ckernel(A; threads=threads, blocks=blocks)
how should I apply the @captured macro?
Like this:
@captured begin
ckernel = @cuda launch=false kernel(A)
config = launch_configuration(ckernel.fun)
threads = min(N, config.threads)
blocks = cld(N, threads)
ckernel(A; threads=threads, blocks=blocks)
end
or like this:
ckernel = @cuda launch=false kernel(A)
config = launch_configuration(ckernel.fun)
threads = min(N, config.threads)
blocks = cld(N, threads)
@captured ckernel(A; threads=threads, blocks=blocks)
Thank you.
The macro doesn’t care; just encapsulate any chunk of code that performs a launch. See the tests, for example: CUDA.jl/graph.jl at 71d5f39daf4ffcb8d104f5a10a26f096b8150695 · JuliaGPU/CUDA.jl · GitHub. Note that graph recording doesn’t support all CUDA APIs, but the occupancy API shouldn’t be a problem (the broadcast example in the blog post uses the occupancy API to determine a launch configuration).
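Concretely, something along these lines should work (a sketch reusing the names from your snippet, wrapped in a hypothetical loop since repeated execution is where graph caching pays off):

for step in 1:nsteps
    @captured begin
        ckernel = @cuda launch=false kernel(A)
        config = launch_configuration(ckernel.fun)
        threads = min(N, config.threads)
        blocks = cld(N, threads)
        ckernel(A; threads=threads, blocks=blocks)
    end
end

The first iteration records and instantiates a graph; subsequent iterations update and relaunch it, which is where the performance benefit comes from.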