CUDAnative: register host memory for pinned memory access

maleadt · April 18, 2019, 9:27pm

Your use of register is confusing, do you want pinned memory and an async memcpy, or do you want to register an existing host pointer and map it into device space?

Here’s an example of the former:

julia> A = zeros(nx);

julia> A_cpuptr = pointer(A)
Ptr{Float64} @0x00007f360f7ff040

julia> A_buf = Mem.register(Mem.Host, A_cpuptr, sizeof(A), Mem.HOSTREGISTER_DEVICEMAP)
CUDAdrv.Mem.HostBuffer(Ptr{Nothing} @0x00007f360f7ff040, 8388608, CuContext(Ptr{Nothing} @0x000000000255dc70, false, true), true)

julia> A_gpuptr = convert(CuPtr{Float64}, A_buf)
CuPtr{Float64}(0x0000000202c40040)

julia> A_d = unsafe_wrap(CuArray, A_gpuptr, size(A));


# proof the devicemap works

julia> A[1] = 42
42

julia> A_d[1]
42.0

A_d is now a device array bound to a CPU memory allocation. Accessing that memory from the GPU is pretty expensive though, since it incurs PCIE reads.

Topic		Replies	Views
Initializing @cuStaticSharedMem array? GPU	3	1337	May 12, 2018
Shared memory limitations GPU	4	944	April 29, 2020
Constant Memory? GPU	11	2587	July 18, 2018
Local thread memory in GPU using StaticArrays GPU question , gpu , cuda	4	6245	January 26, 2020
Release: CUDAdrv/CUDAnative 2.0, CuArrays 1.0 Package Announcements gpu	0	898	March 22, 2019

CUDAnative: register host memory for pinned memory access

Related topics