I have way more RAM than VRAM, and the code runs fine on CPU. I thought using CUDA unified memory would let the GPU tap system RAM, but it still OOMs.
I’m asking because I’m getting a new laptop for ML work. Does unified memory mean VRAM is no longer a hard ceiling, and therefore less important as a spec? (I know having the GPU tap system RAM is way slower, but at least it runs.)
What you’re describing sounds more like an integrated GPU sharing physical memory with the CPU. Apple’s Unified Memory Architecture does something similar in hardware. CUDA’s Unified Memory is an unrelated software abstraction that lets GPU and CPU code access data through the same pointer, even though the data is actually migrated between CPU and GPU memory behind the scenes.
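To make the “same pointer on both sides” idea concrete, here’s a minimal toy sketch in CUDA C++ (my own example, not from anyone’s actual code): one buffer allocated with `cudaMallocManaged` is written by the host, updated by a kernel, then read back by the host, all through the one pointer, with the runtime handling migration.

```cuda
// Toy sketch of CUDA Unified Memory: one managed pointer is valid
// on both host and device; the runtime migrates pages on demand.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void add_one(int *data, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += 1;  // device writes through the shared pointer
}

int main() {
    const int n = 1 << 20;
    int *data = nullptr;
    cudaMallocManaged(&data, n * sizeof(int));  // managed allocation

    for (int i = 0; i < n; ++i) data[i] = i;    // host writes directly

    add_one<<<(n + 255) / 256, 256>>>(data, n); // kernel uses same pointer
    cudaDeviceSynchronize();                    // wait before host touches it

    printf("data[0] = %d\n", data[0]);          // host reads the result
    cudaFree(data);
    return 0;
}
```

Note this only oversubscribes GPU memory on platforms that support demand paging; whether a managed allocation larger than VRAM works depends on the OS and GPU generation.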
Cool, I didn’t know it could split a single variable. I’m on Windows with CUDA 12 but a really old GPU, so I probably won’t bother debugging. For other folks: you can have CUDA.jl allocate unified buffers by default by adding a LocalPreferences.toml to your environment folder with these lines:
[CUDA]
default_memory = "unified"