Running OOM trying to load data to GPU

lepton01 · June 17, 2023, 2:01am

Hello there.
After training the CNN, I wrote a function to estimate the accuracy of it.

function accuracy(A, B, name)
    BSON.@load name * ".bson" model
    model = model |> gpu
    X1, Y1 = A
    X2, Y2 = B
    Y_tr_r = model(X1 |> gpu) |> cpu
    Y_te_r = model(X2 |> gpu) |> cpu
    a = mean(isapprox.(Y_tr_r, Y1; atol=0.015)) * 100
    b = mean(isapprox.(Y_te_r, Y2; atol=0.015)) * 100
    return a, b
end

Problem is: the GPU runs OOM when trying to load the data to it. I do not understand why, the data is not big in size:
X1 is Array{Float64,4} dims=(128,128,1,5000), X2 is similarly Array{Float64,4} dims=(128,128,1,1250), Y1 and Y2 are even smaller.
The exact line it errors is Y_tr_r = model(X1 |> gpu) |> cpu
GPU: Nvidia GeForce GTX 1660 SUPER (6 GB VRAM).
Yes, I CUDA.reclaim() finishing training, so the VRA;M is mostly free…
Using the CPU works, but I would like to know if using the GPU is faster to evaluate.

ToucheSir · June 17, 2023, 2:38pm

What is model? If your model is large enough, it’s possible the allocations from the forward pass would be enough to OOM. Even for something the size of a Resnet-18, 128^2 with a batch size of 5000 could OOM a 8GB GPU, let alone a 6GB one (consider that batch sizes are usually < 512)!

lepton01 · July 8, 2023, 6:26am

My apologies for not answering earlier. I do believe that was the problem, I was abusing mu GPU with ridiculous amounts of neurons after the Conv layers…

Topic		Replies	Views
Flux's model-zoo CIFAR10 example saturates 8GB gpu General Usage gpu , flux	5	672	June 29, 2020
OOM when using Flux and loops GPU	4	610	April 14, 2022
Out of memory using Flux CNN during back propagation phase Machine Learning	2	650	June 28, 2019
Flux: GPU not working as expected Machine Learning flux	6	2221	July 28, 2020
`CUDA error: out of memory` with Flux Machine Learning flux	4	1668	August 24, 2020

Running OOM trying to load data to GPU

Related topics