GPU memory in Julia

I am training a U-Net with 3D inputs of size 512×512×128. With the same model, the same loss function, and the same Float32 inputs and labels, PyTorch can fit and train the model while Julia cannot. Julia also needs much longer to prepare for training. Is there a problem with my setup, or is this just the nature of Julia?

I’m not quite familiar with Lux, but rand(512,512,128,1,1) creates a Float64 array.

Does dev convert it to Float32 when it moves the array to the GPU? Did you check?
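If not, a quick check along these lines might help (a minimal sketch, assuming a Lux + LuxCUDA setup where dev comes from gpu_device()):

using Lux, LuxCUDA

# rand(...) defaults to Float64; request Float32 explicitly instead
x = rand(Float32, 512, 512, 128, 1, 1)

dev = gpu_device()
x_gpu = x |> dev

# sanity check that nothing was silently promoted to Float64
@show eltype(x_gpu)  # expect Float32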

Did you also close the Python process before such that Julia and Python don’t have to share memory?


It would be helpful to share the code for the PyTorch model and the Julia model

PyTorch:

class UNet(nn.Module):
    ...

Julia:

function UNet()
    return Chain(...)
end

I converted all the arrays to Float32, moved them with |> dev, and tried again. Still no luck. And yes, I made sure the Python process was closed before running Julia; I’m watching nvidia-smi every 2 seconds.
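In addition to nvidia-smi, it can help to inspect GPU memory from the Julia side, since CUDA.jl keeps freed memory in a pool and nvidia-smi can report more usage than the model actually needs. A minimal sketch, assuming CUDA.jl is the GPU backend:

using CUDA

# report used / free device memory, including what CUDA.jl's pool holds
CUDA.memory_status()

# drop unreferenced arrays and return cached pool memory to the driver
GC.gc()
CUDA.reclaim()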

Please share an MWE so that others can help debug. Without both models and the full training loops, it’s hard to know where to start.
