Checking that work is being sent to processors: GPU vs Multiple CPUs

ChrisRackauckas · September 14, 2022, 1:56am

That’s really odd. The REPL isn’t showing you anything? If you run other GPU codes, is it fine?

jmair · September 14, 2022, 10:04am

Using watch -n 1 nvidia-smi might be useful to keep it going.

When CUDA loads it downloads and installs the toolkit. You might want to do this manually first to be sure it works:

using CUDA
CUDA.versioninfo()

This should download everything you need.

yoshi · September 14, 2022, 1:03pm

This helped a little: I see that the GPU’s fans are turning on as a result some usage. But I see nothing in the processes section. Should I expect to see something here?

yoshi · September 14, 2022, 1:06pm

Is there some example julia code floating around where I would expect to see something populating in the processes section in the nvidia-smi tool? Sorry, I’m new to all this. I’m just relying on script kiddie luck and matlab experience to get me through right now.

ChrisRackauckas · September 14, 2022, 1:14pm

Did you run the first CUDA tutorial?

https://cuda.juliagpu.org/dev/tutorials/introduction/#Your-first-GPU-computation

You should be able to see utilization from just a simple matmul.

using CUDA
A = cu(rand(1000,1000))
B = cu(rand(1000,1000))
A*B

yoshi · September 14, 2022, 1:21pm

The only indication that anything is occurring is that the GPU fan turns on, the temp goes up, and the power usage increases. I don’t see anything in the processes section though. I’m running watch -n .2 nvidia-smi.

Ya – I went through this tutorial and watch some of the youtube videos online.

carstenbauer · September 14, 2022, 1:26pm

Does the performance of the matmul increase? Like by orders of magnitude.

yoshi · September 14, 2022, 1:34pm

yes

maybe this is something about how the integrator is programmed. are processes running so fast that they’re not being picked up in nividia-smi?

carstenbauer · September 14, 2022, 2:33pm

Well, could be, nvidia-smi probably also has a certain sampling rate. In any case, the performance increase tells you that it’s running on the GPU. Otherwise, if you only want to run “something” on the GPU and see it pop up in nvidia-smi you could try to run a GPU stresstest with GPUInspector.jl

jmair · September 14, 2022, 3:06pm

If you are running the examples in the REPL and look at nvidia-smi in a different process concurrently then a process of julia should show up, even if it is not processing at the time, since it still has memory allocated.

yoshi · September 14, 2022, 5:00pm

This is strange. Using the stresstest, I still see nothing in nvidia-smi’s processes.

However, when I look at the reporting tools that come with GPUInspector, I can see the GPU is being utilized. Its reporting some of the metrics that I mentioned before (power, temperature, fan use). It also shows GPU% utilization which appears non-zero (not the case in nvidia-smi – see the screenshot above).

I’m not sure why nvdia-smi isn’t reporting these processes. Do you see them when you run stresstest?

Update: I ran the monitor commands from GPUInspector and wrapped them around the ODEcomputations that I was originally trying to run and still saw 0% GPU utilization, until the very end of the computation.

yoshi · September 14, 2022, 5:01pm

Ya that’s strange, I see nothing. What do you see in nvidia-smi when you run julia?

Topic		Replies	Views
DiffEqFlux GPU example slow GPU performance	3	627	January 14, 2021
Potentially Getting Started With DifferentialEquations.jl New to Julia gpu	1	404	July 25, 2022
DiffEqGPU - slow parallel solving of SDEs on GPU GPU	6	403	March 3, 2024
Julia compiler v.s. Julia GPU compiler GPU	4	399	December 19, 2024
How to distribute computation over different CPU's of my desktop New to Julia parallel , multithreading	14	702	February 5, 2022

Checking that work is being sent to processors: GPU vs Multiple CPUs

Related topics