Cheers. I have developed a semantic segmentation model in Flux with around 7M parameters. I have checked its inference performance for two-class segmentation with an input array of size (512, 512, 3, 1).
The results from BenchmarkTools.jl are shown in the table below, measured on the same PC; in the first row the GPU is disabled. What stands out is that the memory figure is in the GB range for the CPU case, while the GPU case is only in the KB range. Could it be that BenchmarkTools does not include GPU memory in that metric? And for the CPU case, does the figure mean the model is too expensive to run on limited devices such as IoT application processors?
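For concreteness, the benchmarks are essentially of this form (a minimal sketch; the model definition is omitted and `model` is just a placeholder for my network):

```julia
using Flux, BenchmarkTools, CUDA

x = rand(Float32, 512, 512, 3, 1)   # dummy input of the size mentioned above

# CPU case (the row where the GPU is disabled)
@benchmark $model($x)

# GPU case: move model and input to the device and synchronize,
# so the whole forward pass is timed rather than just the kernel launch
model_gpu = gpu(model)
x_gpu     = gpu(x)
@benchmark CUDA.@sync $model_gpu($x_gpu)
```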
As a side question: its speed on the GPU is 2-3x lower than that of equivalent models found elsewhere on GitHub. I have already applied many of the recommendations for reducing allocations, with little improvement. Any hints on where to look for further improvements are welcome.
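In case it helps with either question, my working assumption is that the BenchmarkTools memory estimate only counts allocations on the Julia (CPU) heap; for the device side, CUDA.jl has its own reporting, something like this (placeholder names again):

```julia
using CUDA

# `model_gpu` and `x_gpu` are placeholders for the network and input
# already moved to the GPU.

# Reports time plus both CPU and GPU allocations for one forward pass.
CUDA.@time model_gpu(x_gpu)

# Shows how much device memory is currently in use overall.
CUDA.memory_status()
```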
Maybe, maybe not. Equally if not more important than the total amount of memory allocated could be the peak memory the model holds at any one point in time.
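For example (a rough sketch; `model` stands in for your network on the CPU): the resident size of the parameters is what a constrained device actually has to hold, roughly 7e6 × 4 bytes ≈ 28 MB for Float32 weights, whereas the GB figure from BenchmarkTools is the total memory allocated and freed again over one forward pass.

```julia
using Flux

# `model` is a placeholder for your 7M-parameter network on the CPU.

# Resident size of the parameters (and other captured state):
# roughly 7e6 * 4 bytes ≈ 28 MB for Float32 weights.
Base.summarysize(model)

# Total bytes allocated by a single forward pass, temporaries included;
# this is what the BenchmarkTools memory estimate corresponds to.
x = rand(Float32, 512, 512, 3, 1)
stats = @timed model(x)
stats.bytes
```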
Side note: are you aware of the existence of Machine Learning - Julia Programming Language? Posts in too general a category can get lost, because many people only check specific categories regularly.