Function closure with large arrays/matrices scaling horribly

vivian-rogers · November 5, 2021, 8:38pm

Hi!

So I am trying to set up a useful function f to pass around the rest of my code as follows:


function f_prepostproc(params)
   # do stuff
   f = f_generator(params)
   # evaluate f at certain x values, save to file
   return f
end

function f_generator(params)
   A = A_gen(params)
   B = B_gen(params)
   function f(x)
      # do an operation on A, B(x), get C (large matrix)
     return C #
  end
  return f
end 

function main(params)
   f = f_prepostproc(params)
   dostuff(f)
end

Unfortunately, this takes upwards of an hour to return from this call stack, especially as the size of the matrices A and array B get large (~14,000 x ~14,000 and ~80,000 respectively). After this, the code takes a negligible amount of time to actually evaluate the f(x) as it is passed around in the main function. I am new to function closure in julia, which I understand is invoked here. Is there anything immediately obvious with this setup, or should I rework it to define f(x) under the main scope?

Many thanks

zgornel · November 5, 2021, 8:48pm

Are you sure it is not a swapping issue?

goerch · November 5, 2021, 8:52pm

So OP should probably scale down the problem a bit and post a complete MWE?

vivian-rogers · November 5, 2021, 9:06pm

I doubt it, as I should have more than enough memory to deal with these sparse matrices.

But that is a good guess, and I think I was running into that before I got an extra 32Gb stick in my machine. I am not sure how to check the swapfile usage in julia, will look into that.

goerch · November 5, 2021, 9:12pm

Swap file is part of your OS. So maybe you should look into that manual? Without a complete MWE we can only guess about possible memory usage.

Sukera · November 5, 2021, 11:16pm

Since the inner function captures the outer variables, my guess would be that this capturing requires redefinition (and thus recompilation) of the inner function. This would be consistent with the observed behavior of f being fast after the initial call, as the function is already compiled.

Do you have timings/proper benchmarks to compare to? An avenue for investigating which part is actually slow is PPerf.jl, the Profile stdlib or tools like ProfView.jl.

zgornel · November 6, 2021, 3:19pm

Makes sense. Precompilation with large arrays can be slow. To monitor the swap usage, one can just use htop interactively (on *nix systems).
@vivian-rogers perhaps calling f with smaller matrix sizes (just for precompilation) might speed up things.

Topic		Replies	Views
Efficient closures Performance question , closure	4	1220	June 13, 2020
Allocations when passing closure as parameter Performance memory-allocation , closure	2	574	June 17, 2021
Avoiding Allocations with User provided Closures Performance memory-allocation , closure	12	1591	February 2, 2023
Passing a function as an argument via closure Performance question , package , performance , numerics , closure	9	1728	August 28, 2019
For loop in function and multiplication of larger matrices, slow speed in parallel Performance performance , parallel , loops	3	1300	November 22, 2019

Function closure with large arrays/matrices scaling horribly

Related topics