Unexpected allocation at each loop iteration

simon79 · June 11, 2022, 1:26pm

Hi,
The following overly-simplified extraction from my code shows a section of the code which is allocating memory at every loop iteration.

I think this should not happen and I’d like to have advise and suggestion on how to optimize/correct it.

cache_unique_edges = array_cache(conn_unique_edges) # allocation done here to prevent further allocations later.
for iel = 1:nelem    
    for e = 1:E
        ...
        for g = 1:G
            ...
            ai = getindex!(cache_unique_edges, conn_unique_edges, g)

            if (ai[2] == ai[1]) 
                # THIS if STATEMENT CAUSES OVER-ALLOCATION but I want to avoid it!
            end
        end
    end
end

When the if statement is commented out, the allocation and timing are:
[ Info: 17.088092 seconds (37.64 k allocations: 2.114 MiB)

However when the the code executes if (ai[2] == ai[1]), then allocation and timing are:
[ Info: 35.950601 seconds (676.71 M allocations: 10.085 GiB, 1.46% gc time)

simon79 · June 11, 2022, 1:28pm

NOTE on getindex! and array_cache: these are functions that use my own definition of tables. This being said, the same exact behavior is observed if I use julia native Arrays.

Sukera · June 11, 2022, 1:45pm

Without knowing what ai ultimately is, it’s quite impossible to diagnose from afar. Do you have a minimal, self contained example people could run to debug on their machine?

simon79 · June 11, 2022, 2:02pm

hi @Sukera thanks for replying. I am extracting a working code for you to test. Because it is part of a major code that I am developing, you will need to run it from within its own --project=. and add a couple libraries.
I hope that is ok. I’ll post a github link shortly

baggepinnen · June 11, 2022, 2:23pm

Is the cache array a global variable? If so, that’s your problem.

simon79 · June 11, 2022, 2:30pm

It is not. This being said, reducing the code to a minimal working code for this forum seems to have helped found the culprit. More soon. Still assessing this statement.

simon79 · June 11, 2022, 4:00pm

Ok, I pushed the minimal code that allows you to run it.

git clone --branch discourse/julia git@github.com:smarras79/jexpresso.git

To run it simply do:

>> cd jexpresso
>> julia --project=.
>> include("./src/jexpresso.jl")

(It will ask you to add some libraries. Let me know if that is giving you issues.)

The function of interest is:

github.com

smarras79/jexpresso/blob/discourse/julia/src/mesh/mod_mesh_minimal.jl#L141


      
              
              #Edges
              @time add_high_order_nodes_edges!(mesh)
          
          
end
          
          

          
#
          # FOR DISCOURSE.JULIA
          #
          function  add_high_order_nodes_edges!(mesh::St_mesh)
              @info " CCCC"
          
          
    # INCREASE/DECREASE "NEL" to see how allocation changes
              NEL     = 20 #mesh.nelem
          
          
    #Do not touch NGLOBAL and NLOCAL
              NGLOBAL = mesh.nedges
              NLOCAL  = mesh.NEDGES_EL
              #

Increase/decrease NEL at line 145 of the code
OR

comment/uncomment the IF statement at line 166 of the code

to see how the code is behaving.

Thanks for willing to help.

Sukera · June 11, 2022, 5:04pm

github.com

smarras79/jexpresso/blob/7b786c2fbac66de8b6d726883517d0a8840cfba2/src/mesh/mod_mesh_minimal.jl#L153


      
          
          
# INCREASE/DECREASE "NEL" to see how allocation changes
          NEL     = 20 #mesh.nelem
          
          
#Do not touch NGLOBAL and NLOCAL
          NGLOBAL = mesh.nedges
          NLOCAL  = mesh.NEDGES_EL
          #
          
          

          
cache_unique_edges = array_cache(mesh.conn_unique_edges) # allocation here
          @info typeof(cache_unique_edges)
          
          
for iglob = 1:NGLOBAL
          
          
    ai = getindex!(cache_unique_edges, mesh.conn_unique_edges, iglob)        
          
          
    #@info sizeof(ai[1]) sizeof(ai[2])
              #@info typeof(ai)
              
              for iel = 1:NEL

I don’t know where array_cache comes from, but my guess is that the function allocates a new array internally?

github.com

smarras79/jexpresso/blob/7b786c2fbac66de8b6d726883517d0a8840cfba2/src/mesh/mod_mesh_minimal.jl#L166


      
              for iglob = 1:NGLOBAL
          
          
        ai = getindex!(cache_unique_edges, mesh.conn_unique_edges, iglob)        
          
          
        #@info sizeof(ai[1]) sizeof(ai[2])
                  #@info typeof(ai)
                  
                  for iel = 1:NEL
                      for iloc = 1:NLOCAL
                          
                          if(issetequal([ai[1], ai[2]], [ai[1], ai[2]]))
                              # (UN)COMMENT THIS IF to SEE HOW ALLOCATION BEHAVES    
                          end
                          
                      end
                  end
              end
              @info " DDDD"
              
              return 
          end

This creates two new arrays per iteration, together with the surrounding loops that’s a total of NLOCAL * NEL * NGLOBAL * 2 allocations. Either use a tuple (so (ai[1], ai[2]) etc) or write the comparison exiplicitly.

I do not have a local MPI setup, so I can’t really run your code sorry. I also don’t know where getindex! is coming from - github code search does not show any hits in your repository, so I don’t know what it’s type would be and thus can’t really figure out which getindex method on ai would be called. However, since you report the same behavior with standard arrays, I’m assuming the getindex itself does not allocate.

simon79 · June 11, 2022, 5:19pm

right, getindex does NOT allocate. You can find how getindex is defined here: Gridap.Arrays · Gridap.jl}

Topic		Replies	Views
Allocation puzzler General Usage	6	476	November 19, 2018
Unexpected allocation Performance	7	305	January 30, 2024
Comparison performance with `any` Performance	5	620	January 11, 2019
Memory allocation inconsistency (again...) General Usage	14	681	July 13, 2021
Memory on array element assignment Performance	10	441	August 3, 2022

Unexpected allocation at each loop iteration

Related topics