Allocation of Memory while evaluate a model

HenrZu · November 6, 2021, 8:52pm

While working on the performance of a model used in a numerical simulation, I noticed that memory is allocated every time a model is evaluated.

Since I call the model several times within the simulation, there is a significant loss in runtime.

Already with a simple model with an input of length 10 and a binary classification as output, more than 800 bytes are allocated.

using Flux
using BenchmarkTools

model = Chain(
  Dense(10, 5, σ),
  Dense(5, 2),
  softmax)

input = Vector{Float64}(rand(10))
@benchmark model($input)

BenchmarkTools.Trial: 10000 samples with 187 evaluations.
 Range (min … max):  539.037 ns … 47.713 μs  ┊ GC (min … max): 0.00% … 98.36%
 Time  (median):     568.449 ns              ┊ GC (median):    0.00%
 Time  (mean ± σ):   656.073 ns ±  1.633 μs  ┊ GC (mean ± σ):  9.27% ±  3.67%

    ▁█▄▄▇▂                  
  ▁▂██████▅▃▃▂▂▂▁▂▂▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▂▂▃▂▂▂▂▂▁▁▁▁▁▁▁▁▁▁▁▁▁ ▂
  539 ns          Histogram: frequency by time          824 ns <

 Memory estimate: 848 bytes, allocs estimate: 10.

Is there a way to evaluate the model without allocating memory?

CC: @sloede

ToucheSir · November 7, 2021, 3:51am

If your networks are small and allocations really are the bottleneck, you could take a look at SimpleChains.jl. I believe it’s still experimental, but it allows for preallocating certain buffers whereas Flux does pretty much everything out-of-place (mostly for AD compat).

sloede · November 7, 2021, 11:39am

Thanks for the suggested package - it seems like it maybe could do the trick. However, the referenced repo does not seem to exist anymore - at least the GH URL https://github.com/JuliaSIMD/SimpleChains.jl gives me a 404.

HenrZu · November 7, 2021, 5:23pm

Thanks a lot for the answer. SimpleChains looks really interesting and could be a possible solution to my problem.

Unfortunately, as Michael already mentioned, the repo returns a 404.

ToucheSir · November 7, 2021, 7:17pm

IIRC the repo has been made private, but you can still use the registered package.

sloede · November 8, 2021, 9:40am

@Elrod Would you mind shedding some light on the whereabouts of SimpleChains.jl and your future plans for it?

Elrod · November 8, 2021, 10:52am

As ToucheSir noted, it’s been made private.
When/if it can be opened again will require an internal discussion/ business plan delineating how/where it fits.
It almost certainly won’t be a product itself, but may be a piece of some other offering, so I think we should open it.
I jumped the gun by creating it as an open repo in the first place…

sloede · November 8, 2021, 11:02am

Thanks for the clarification. Thus, to conclude, there does not seem to be a “canonical” Julia package for ANN models available at the moment that does not allocate during each model evaluation. That’s too bad, and I would like to - in case anyone cares - express strong interest in having such an option available for Julia!

Topic		Replies	Views
Which library supports a non-allocating neural network model Specific Domains package , gpu , machine-learning	13	799	August 17, 2022
Avoid allocation of a Flux model on the CPU Machine Learning flux	3	122	October 27, 2024
Is there a way to avoid allocations when calling a Flux model? Especially on a GPU Machine Learning memory-allocation , flux	5	877	May 4, 2021
Flux: How to minimise the garbage collection time? Machine Learning question , flux	22	716	April 20, 2023
Implementing a ConvNet that doesn't allocate during inference (SimpleChains.jl?) Machine Learning	3	136	February 27, 2025

Allocation of Memory while evaluate a model

Related topics