Optimization question

lancejnelson · September 30, 2024, 4:22am

I’m trying to optimize the code block below:

using Distributions
order = 2
σ_Priors = Array{Distribution,2}(undef,order,order)
ϵ_Priors = Array{Distribution,2}(undef,order,order)
σ_Priors[:,:] .= Gamma(10,0.1)
ϵ_Priors[:,:] .= Gamma(10,0.1)
std_Prior = Gamma(10,0.3)

function totalEnergy(crystal::Crystal,LJ::LJ)
    totalEnergy = 0
    for i in eachindex(LJ.ϵ)
        totalEnergy += -LJ.ϵ[i] * LJ.σ[i] ^6 * crystal.r6[i] + LJ.ϵ[i] * LJ.σ[i] .^12 * crystal.r12[i]
    end
#        @time "total Energy" totalEnergy = sum(-LJ.ϵ .* LJ.σ .^6 .* crystal.r6 .+ LJ.ϵ .* LJ.σ .^12 .* crystal.r12 ) 
end

function logNormal(data::MatSim.DataSet,LJ::MatSim.LJ,σ::Float64)
    thesum = 0
    for i = 1:data.nData
        thesum += (data.crystals[i].energyFP -MatSim.totalEnergy(data.crystals[i],LJ))^2
    end
    thesum *= -data.nData/2 *log(σ^2) - 1/(2 * σ^2)
#    @time "thesum" thesum =  -data.nData/2 *log(σ^2) - 1/(2 * σ^2) * sum([(i.energyFP - totalEnergy(i,LJ))^2   for i in data.crystals])
    return thesum

end

function logPost(data,model,σ)
    # return 3.0
    thesum = logNormal(data,model,σ)  + logpdf(std_Prior,σ)
    for i in eachindex(model.σ)
        thesum += logpdf(σ_Priors[i],model.σ[i])
        thesum += logpdf(ϵ_Priors[i],model.ϵ[i])
    end
    return thesum
end

@time logPost(trainingSet,LJ,0.3)

I recognize that I’m not giving you all of the code but I’m hoping it’s enough to go on. Currently a single call to logPost requires 34 allocations. Doesn’t seem too bad except that this function is being called hundreds of thousands of times. I’m convinced that this function is the performance bottleneck. Can anyone see any glaring problems here? Perhaps using globals for σ_Priors instead of passing them into the function?

Thanks in advance.

jling · September 30, 2024, 4:30am

Yea, mark that const is a good start

lancejnelson · September 30, 2024, 4:31am

Or if I just pass them into the function that would work too right?

baggepinnen · September 30, 2024, 4:52am

Distribution is an abstract type, hence

logpdf(σ_Priors[i],model.σ[i])

is a dynamic (and thus expensive) dispatch. If you only have gamma distributions, do this instead

σ_Priors = fill(Gamma(10,0.1), order, order)

(same for ϵ_Priors)

lmiq · September 30, 2024, 11:52am

For example, following the previous tips of the other thread:

With a parametric struct:

julia> struct ModelA{T}
           σ::Vector{Float64}
           ϵ::Vector{Float64}
           σ_Priors::Matrix{T}
           ϵ_Priors::Matrix{T}
       end

julia> function logPost(model)
           # return 3.0
           thesum = 0.0 # simplified version
           for i in eachindex(model.σ)
               thesum += logpdf(model.σ_Priors[i],model.σ[i])
               thesum += logpdf(model.ϵ_Priors[i],model.ϵ[i])
           end
           return thesum
       end
logPost (generic function with 2 methods)

julia> m = ModelA(rand(100), rand(100), fill(Gamma(10,0.1), 10, 10), fill(Gamma(10,0.1), 10, 10));

julia> typeof(m)
ModelA{Gamma{Float64}}

julia> @btime logPost($m)
  3.100 μs (0 allocations: 0 bytes)
-675.8052525561492

With the abstract field, instead:

julia> struct ModelB
           σ::Vector{Float64}
           ϵ::Vector{Float64}
           σ_Priors::Matrix{Distribution}
           ϵ_Priors::Matrix{Distribution}
       end

julia> mB = ModelB(m.σ, m.ϵ, m.σ_Priors, m.ϵ_Priors);

julia> typeof(mB)
ModelB

julia> @btime logPost($mB)
  6.455 μs (600 allocations: 9.38 KiB)
-675.8052525561492

FWIW, I think this blog post might be helpful: New blog post about Julia parametric types and constructors

DNF · September 30, 2024, 12:42pm

I think perhaps this is not causing any problems right here, but I don’t think it’s good style to shadow the typename like this.

lancejnelson · September 30, 2024, 2:54pm

lmiq:

struct ModelA{T}
           σ::Vector{Float64}
           ϵ::Vector{Float64}
           σ_Priors::Matrix{T}
           ϵ_Priors::Matrix{T}
       end

julia> function logPost(model)
           # return 3.0
           thesum = 0.0 # simplified version
           for i in eachindex(model.σ)
               thesum += logpdf(model.σ_Priors[i],model.σ[i])
               thesum += logpdf(model.ϵ_Priors[i],model.ϵ[i])
           end
           return thesum
       end
logPost (generic function with 2 methods)

julia> m = ModelA(rand(100), rand(100), fill(Gamma(10,0.1), 10, 10), fill(Gamma(10,0.1), 10, 10));

julia> typeof(m)

isconcretetype(m) #false

@time logPost(m)

I ran this exact code and got


0.000010 seconds (1 allocation: 16 bytes)

DanielVandH · September 30, 2024, 3:13pm

Note that it’s @btime not @time (from BenchmarkTools.jl).

lancejnelson · September 30, 2024, 4:24pm

I see what you’re saying. This declaration:

σ_Priors = Array{Distribution,2}(undef,order,order)

is problematic. It turns out that not all of the distributions are identical. They may all be Gammas, but with different parameters. How could I avoid this problem in this case?

abraemer · September 30, 2024, 4:29pm

If you know all possible Distributions then you can just pack them into a single type by using DynamicalSumTypes.jl

DanielVandH · September 30, 2024, 4:34pm

Does the gamma distribution use different types for different parameters? If they are all the same type, you could just use typeof(Gamma(10, 0.1)) for example. If you have a few different distribution types, wrapping them in a Union type could be fine. Alternatively, use LightSumTypes.jl as mentioned above

lancejnelson · September 30, 2024, 11:00pm

I’m showing that the function logPost is type unstable. Since model.σ_Priors[i] is a distribution the type of logpdf is Any.

Here’s the snippet from @code_warntype:

%138 = thesum::Any
%139 = Base.getproperty(metrop, :ϵ_Priors)::Matrix{Distribution}
%140 = i@_29::Int64
%141 = Base.getindex(%139, %140, j)::Distribution
%142 = Base.getproperty(model, :ϵ)::LinearAlgebra.UpperTriangular{Float64, Matrix{Float64}}
%143 = i@_29::Int64
%144 = Base.getindex(%142, %143, j)::Float64
%145 = MatSim.logpdf(%141, %144)::Any
%146 = (%138 + %145)::Any

lancejnelson · September 30, 2024, 11:05pm

My model.σ is an upper triangular matrix:


σ:: UpperTriangular{Float64, Matrix{Float64}}
ϵ:: UpperTriangular{Float64, Matrix{Float64}}

Could that be part of the problem?

lmiq · September 30, 2024, 11:40pm

I don’t think so. Where that UpperTriangular type is coming from?

The main reason is probably the abstract Distribution type. If you only need distributions of type Gamma as elements of of the priors (or other of the same type with varying parameters), you should parameterize the type.

lancejnelson · October 1, 2024, 3:07pm

Let’s assume the matrix was going to hold some Gammas and some Uniforms, would the following type definition work?

struct ModelA{T<:Union{Gamma,Uniform}}
           σ::Vector{Float64}
           ϵ::Vector{Float64}
           σ_Priors::Matrix{T}
           ϵ_Priors::Matrix{T}
       end

DNF · October 1, 2024, 3:53pm

You should possibly parameterize Gamma{Float64} and Uniform{Float64}, so the compiler can predict the type of the sampled data.

lmiq · October 1, 2024, 4:04pm

Actually that can be specified on the structure creation, not on the type. With the same generic type, you get, for example:

julia> using Distributions, BenchmarkTools

julia> @kwdef struct ModelA{T}
           σ::Vector{Float64}
           ϵ::Vector{Float64}
           σ_Priors::Matrix{T}
           ϵ_Priors::Matrix{T}
       end

julia> function logPost(model)
           # return 3.0
           thesum = 0.0 # simplified version
           for i in eachindex(model.σ)
               thesum += logpdf(model.σ_Priors[i],model.σ[i])
               thesum += logpdf(model.ϵ_Priors[i],model.ϵ[i])
           end
           return thesum
       end
logPost (generic function with 1 method)

julia> T = Union{Gamma{Float64},Uniform{Float64}}

julia> m = ModelA(
           σ=rand(100), 
           ϵ=rand(100), 
           σ_Priors=T[ rand(Bool) ? Gamma(10,0.1) : Uniform(0.1,10) for _ in 1:10, _ in 1:10 ],
           ϵ_Priors=T[ rand(Bool) ? Gamma(10,0.1) : Uniform(0.1,10) for _ in 1:10, _ in 1:10 ],
       );

julia> typeof(m)
ModelA{Union{Gamma{Float64}, Uniform{Float64}}}

julia> @btime logPost($m)
  1.911 μs (0 allocations: 0 bytes)
-Inf

lancejnelson · October 1, 2024, 4:23pm

Interesting. Thank you.

lancejnelson · October 1, 2024, 4:30pm

Can I ask one more kinda related question and then I’ll be quiet.

One “type” of array is Array{Float64,2}, meaning a 2 dimensional array filled with floats. Are users allowed to define types with integers in their type definition? If so, what does that look like?

struct myType{T,N}
    x::T
    ...?

For best performance, user-defined types should be concrete when instantiated right? (Not sure I’m even asking this question right. I’m just wanting to build intuition for how to use types correctly and most efficiently. ). For example, I notice that isconcretetype(Matrix{Distribution}) is true, but I wouldn’t have thought it was concrete because Distribution isn’t concrete. I guess I’m asking when is a composite type deemed concrete and is that something that should be strived for.

lancejnelson · October 1, 2024, 6:19pm

Why wouldn’t you also parameterize the Float64?

struct ModelA{F,T}
     σ::Vector{F}
     ϵ::Vector{F}
     σ_Priors::Matrix{T}
     ϵ_Priors::Matrix{T}
end

I understand that parameterizing with a concrete doesn’t create a family of types because you’re at the bottom of the type tree, but is there no performance gain by doing this?

Topic		Replies	Views
Using gradient function with Optim New to Julia question	4	1708	September 18, 2017
Reducing allocations (again) New to Julia	10	695	March 24, 2022
Learning to optimize Performance	12	1209	March 15, 2021
Learning Julia by an example of a simple Bayesian linear regression Performance question , performance , distributions , bayesian-inference , linear-regression	10	1120	June 4, 2023
Help optimizing simple example New to Julia question , performance	5	397	March 27, 2022

Optimization question

Related topics