Optimizing or replacing IF-ELSEIF-ELSE statements

hssn15 · March 12, 2025, 12:33pm

I am developing a simulator but I dont have a lot experience and tricks about Julia. I have a case that seems like it has some kind of other way around that is much more efficient and optimized.

I will give you and example that is similar to my case.

struct Struct1
    value::Float64
    type::Symbol
end

obj1 = Struct1(1.0, :A)
obj2 = Struct1(1.0, :B)
obj3 = Struct1(1.0, :C)

objs = [obj1, obj2, obj3]
results = zeros(length(objs)*10)


for i in eachindex(objs)
        obj = objs[i]
        type = obj.type
    for n in 1:10
        position = (i-1)*10+n
        ############Here we have 20-40 lines of code block############
        if type == :A
            val = obj.value
            results[position] = val # or much more complex operations in a few lines
        elseif type == :B
            val = obj.value
            results[position] = val*2 # or much more complex operations in a few lines
        elseif type == :C
            val = obj.value
            results[position] = val*3 # or much more complex operations in a few lines
        end

    end
end

Explanation:

Vector “objs” always stays constant. So types always will be same in that order. In that way, we always need to check if type is A or B or C for each iteration of loop 2 even we know that type will be always A through “loop 2” since we choose “obj1” at first iteration of “loop 1”.

one way to reduce meaningless condition checks 10 times in “loop2” is writing conditional statements before the “loop 2”:

for i in eachindex(objs)
    obj = objs[i]
    type = obj.type
    if type == :A
        val = obj.value
        for n in 1:10
            position = (i-1)*10+n
            ############Here we have 20-40 lines of code block############
            results[position] = val # or much more complex operations in a few lines
        end

    elseif type == :B
        val = obj.value
        for n in 1:10
            position = (i-1)*10+n
            ############Here we have 20-40 lines of code block############
            results[position] = val # or much more complex operations in a few lines
        end

    elseif type == :C
        val = obj.value
        for n in 1:10
            position = (i-1)*10+n
            ############Here we have 20-40 lines of code block############
            results[position] = val # or much more complex operations in a few lines
        end

    end
end

But in this way we need to write a loop for each condition so we need to write " 20-40 lines of code block" for each which is not efficient. is there a way around?

Thank You

YabusameHoulen · March 12, 2025, 2:25pm

DRY ? make “20-40 lines of code block” into a function
since Julia can’t dispatch on value ?
I don’t have a lot experience and tricks about Julia either

mbauman · March 12, 2025, 2:34pm

Before worrying about this sort of micro-optimization, you’ve got some bigger fish to fry. Namely, you should put this computation into a function instead of running it at top-level:

objs = [obj1, obj2, obj3]
function do_computation(objs)
    results = zeros(length(objs)*10)
    for i in eachindex(objs)
        #...
    end
    return results
end
do_computation(objs)

A chain if if-elseif-elseif-elseif often performs better than you’d expect. The compiler may even re-implement it into a jump table if it thinks that’d be helpful. But the difference here will almost surely be peanuts compared to optimizing the “real work” of your algorithm. Be sure to check out the performance tips — which stresses the importance of functions like I suggested above, as well as many other tips:

Paul_Schrimpf · March 12, 2025, 2:36pm

You can use multiple dispatch if you make Struct1 a parametric type.

struct Struct1{type}
    value::Float64
end

f(obj::Struct1{:A}) = obj.value
f(obj::Struct1{:B}) = 2*obj.value
f(obj::Struct1{:C}) = 3*obj.value

obj1 = Struct1{:A}(1.0)
obj2 = Struct1{:B}(1.0)
obj3 = Struct1{:C}(1.0)

objs = [obj1, obj2, obj3]
results = zeros(length(objs)*10)

for (i,obj) in enumerate(objs)
  for n in 1:10
    position = (i-1)*10+n
    results[position] = f(obj)
  end
end

Depending on the bigger context, this may not be the best solution, but it does shorten your loop.

mbauman · March 12, 2025, 2:38pm

That will certainly perform worse

kellertuer · March 12, 2025, 2:42pm

Even in short you can also dispatch on value

f(::Val{:a}) = 1
f(::Val{:b}) = 2
f(s::Symbol) = f(Val(s))
[f(:a), f(:b)] # a long dway to wite [1,2]

but if you are lucky that improves readability somewhere, but not necessarily performance.

For exactly the case you have you could still store a factor = Dict(:A=>1, :B=>2, :C=>3)
and do a

results[position] = factor[type] * val

thing? Depends of course bit what the concrete example needs, you could also store that factor in the struct for example.

Paul_Schrimpf · March 12, 2025, 3:10pm

I checked because it wasn’t obvious to me. At least in this simple example, I’m finding there’s no difference. To measure, I wrapped the loop in function, changed 10 to a much bigger number, and preallocated results to focus on the difference in the conditional. I get identical benchmarks.

julia> @benchmark dispatch!(results,objs,multiple)

BenchmarkTools.Trial: 1942 samples with 1 evaluation per sample.
 Range (min … max):  2.477 ms …   3.203 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     2.513 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.557 ms ± 108.804 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▆██▆▅▆▅▅▃▃▂    ▁▁▂▂▁ ▂▁                                     ▁
  ███████████████████████▇█████▆█▇▇▇▅▅▆▇▇█▆▇▇▇▇▇▆▆▄▅▁▄▆▆▅▄▄▄▄ █
  2.48 ms      Histogram: log(frequency) by time      2.99 ms <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> @benchmark conditional!(results,objs2,multiple)
BenchmarkTools.Trial: 1923 samples with 1 evaluation per sample.
 Range (min … max):  2.477 ms …   3.328 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     2.519 ms               ┊ GC (median):    0.00%
 Time  (mean ± σ):   2.582 ms ± 145.734 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▇█▇▆▆▅▃▂ ▁▂▂▁▂▂▂▂▁▁▁▂▁    ▁▁ ▁                              ▁
  ███████████████████████▇▇██████▇██▇▇▇▆▇▆▆▅▇▆▇▇▅▄▅▄▁▄▅▅▆▄▄▅▄ █
  2.48 ms      Histogram: log(frequency) by time      3.16 ms <

 Memory estimate: 0 bytes, allocs estimate: 0.

I can imagine that (ab)using multiple dispatch in this way might cause performance problems in more complex situations, but I’m not sure where that line is.

Full code below.

struct Struct1{type}
    value::Float64
end

f(obj::Struct1{:A}) = obj.value
f(obj::Struct1{:B}) = 2*obj.value
f(obj::Struct1{:C}) = 3*obj.value

obj1 = Struct1{:A}(1.0)
obj2 = Struct1{:B}(1.0)
obj3 = Struct1{:C}(1.0)

objs = [obj1, obj2, obj3]

function dispatch!(results,objs,multiple)
  for (i,obj) in enumerate(objs)
    for n in 1:multiple
      position = (i-1)*multiple+n
      results[position] = f(obj)
    end
  end
  return(results)
end


struct Struct2
    value::Float64
    type::Symbol
end

obj1 = Struct2(1.0, :A)
obj2 = Struct2(1.0, :B)
obj3 = Struct2(1.0, :C)

objs2 = [obj1, obj2, obj3]

function conditional!(results,objs,multiple)
  for i in eachindex(objs)
    obj = objs[i]
    type = obj.type
    for n in 1:multiple
      position = (i-1)*multiple+n
      ############Here we have 20-40 lines of code block############
      if type == :A
        val = obj.value
        results[position] = val # or much more complex operations in a few lines
      elseif type == :B
        val = obj.value
        results[position] = val*2 # or much more complex operations in a few lines
      elseif type == :C
        val = obj.value
        results[position] = val*3 # or much more complex operations in a few lines
      end
    end
  end
  return(results)
end

using BenchmarkTools, Test

results = zeros(length(objs)*100)
dispatch!(results,objs,100)
results2 = zeros(length(objs)*100)
conditional!(results2,objs2,100)
@test results ≈ results2

multiple=1_000_000
results = zeros(length(objs)*multiple)
@benchmark dispatch!(results,objs,multiple)
@benchmark conditional!(results,objs2,multiple)

mbauman · March 12, 2025, 3:30pm

You’re dancing right at the edge of a 1000x tall cliff there:

julia> @benchmark dispatch!($results,$objs,$multiple)
BenchmarkTools.Trial: 10000 samples with 1 evaluation per sample.
 Range (min … max):  370.125 μs …   1.523 ms  ┊ GC (min … max): 0.00% … 0.00%
 Time  (median):     411.916 μs               ┊ GC (median):    0.00%
 Time  (mean ± σ):   450.258 μs ± 105.652 μs  ┊ GC (mean ± σ):  0.00% ± 0.00%

  ▅█▇▆▄▃▃▂▂▅▇▇▅▄▃▂▂▁▁                                        ▁  ▂
  █████████████████████▇▇▇▆▇▆▅▆▅▄▅▅▅▅▅▃▅▅▄▅▅▄▃▃▅▃▃▁▁▃▃▃▅▇▇█████ █
  370 μs        Histogram: log(frequency) by time        964 μs <

 Memory estimate: 0 bytes, allocs estimate: 0.

julia> f(obj::Struct1{:D}) = 4*obj.value
f (generic function with 4 methods)

julia> @benchmark dispatch!($results,$objs,$multiple)
BenchmarkTools.Trial: 22 samples with 1 evaluation per sample.
 Range (min … max):  227.317 ms … 240.387 ms  ┊ GC (min … max): 0.66% … 4.48%
 Time  (median):     229.196 ms               ┊ GC (median):    1.54%
 Time  (mean ± σ):   230.290 ms ±   3.112 ms  ┊ GC (mean ± σ):  1.57% ± 0.79%

        █
  ▅▁▅▅▁██▅▁▅█▁▁▅█▅▁▁▁▁▁▁▅▁▁▁▁▁▁▅▁▁▁▁▁▁▁▁▁▁▁▁▁▁▅▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▅ ▁
  227 ms           Histogram: frequency by time          240 ms <

 Memory estimate: 91.54 MiB, allocs estimate: 5999489.

The more important point, though, is that parametric structs are more advanced techniques. Focus on the simple stuff first.

Alexander-Barth · March 12, 2025, 4:00pm

It is quite interesting to see the degradation of performance when you add a new type. The solution of @Paul_Schrimpf looks to me quite idiomatic. I don’t see why this could be an abuse of dispatch. I think one can also use union types, to keep the same performance:

f(obj::Struct1{:D}) = 4*obj.value

TU = Union{Struct1{:A},Struct1{:B},Struct1{:C},Struct1{:D}}
objs_union = TU[obj1, obj2, obj3]
@benchmark dispatch!(results,objs_union,multiple)
# same as conditional!(results,objs2,multiple)

mbauman · March 12, 2025, 4:30pm

The OP here came to us with code that didn’t even apply the very first and most important performance tip: use a function. I don’t think it’s very useful to dive straight into the deep end with parametric structs.

To be clear, I didn’t add a type. I added a method. When there are only a few methods that Julia might possibly call, it’ll convert dispatch into a bunch of ifelses for you. But you’re dancing on the edge of a cliff. Similarly, if an array only has a few types it could possibly contain, it’ll convert dispatch to a bunch of ifelses for you. But you’re once again dancing on the edge of a cliff. If you don’t know that the cliffs are there, you’re gonna fall.

Topic		Replies	Views
IF...ELSEIF...ELSE performance Performance	16	24029	May 29, 2019
Comparing Julia structs Performance question , struct	8	1397	July 13, 2023
If, elseif, else vs Case When New to Julia question	5	644	September 10, 2021
Nested if statements better formulation New to Julia question , performance , optim , optimization	12	2378	November 8, 2020
Cost of checking if statements in type parameters in a big loop and generated functions General Usage	8	1097	July 2, 2017

Optimizing or replacing IF-ELSEIF-ELSE statements

Related topics