Builtin argmin is slower than manual

endremborza · November 14, 2020, 2:27am

I don’t understand how this is possible. I tried it many times, targmin always wins, and by quite a significant margin (1.5.2)

using BenchmarkTools

function targmin(a)
    m = Inf
    ind = 0
    for i in 1:length(a)
        if a[i] < m
            m = a[i]
            ind = i
        end
    end
    ind
end

a = rand(10 ^ 8);

println(argmin(a) == targmin(a))

@btime argmin(a)
@btime targmin(a)

true
382.492 ms (1 allocation: 16 bytes)
182.026 ms (1 allocation: 16 bytes)

Tamas_Papp · November 14, 2020, 8:10am

I can replicate this on master. It would be great to get to the bottom of this — ideally with a fix, but at least opening an issue.

Sukera · November 14, 2020, 9:19am

Note that your code assumes that eltype(a) is comparable to Float64, since m is Inf in the first iteration. That might not necessarily be the case. You’re also assuming 1-based indexing, eachindex would be better.

That said, I think the crux of the matter is that argmin does findmin, which itself calls findminmax! (seemingly very complex function)

Additionally, argmin allows for reduction across arbitrary dimensions, your targmin always reduces over the whole array.

kristoffer.carlsson · November 14, 2020, 9:21am

The core of argmin is:

https://github.com/JuliaLang/julia/blob/ba06f439d187ff98530487508fd9844fce6a9e49/base/array.jl#L2205-L2226

It does some extra stuff to handle NaN etc and it is generic on the type of the array. But maybe it could be optimized as well (the code in question is quite old), for example, is both the m != m and ai != ai needed in every loop iteration?

Sukera · November 14, 2020, 9:25am

Oh wow, that code is ages old!

Those checks are needed to ensure the same behaviour as min and max for NaN, I think.

kristoffer.carlsson · November 14, 2020, 9:32am

Yeah, I commented on the NaN handling but it does for exampe m != m over and over in the loop, even if m hasn’t changed.

Sukera · November 14, 2020, 9:35am

I wonder why the “change in iteration protocol” commit didn’t change to the new for loop as well and instead did it manually?

kristoffer.carlsson · November 14, 2020, 9:50am

Probably because you want to peel off the first value to use as the initial minimum.

How about:

findmax(a) = _findmax(a, :)

function _findmax(a, ::Colon)
    p = pairs(a)
    y = @inbounds iterate(p)
    if y === nothing
        throw(ArgumentError("collection must be non-empty"))
    end
    (mi, m), s = y
    isnan(m) && return m, mi
    i = mi
    while true
        y = @inbounds iterate(p, s)
        y === nothing && break
        (i, ai), s = y
        isnan(ai) && return ai, i
        # Neither `m` nor `ai` can be NaN here so can use `<` instead of `isless`.
        # Edit: not true, consider 0.0 < -0-0 == false
        if m < ai
            m, mi = ai, i
        end
    end
    return m, mi
end

does anyone see any problems with that?

Edit, the isnan should be changed back to ai != ai to handle missing…
Edit Edit: actually, findmax is already broken with respect to missing

julia> findmax([missing, 1])
ERROR: TypeError: non-boolean (Missing) used in boolean context

Edit Edit Edit: My code fails to handle signed zero I think.

mschauer · November 14, 2020, 10:05am

You would not need to bail out early at all, isless already treats NaN the way you want

          while true
               y = iterate(p, s)
               y === nothing && break
               (i, ai), s = y
               if isless(m, ai)
                   m = ai
                   mi = i
               end
           end

julia> findmax([1, NaN,  2])
(NaN, 2)

Not sure if you should have a branch just for finding NaNs faster at all.

kristoffer.carlsson · November 14, 2020, 10:09am

But early exit is nice in case you have a NaN at the beginning, or?

mschauer · November 14, 2020, 10:14am

There are many early exits you could imagine, e.g. typemax(T) where T <: Union{Int32, Int64, Int128} etc., the NaN early exit seems to be of less use. And here luckily

julia> isless(NaN, NaN)
false

so you actually find the first NaN as promised.

kristoffer.carlsson · November 14, 2020, 10:25am

Yeah, that’s true.

mschauer · November 14, 2020, 10:26am

PS: This behaviour is also not very convincing to me:

julia> struct LargerThanNaN
       end

julia> Base.isless(_, ::LargerThanNaN) = true

julia> findmax([1, NaN, LargerThanNaN()])
(NaN, 2)

malacroi · November 14, 2020, 3:57pm

IEEE specifies that NaN is supposed to propagate to indicate an error in an earlier computation. We’re not returning NaN because it’s the maximum (or minimum), but because it flags an illegal operation in a preceding computation.

Personally, since m < NaN is false, I would never want NaN as the result of maximum (or minimum for that matter), but that’s so 2008 of me. As of 2019, we should really be specifying whether we want maximum or maximumNumber to clarify our intent as to whether we want errors to propagate or be discarded.

How fun it is that Julia predates the latest floating point standard.

Topic		Replies	Views
Slow argmin? General Usage	3	84	November 12, 2024
Why is minimum so much faster than argmin? Performance	11	2000	August 23, 2021
`Nulls.skip` is very slow Data	4	1074	October 15, 2017
Julia is slower than Python when appending elements to untyped arrays General Usage question	42	798	February 10, 2025
`minimum()` 3x - 6x slower than `numpy.min()` Performance question	16	526	November 2, 2024

Builtin argmin is slower than manual

Related topics