Argmax returns wrong value

lukas_r · November 22, 2022, 2:41pm

I have an array of n vectors (of varying length) that I am searching for the largest value. For this I first find the vector containing the largest value with argmax(array) and then find the index of that value with argmax(array[vector]). For one array of 6 vectors with values between 4 and 16.75 this reliably works. For another array of 8 Vectors with values between 4.25 and 34 it finds a relatively high value but not the highest overall. The arrays/vectors are only filled with Float64s and the value returned is not NaN.
Does anyone have an idea why this happens?

Oscar_Smith · November 22, 2022, 2:45pm

can you give an example where it returns incorrect results?

pdeffebach · November 22, 2022, 2:45pm

In Julia, arrays are ordered lexicographically… Comparring two arrays will give the largest array in a lexicographic order, not the highest value. The same goes for comparing values in an array of arrays.

julia> x = [1, 100]; y = [3, 4];

julia> x > y
false

julia> y > x
true

julia> t = [x, y];

julia> argmax(t)
2

DNF · November 22, 2022, 3:16pm

You could use

x = [1, 100]; y = [3, 4];
t = [x, y];
maxval, ind = findmax(maximum, t)
loc = findfirst(==(maxval), t[ind])

lukas_r · November 23, 2022, 8:06am

When the right value is returned my array consists of 6 vectors, containing 30-50 values ranging between 4 and 16.75. When the wrong value is returned the array consists of 8 vectors, containing between 110-118 values between 4.5 and 34. Here argmax finds neither the longest vector (number 2) nor the one containing the highest value (number 8) but instead vector number 6. This contains 115 values with the highest being 32.75

albheim · November 23, 2022, 8:56am

Do you have some example code that we can run to reproduce this?

Based on your description I think @pdeffebach have given the reason, though it is would be easier to say for sure with some code to look at.

DNF · November 23, 2022, 10:08am

The description of all the different inputs aren’t really that relevant. The point is that this code

is wrong. It does not find the vector with the largest element. It doesn’t matter that it returns the desired result occasionally, by pure chance.

You can use findmax(maximum, array) instead, as shown in my code example.

zamk · November 23, 2022, 11:10am

In the documentation argmax is defined as:

  argmax(A; dims) -> indices

  For an array input, return the indices of the maximum elements over the given dimensions. NaN is treated as greater
  than all other values except missing.

  Examples
  ≡≡≡≡≡≡≡≡≡≡

  julia> A = [1.0 2; 3 4]
  2×2 Matrix{Float64}:
   1.0  2.0
   3.0  4.0

  julia> argmax(A, dims=1)
  1×2 Matrix{CartesianIndex{2}}:
   CartesianIndex(2, 1)  CartesianIndex(2, 2)

  julia> argmax(A, dims=2)
  2×1 Matrix{CartesianIndex{2}}:
   CartesianIndex(1, 2)
   CartesianIndex(2, 2)

It returns the maximum elements over given dimensions of an array, not array of vectors. If you can shape your vectors in the same length and create a matrix by concatenating them, then you can get the maximum values you are looking for using argmax.

lukas_r · November 23, 2022, 12:16pm

This worked for me, thanks!

Dan · November 23, 2022, 1:15pm

It is always possible to keep things simple () with:

julia> x = [1, 100]; y = [3, 4];
julia> t = [x, y];
julia> Iterators.flatten(
         Iterators.map(
           (x,y)->Iterators.map(tuple,x,Iterators.repeated(y)),
           Iterators.map(x->Iterators.map(reverse,x),enumerate.(t)),
           Iterators.countfrom(1)
         )
       ) |>  maximum
((100, 2), 1)

The actual benefit this has is: 1. Not going through one vector twice which maximum and findfirst can ; 2. This can work if the inputs are iterators.

abraunst · November 23, 2022, 1:31pm

What about

findmax(findmax(y) for y in t)

DNF · November 23, 2022, 1:52pm

Dan:

Iterators.flatten(
         Iterators.map(
           (x,y)->Iterators.map(tuple,x,Iterators.repeated(y)),
           Iterators.map(x->Iterators.map(reverse,x),enumerate.(t)),
           Iterators.countfrom(1)
         )
       ) |>  maximum

Seems like a simple loop would be more readable

Really elegant The only thing is that if two vectors have the same maximum element, it will select the last one, which is, I think, not the most common behaviour. Selecting the first among several equal elements is the norm.

abraunst · November 23, 2022, 2:44pm

I think that it will select the one in which the common element appear first, which, admittedly, is even less common behavior

rafael.guerra · November 23, 2022, 2:48pm

The following code is not the most efficient, but it illustrates what I would expect as result:

vv = [[9, 3, 9, 0], [1, 9, 2], [0, -1]]
mx = maximum(reduce(vcat,vv))       # mx = 9
loc = @. findall(==(mx), vv)

# result:
3-element Vector{Vector{Int64}}:
[1, 3]
[2]
[]

Topic		Replies	Views
Argmax, but n biggest elements General Usage arrays	3	1719	January 13, 2022
Argmax() -> vector of index result General Usage maxima	5	739	February 16, 2022
How to find the index of the two largest values in a 1D array New to Julia question	8	456	May 25, 2023
How to perform an argmax / argmin on a subset of a vector? New to Julia array	21	5975	October 30, 2021
Argmax, returned the first one New to Julia	1	480	December 19, 2022

Argmax returns wrong value

Related topics