Help me understand unintuitive behavior of array comparison

levasco · March 25, 2021, 4:19pm

julia> minimum([[1,1e10,1e10],[2,0,0]])
3-element Array{Float64,1}:
 1.0
 1.0e10
 1.0e10

I would expect this to return either the same as min.() or a MethodError, but instead only the first elements are compared. Similarly [1,1e10,1e10]<[2,0,0] is true, there is probably a good reason why but it’s hard to google for it.

jling · March 25, 2021, 4:22pm

it’s comparing element by element (pair from both arrays) until it finds two elements that are not the same, and it will compare those two:

"""
    isless(A::AbstractVector, B::AbstractVector)

Returns true when `A` is less than `B` in lexicographic order.
"""
isless(A::AbstractVector, B::AbstractVector) = cmp(A, B) < 0

levasco · March 25, 2021, 7:16pm

Thanks for the answer. I guess I understand what it does, but not quite sure about the why. Is this standard in other languages? I can think of at least two other equally plausible ways to determine this comparison:

which array’s sum is larger
which array is larger in most element-wise comparisons

lmiq · March 25, 2021, 7:18pm

It is in python, at least.

I think that is the why.

The option makes mostly sense for strings, arrays of characters, such things, which is how we organize things in dictionaries:

julia> [ 'B', 'A' ] < [ 'C' ]
true

julia> [ 'B', 'A' ] < [ 'A' ]
false

julia> "ba" < "c"
true

julia> "ba" < "a"
false

rdeits · March 25, 2021, 7:26pm

Just for fun, I tried a few. Here’s Python:

>>> min([1,1e10,1e10], [2,0,0])
[1, 10000000000.0, 10000000000.0]

and Ruby:

irb(main):001:0> [[1,1e10,1e10], [2,0,0]].min
=> [1, 10000000000.0, 10000000000.0]

and C++:

julia> using Cxx

C++ > std::vector<std::vector<double>> values = {{1,1e10,1e10}, {2,0,0}}
true

C++ > auto min = *std::min_element(values.begin(), values.end())
true

C++ > min[0]
(double &) 1.0

C++ > min[1]
(double &) 1.0e10

C++ > min[2]
(double &) 1.0e10

These all agree with Julia because they’re all doing the same basic thing: comparing each vector in the list lexicographically.

GunnarFarneback · March 25, 2021, 7:26pm

Yes. It’s called lexicographic order.

levasco · March 25, 2021, 7:31pm

Alright, thank you everyone for the responses! I never think about strings. Interesting that R is then the exception, as it does element-wise comparison by default, returning a boolean vector.

Henrique_Becker · March 25, 2021, 8:01pm

This is because R has basically no concept of scalars, just vectors of unitary length. Consequently, element-wise operations are the default. Julia goes on the opposite direction, with a general broadcast operator that does element-wise operations when requested, so the default of every operation can be the non-element-wise interpretation (to be broadcasted on demand by just adding a single dot).

Tamas_Papp · March 26, 2021, 1:00pm

Pretty much, but not may not be the relevant question. The important thing is that it is documented, see ?isless.

Topic		Replies	Views
Meaning of `isless`, `<`, `cmp`, `isequal`, `==`, total order, partial order, unordered Internals & Design	24	2319	January 7, 2018
Elementwise comparison of Arrays yield a x-element BitArray{1} with zeros instead of Bool New to Julia question	2	2807	September 13, 2019
Max() bug? New to Julia	6	628	October 11, 2020
Base Method for isless on arrays or matrices General Usage	4	823	May 21, 2019
Cartesian Index Inequalities General Usage	8	1051	January 15, 2019

Help me understand unintuitive behavior of array comparison

Related topics