Need for explicitly defining subtypes in function arguments?

hatmatrix · October 12, 2023, 3:20am

Sorry I’m very new to type systems here.

I’m wondering why you need to state that your function should accept all subtypes in the class hierarchy for nodes which are defined.

julia> x = collect(1:2)
2-element Vector{Int64}:
 1
 2

julia> g(x::Array{T}) where {T<:Real} = x .* 2
g (generic function with 2 methods)

julia> g(x)
2-element Vector{Int64}:
 2
 4

Is there a case where you would ever define a type higher up in the hierarchy - e.g.,

g(x::Array{Real}) = x .* 2

where you wouldn’t want the function to apply to subtypes? I.e., why is the definition above not permitted?

Oscar_Smith · October 12, 2023, 3:26am

The key point is that Array{Real} and Array{Float64} have completely different memory layouts. Array{Real} is an array of pointers to elements who’s type is unknown. By comparison Array{Float64} is simply a bunch of Float64s next to each other in memory (and you know the element types at compile time). In Julia, no concrete types have subtypes. Array{Float64} is not a subtype of Array{Real} (i.e. Julia’s type system is invariant).

baggepinnen · October 12, 2023, 4:23am

It is

julia> g(x::Array{Real}) = x .* 2
g (generic function with 1 method)

julia> g(Real[i for i in 1:3])
3-element Vector{Int64}:
 2
 4
 6

You just need to call the method with the appropriate type:

julia> typeof(Real[i for i in 1:3])
Vector{Real} (alias for Array{Real, 1})

If it makes any difference to you, you can shorten this syntax a bit:

g(x::Array{<:Real}) = x .* 2

hatmatrix · October 12, 2023, 8:08am

Doesn’t Float64 <: Real => true mean that a concrete type has a subtype? Though Real is not an abstract type.

I now see your argument made in Parametric Composite Types section.

https://docs.julialang.org/en/v1/manual/types/#Parametric-Types

So it’s really because the implementation in memory and performance is different that Array{Real} cannot implicitly accept Array{Float64}? I thought there is a layer of abstraction in this but I guess I was mistaken - wo what happens when g is defined as g(x::Array{<:Real})? Does it just accept Array{Float64} arguments but implements it with the inefficient memory layout?

hatmatrix · October 12, 2023, 8:12am

Interesting that you can make the data type Real- while it has child nodes it’s a concrete data type, but I guess if you were to return a vector with different data types, this is useful?

Thanks for the shortcut… I suppose that makes more sense in this case.

hatmatrix · October 12, 2023, 8:31am

I guess it really is about composite types and its effect on memory management, and not about subtypes strictly, since this works:

julia> a(x::Real) = x * 2
a (generic function with 1 method)

julia> a(2)
4

julia> a.(1:2)
2-element Vector{Int64}:
 2
 4

where what is passed is either an Int64 or Vector{Int64}

Sevi · October 12, 2023, 11:45am

No:

julia> isabstracttype(Real)
true

Not sure I understand this point. In the first call you pass an Int (which is a subtype of Real), but in the second one you broadcast, so you can input an iterable, but the function itself will still only accept objects of type Real (meaning, any of its subtypes, since there cannot be an instance of Real; it is an abstract type).

julia> a(x::Real) = x * 2
a (generic function with 1 method)

julia> a(2)
4

julia> a(1:2)
ERROR: MethodError: no method matching a(::UnitRange{Int64})

# Broadcasting over anything else than `Real`s fails as well
julia> a.([0.0im, 0.0im])
ERROR: MethodError: no method matching a(::ComplexF64)

caleb-allen · October 12, 2023, 2:21pm

My understanding is that the memory layout for Array{Float64} is that way because Float64 is a primitive type whose size is known.

Is there any similar benefit in using Array{String} as opposed to Array{AbstractString}, despite the fact that String is not a primitive type?

Oscar_Smith · October 12, 2023, 3:14pm

yes. The benefit is that by knowing you have Strings, you know the type of data you get from the array.

DNF · October 12, 2023, 3:37pm

It doesn’t have to be primitive. Immutable structs with immutable field members works as well. So you can create your own composite types that can be stored inline in arrays.

hatmatrix · October 12, 2023, 8:18pm

Didn’t realize Real was an abstract type.

Anyway my point was that Int64 <: Real is true but Array{Int64} <: Array{Real} is false is counterintuitive.

abraemer · October 12, 2023, 8:31pm

What helped me understand the distinction between Vector{Real} and Vector{<:Real} is the insight that abstract types always denote a collection of types. So like Real is the collection of types like Int64, Float64 and so on. Then you can see that Vector{<:Real} is also a collection of types. Even clearer if you write it more explicitely as Vector{T} where T<:Real. On the other hand, Vector{Real} denotes just a single type (the vectors that store anything from the set of Real) and is thus a concrete type.

nsajko · October 12, 2023, 9:03pm

See:

DNF · October 12, 2023, 9:03pm

One way to think of it is that Vector{Real} is a collection that can hold Ints, Floats, Rational, etc. etc. You can put all sorts of numbers into it. You cannot do that with a Vector{Int}. The Vector{Real} promises to accept for example the number 2.5. What happens if you try to put 2.5 into a Vector{Int}? That promise is broken.

All technicalities aside, from a purely intuitive standpoint, Vector{Int} does not have the properties of a Vector{Real}.

nsajko · October 12, 2023, 9:06pm

This question is actually mentioned in the Julia FAQ: entry. The FAQ entry links to this section in the manual for further info.

nsajko · October 12, 2023, 9:11pm

The ::Array{<:Real} type signature/annotation basically translates as “any type Array{T} for some T that subtypes Real”. In the REPL:

julia> Array{<:Real} == (Array{T} where {T<:Real})
true

Note that only abstract types can have subtypes in Julia.

To understand the above example better, also see the documentation on “UnionAll” types: 1 2.

implements it with the inefficient memory layout?

Regarding this part of the question specifically, note that Julia specializes code for the given argument types when compiling the function, so there’s no performance penalty, at least after the compilation is done. See Monomorphization - Wikipedia

aplavin · October 13, 2023, 1:48am

Yes, totally this! I’ve already seen this question coming up a few times here on discourse, and answers typically start from those internal/technical aspect like memory layout. Meanwhile, there’s a clear intuitive explanation that Vector{Int} shouldn’t be accepted if function declares Vector{Real} – you can only put 2.5 into the latter, not the former. Maybe there’s some canonical place to put this short explanation, so that to link easily afterwards?..

hatmatrix · October 13, 2023, 1:07pm

my bad

hatmatrix · October 13, 2023, 1:09pm

Thanks - very great reference to place this into proper context for a non-typist.

hatmatrix · October 13, 2023, 1:13pm

I understand what type-invariance is now but I don’t get your example - if I define a function with argument type Vector{Real}, I would expect it to take a collection that contains a value 2.5, which is not an Int but Float, which is still a subtype of Real. I would not expect that for a function with argument type Vector{Int}.

Topic		Replies	Views
Why [1, 2, 3] is not a Vector{Number}? New to Julia question , parametric-types	41	6951	December 9, 2022
Why? isa([(x,1),(y,1)], Array{Tuple{Stuff,Number},1}) = false General Usage	25	1656	February 23, 2021
Simple question about Julia types New to Julia type	11	767	June 10, 2023
Type Definition of Array Arguments to Functions General Usage type-stability , function-parameters	8	145	September 13, 2024
Question about function argument New to Julia	7	1048	December 6, 2017

Need for explicitly defining subtypes in function arguments?

Related topics