Is there a reason a `MersenneTwister` value is not broadcastable?

fjarri · February 14, 2019, 4:53am

I have multiple functions with the same pattern:

foo(rng::AbstractRNG, x::ValueType) = ...

which are commonly applied to arrays as

foo.(rng::AbstractRNG, a::Array{ValueType})

which is supposed to be equivalent to, essentially

reshape([foo(rng, x) for x in a], size(a))

rng here is a MersenneTwister-type value, but I guess the question applies to any other kind of RNG. For just the code above, I get an error

MethodError: no method matching length(::MersenneTwister)

In order to make it work I have to wrap a MersenneTwister in my own type and define length() and iterate() for it.

Is there some caveat I’m missing which is the reason these methods are not defined in Random?

In fact, since I need this kind of “broadcastability” for many other types, I have the following abstract type defined:

abstract type BroadcastableSingleton end
Base.length(::BroadcastableSingleton) = 1
Base.iterate(x::BroadcastableSingleton) = (x, nothing)
Base.iterate(x::BroadcastableSingleton, state) = nothing

This reduces boilerplate code, but won’t be too clear if there are already some complicated abstract type hierarchies in place. I wonder if there is a better way to do it? (Can’t help but miss Haskell’s data classes here)

rdeits · February 14, 2019, 5:01am

A Ref is already designed to do exactly what your custom BroadcastableSingleton does. Wrapping something in a Ref will scalarize it in a broadcasted computation:

foo.(Ref(rng), a)

fjarri · February 14, 2019, 5:05am

Thanks, it does work that way, but:

This call pattern is quite common in my code, and is present in the public API as well. It won’t look good to ask a user to wrap every value that needs this in Ref.
Neither the name nor the docstring for Ref give any indication that it may be used for this purpose. It seems more like an undocumented feature that can change at any time.

Tamas_Papp · February 14, 2019, 5:57am

You are mistaken — it is the standard API, and happens to be well-documented.

https://docs.julialang.org/en/v1/manual/arrays/#Broadcasting-1
https://docs.julialang.org/en/v1/base/arrays/#Base.Broadcast.broadcast

fjarri · February 14, 2019, 6:19am

I would disagree.

‘treats any argument that is not an array … as a “scalar”’ in the first link is incorrect; the second link correctly states that everything but certain predetermined types are iterated over.
Moreover, in the first link it says ‘and treats any argument that is not an array, tuple or Ref (except for Ptr) as a “scalar”’, but the Ref arguments are treated as scalars.
The second link seems more consistent with the actual behaviour. But it only mentions Ref as one of the types that can be broadcasted over by default. There is no indication that it is supposed to be used to create singletons (even though it does create them). Neither, again, there is any indication of that in the docstring of Ref itself.
I don’t need all these low-level effects Ref provides (“This type is guaranteed to point to valid, Julia-allocated memory of the correct type. The underlying data is protected from freeing by the garbage collector as long as the Ref itself is referenced.” and so on). I only need a broadcastable singleton. So in addition to not being the intended usage, it seems like a serious overkill.
My point about this being an unnecessary boilerplate (exaggerated by the fact that from the user’s standpoint it seems like some kind of black magic) is still valid.
Edit: One more thing. typeof(Ref(x)) != typeof(x), so I would have to explicitly ask for Ref objects in the function signature.

Raf · February 14, 2019, 7:28am

Why don’t you just fix the docs a little? The problem seems to be that the docs for Ref are incorrect and not lightweight enough in the introductory paragraphs, not that Ref is not lightweight enough.

And Ref arguments are not treated as scalars. In broadcast Ref is unwrapped and its contents passed to the method. A scalar is itself passed to the method unaltered. You don’t have to ask for Ref in the method signature because it isn’t treated as a scalar.

fjarri · February 14, 2019, 8:43am

They are not incorrect. I actually don’t have anything against the docs for Ref, they seem to contain exactly what they need to contain. It could perhaps have been stated that it is an iterable, but that seems like an implementation detail to me, not related to its purpose.

On the other hand, Single- and multi-dimensional Arrays · The Julia Language seems to contain some incorrect statements (not related to Ref). Although I wouldn’t mind if it was correct though, that is if every non-iterable object was treated as a scalar.

Yes, I agree, that is technically correct. Ref’s contents is treated as a scalar.

Yes, I agree with that as well.

Nevertheless, I still feel like I failed to put the issue across. Instead of Ref(x) I could have used (x,), [x], Set(x) or whatever else built-in iterable container to create a singleton (in fact, I’m surprised that Ref was proposed instead of a tuple - what makes it a better choice? At least with a tuple you can guess that the author of the code wanted to make an iterable). But all of these are clutches. They create noise in the code, they are unintuitive. I think that behaving like a singleton in broadcasting is a natural behaviour for anything that is not an iterable.

Raf · February 14, 2019, 9:06am

Sure, I meant either of those docs, some improvements connecting Ref and broadcast. But I also agree that it seems arbitrary which wrapper to use. (,) does seem more intuitive.

Treating everything as an iterable by default could probably be achieved with:

length(x) = 1

but there is probably a reason that’s a bad idea, broadcast has a few more moving parts than this…

tkf · February 14, 2019, 9:12am

julia> (1,) .+ 2
(3,)

julia> Ref(1) .+ 2
3

fjarri · February 14, 2019, 9:14am

You’d have to define iterate as well. Alternatively, you could broadcast as singleton anything that doesn’t define length(), but I suspect it would affect performance (reflection like that is probably too slow).

fjarri · February 14, 2019, 9:16am

I see. I feel like we’re getting offtopic, but this behaviour of Ref seems weird to me. It’s a container like a tuple, why isn’t it preserved?

Edit: Actually, sorry, I see that there’s a difference, but I don’t see why it makes Ref a better choice.

kristoffer.carlsson · February 14, 2019, 9:23am

julia> axes((1,))
(Base.OneTo(1),)

julia> axes(Ref(1))
()

Basically, a tuple is one dimensional while a Ref is zero dimensional.

stevengj · February 14, 2019, 12:37pm

As mentioned by others, broadcast defaults to treating its arguments as collections. Types can opt in to being treated as “scalars” by defining Base.broadcastable(x::MyType) = Ref(x), or you can opt-in at the call site with an explicit Ref.

That being said, I agree that RNG types should opt-in to being treated as scalars. Fortunately, it is a one-line non-breaking change to add this to any existing type:

https://github.com/JuliaLang/julia/issues/31071

fjarri · February 14, 2019, 1:00pm

Thanks for that, it seems like a better way to customise broadcasting behaviour than overriding length and iterate.

I’m still a bit weirded out by using Ref in this context though. Namely, I don’t understand why it having zero dimensions makes it a better choice; and, more importantly, why Ref incidentally having zero dimensions (which is kind of implied by Ref objects are dereferenced (loaded or stored) with []. in its docstring, but seems like an implementation detail, because it can easily get a custom getindex() in future) means that it should be used here. Would it really be clear for anyone with some Julia programming experience why Ref is used in that line?

Tamas_Papp · February 14, 2019, 1:15pm

I think that historically Ref precedes the current broadcasting implementation, and was kind of hijacked for the purpose. It is innocuous, but I agree that it can be confusing. See

tkf · February 14, 2019, 11:37pm

It’s still an off topic, but let me mention that a syntax &x for making x a scalar in broadcast is proposed:

https://github.com/JuliaLang/julia/issues/27563

If this happens I think Ref just becomes a lower-level detail for most of users.

fjarri · February 14, 2019, 11:51pm

Honestly, I don’t really see a qualitative difference between it and Ref. It’s a contraction, which is nice, but & is just as low-level. It is still associated with “reference”, not with “scalar”. My point (which I think people keep missing) was that I, or a user of my library, shouldn’t have to do anything to mark that something that is not an iterable (and the compiler knows that) should be treated as a scalar. @stevengj above agreed that RNG objects should be treated as scalars; why doesn’t it apply to any other non-iterable? Is there some conceptual problem with that, or is it just impossible to implement at this point without breaking existing code?

stevengj · February 15, 2019, 1:54am

It’s not currently possible to identify “non-iterable” reliably at compile-time, because “iterable” is not a type.

There was a basic design choice that needed to be made for broadcast: should it default to treating arguments as scalars or as collections, with the opposite behavior being opt-in? Initially, it defaulted to treating things as scalars, but it was changed to default to collections after a long discussion (https://github.com/JuliaLang/julia/issues/18618 and https://github.com/JuliaLang/julia/pull/26435). At this point, the default is set in stone — it cannot be changed in 1.x without breaking backwards compatibility.

There were reasonable arguments on both sides. One advantage of defaulting to a collection is that when a type like AbstractRNG arises where this is the wrong choice, we can update it in 1.x without breaking anything, which is not true of the opposite default.

Topic		Replies	Views
recent broadcast changes (iterate by default), scalar struct, and `@.` Internals & Design broadcast	68	7765	January 9, 2019
Marking types as scalar for broadcasting, Ref vs. Tuple? General Usage	2	691	September 24, 2019
A more intuitive way to treat an object as a scalar in broadcasting? Internals & Design broadcast	9	692	March 5, 2020
What is Ref? New to Julia broadcasting	26	12002	January 19, 2022
Use of `Ref` in broadcasting user-defined functions New to Julia question	2	853	July 6, 2021

Is there a reason a `MersenneTwister` value is not broadcastable?

Related topics