Proposal: function for mutating operators?

aryavorskiy · February 4, 2024, 7:55pm

I want my custom type to implement indexed increments A[...] += B, but not assignments A[...] = B.

Currently, syntax like A[1:3] += [1, 2, 3] is parsed A[1:3] = A[1:3] + [1, 2, 3]. This is straightforward, but unoptimal, because we have excessive memory allocation when A[1:3] is called. Another bad consequence of this language design is that we cannot overload the += operator.

My proposal is the following: interpret the aforementioned expression to a call like mutate_setindex!(A, [1, 2, 3], +, 1:3), similar to how setindex! works now. The default implementation will be very simple:

mutate_setindex!(A::Any, B::Any, f::Function, I...) = A[I...] += f(A[I...], B)

#optimization for AbstractArrays
function mutate_setindex!(A::AbstractArray, B::AbstractArray, f::Function, I...)
    # Shape check...
    @. A[I...] += f(view(A, I...), B)
end

Also now package developers will be able to make their own implementations of mutating operators like +=.
Does Julia really need such a feature?

jar1 · February 4, 2024, 8:03pm

Does this do what you want?

julia> f(x, y) = x[1:3] += y
f (generic function with 1 method)

julia> g(x,y) = x[1:3] .+= y
g (generic function with 1 method)

julia> h(x,y) = @views x[1:3] .+= y
h (generic function with 1 method)

julia> let x=[1,2,3,], y=[4,5,6]; 
       @time f(x,y)
       @time g(x,y)
       @time h(x,y)
       end;
  0.000002 seconds (2 allocations: 160 bytes)
  0.000000 seconds (1 allocation: 80 bytes)
  0.000000 seconds

aryavorskiy · February 4, 2024, 8:10pm

Not really. I’m speaking of interface, not performance. For my custom type, I want getindex and setindex! forbidden, but such mutating operations possible.

To be more precise, I want to create an abstraction that allows seamlessly building sparse arrays:

A = SparseMatrixBuilder(3, 3)
A[2, 3] += 4
A[1, 1] -= 1
A[2, 2] = 3       # This throws an error!
print(A[2, 2])    # This is an error, too!
a = to_matrix(A)   # This is a 3x3 sparse array:
# -1 ⋅  ⋅
#  ⋅  ⋅  4
#  ⋅  ⋅  ⋅

The reason - using a simple SparseMatrix this way is extremely slow.

Benny · February 4, 2024, 8:34pm

In many scenarios, the allocation condenses the elements in a type that allows for optimized operations, and this can outweigh the performance lost to frequent allocations.

This is very likely a breaking change. += would need to give the same results across v1 to not be breaking. You could instead make a macro that transforms this syntax, and it shouldn’t be in base Julia.

It doesn’t make sense to disallow A[2, 2] = 3 but allow A[2, 3] += 4. In both cases you are semantically setting an index of A, the only difference the latter adds the index’s existing value to the input value before setting the index. You can pull this off; your type doesn’t need to implement getindex and setindex! and can instead implement a different index-setting function digging into internal functions. But I fail to see the advantage of cutting yourself off from code using the AbstractArray interface just to impose operations before setting indices. If your issue is with the copy made when the indices are not all scalars, then you should use views, like broadcasting does.

Elrod · February 4, 2024, 8:43pm

Sparse arrays (mentioned above) have a heavy lookup/indexing cost, making them really benefit.
Dictionaries are another example.
In C++, d[k] += v would only do a single lookup, in Julia it’d do two. Pretty relevant when d is a dictionary or sparse array.

aryavorskiy · February 4, 2024, 9:19pm

I can’t see why it is a breaking change. It introduces a new level of abstraction - true. But the default implementation I provided just copies the behavior that existed before, so no new errors should be expected from this. However, I agree that one should be careful with optimizations like I proposed.

In my example this is defined by how my SparseMatrixBuilder looks like under the hood.
The fastest way to build a sparse is providing horizontal indices, vertical indices and values of nonzero elements as three separate Vectors. So the implementation looks something like this:

using SparseArrays
struct SparseMatrixBuilder{T}
    a::Int
    b::Int
    is::Vector{Int}
    js::Vector{Int}
    vs::Vector{T}
    SparseMatrixBuilder{T}(a, b) where T = new{T}(a, b, [], [], [])
end

function mutate_setindex!(A::SparseMatrixBuilder, v, ::typeof(+), i, j)
    push!(A.is, i)
    push!(A.js, j)
    push!(A.vs, v)
end

to_matrix(A::SparseMatrixBuilder) = sparse(A.is, A.js, A.vs, A.a, A.b)

Defining getindex in any way is a bad idea in this case.

Also there are other cases where one might want to overload compound assignment so that its semantics has nothing to do with indexing. For example, in some languages the += operator attaches an event listener.

Benny · February 4, 2024, 9:30pm

The closest issue I could look up: Feature request: overloadable updating operators · Issue #3217. Not very long, only covers the obvious concerns, and doesn’t address A[...] += specifically, just += in general.

That issue at least mentions that they are avoiding divergences between + and +=. So I think it would be sensible if += lowers to a higher order function that takes +.

The issue mentions a pretty serious issue with += when there is no indexing, but what Elrod said about repeated indexing makes sense. Instead of 1) index array in getindex or materialize! (1 element vs many) 2) do operation 3) index array again in setindex! or materialize! to write result, you could lower to something that does 1) index array and remember the address 2) do operation 3) write result to address.

aryavorskiy · February 4, 2024, 9:51pm

Thank you for the info! Didn’t know this question was already addressed so long ago

But my initial idea was only about indexed compound assignment - this will not break anything. As for just += without indexing - one can come up with a syntax that will clearly show that A is mutable. Like A[] += B or A[:] += B or A[!] += B (taken from DataFrames.jl).

Also if we stay limited to the features of modern Julia, we can still hack into the broadcast system to make the A[I...] .+= B notation do what we want (by redefining copyto! and dotview for A). Not sure if it’s a good idea though.

Benny · February 4, 2024, 9:54pm

Oh we wouldn’t need any extra indication that A is mutable. A[...] = ... does not work for any immutable A. But that linked issue informs that updating operators should not diverge from the original operators, so my point was rather than overloading the updating operators individually, you could have a higher-order mutate_setindex! that takes an operator + as input. This would be quite a hefty addition to the AbstractArray interface, though hopefully it can be mostly implemented in fallbacks, and a significant rewrite of broadcasting.

fredrikekre · February 4, 2024, 10:23pm

You can use this abomination:

using SparseArrays

struct SparseMatrixBuilder{T}
    a::Int
    b::Int
    is::Vector{Int}
    js::Vector{Int}
    vs::Vector{T}
    SparseMatrixBuilder{T}(a, b) where T = new{T}(a, b, Int[], Int[], T[])
end

struct SparseMatrixBuilderScalarIndexView{T}
    builder::SparseMatrixBuilder{T}
    i::Int
    j::Int
end

function Base.getindex(builder::SparseMatrixBuilder, i::Int, j::Int)
    return SparseMatrixBuilderScalarIndexView(builder, i, j)
end

struct SparseMatrixBuilderNewAddition{T}
    builder_view::SparseMatrixBuilderScalarIndexView{T}
    v::T
end

function Base.:(+)(builder_view::SparseMatrixBuilderScalarIndexView{T}, v::T) where T
    return SparseMatrixBuilderNewAddition(builder_view, v)
end

function Base.:(-)(builder_view::SparseMatrixBuilderScalarIndexView{T}, v::T) where T
    return SparseMatrixBuilderNewAddition(builder_view, -v)
end

function Base.setindex!(builder::SparseMatrixBuilder{T}, v::SparseMatrixBuilderNewAddition{T}, i::Int, j::Int) where T
    if !(i == v.builder_view.i && j == v.builder_view.j && builder === v.builder_view.builder)
        error("good error message")
    end
    if !(1 <= i <= builder.a && 1 <= j <= builder.b)
        throw(BoundsError(builder, (i, j)))
    end
    push!(builder.is, i)
    push!(builder.js, j)
    push!(builder.vs, v.v)
    return v
end

to_matrix(A::SparseMatrixBuilder) = sparse(A.is, A.js, A.vs, A.a, A.b)

julia> builder = SparseMatrixBuilder{Float64}(2, 2);

julia> builder[1, 1] += 1.2;

julia> builder[2, 2] += 3.4;

julia> builder[1, 2] -= 5.6;

julia> to_matrix(builder)
2×2 SparseMatrixCSC{Float64, Int64} with 3 stored entries:
 1.2  -5.6
  ⋅    3.4

Note that builder[2, 2] works but return a “useless” object – there are no methods for it so will just give MethodError as soon as you try to use it. builder[2, 2] = 3 is a MethodError for example.

Topic		Replies	Views
Reference/Mutation Problem General Usage	4	549	December 26, 2017
An Issue in Defining a Mutating Function General Usage question , function , functions	6	698	March 12, 2023
_set_value!(w, key, val) = w[key] = val General Usage question	2	115	January 11, 2024
Help defining setindex! with a custom structure General Usage	9	1089	September 12, 2021
Mutating arrays inside function New to Julia functions	13	1258	January 8, 2024

Proposal: function for mutating operators?

Related topics