Reducing over Point objects

xor0110 · October 30, 2025, 9:32am

Julia has many statistics-computing functions, typically of the “reduce” kind, that can take either a simple vector or an array, computing the output over rows, columns, etc., according to the dims parameter.

Is there a version of this that goes over a collection of vectors, tuples or Points?

WalterMadelim · October 30, 2025, 10:41am

According to my understanding, the function map is way more useful than reduce in your statistics context, because reduce is for 2-arg functions, e.g. * and +.

eldee · October 30, 2025, 6:27pm

Could you give some examples of what exactly you want to achieve? I would expect reduce to already work for a well-chosen function.

julia> x = rand(Tuple{Float64, Float64}, 3)
3-element Vector{Tuple{Float64, Float64}}:
 (0.25205814867458654, 0.14284143829270268)
 (0.10081672872462866, 0.5909316913764835)
 (0.14787926347212343, 0.4637800398205445)

julia> reduce((a, b) -> max.(a, b), x)
(0.25205814867458654, 0.5909316913764835)

julia> reduce((a, b) -> a .+ b, x)
(0.5007541408713386, 1.1975531694897308)

stevengj · October 30, 2025, 6:30pm

You can also just do reduce(.+, x).

If your type supports a + function, like a vector, you can just use a function like sum and it will work.
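
A minimal sketch of both suggestions, using plain tuples and vectors as stand-ins for points so that no package is needed. The standalone `.+` form assumes Julia 1.6 or later; the variable names are only illustrative.

```julia
# Tuples stand in for points here; no packages required.
x = [(1.0, 2.0), (3.0, 4.0), (5.0, 6.0)]

# `.+` on its own is a broadcasting function object (assumes Julia 1.6 or later).
total = reduce(.+, x)

# Tuples do not define `+`, but vectors do, so `sum` applies directly
# to a vector of vectors.
v = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
vtotal = sum(v)

println(total)   # (9.0, 12.0)
println(vtotal)  # [9.0, 12.0]
```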

WalterMadelim · October 31, 2025, 12:28am

Admittedly, reduce is fast in some cases. But I still don’t think it is appropriate for statistics, e.g.

julia> v = [1,2,3,4]; # a vector of samples

julia> maximum(abs2, v)
16

julia> maximum(abs2, reverse(v))
16

julia> reduce((a, b) -> max(abs2(a), abs2(b)), v)
256

julia> reduce((a, b) -> max(abs2(a), abs2(b)), reverse(v))
65536

As the docs suggest, reduce is meant almost exclusively for +, *, max, min.

Sevi · October 31, 2025, 7:36am

Well, these are just two different things that are being computed. Not sure if that makes reduce more or less suitable for statistics than map…

But we don’t even have to choose between map and reduce; we can just use mapreduce:

julia> v = [1,2,3,4];

julia> mapreduce(abs2, max, v)
16

julia> mapreduce(abs2, max, reverse(v))
16

I’m not sure about this either, but the docs do mention explicitly that maximum, sum, etc. should be preferred over writing out reduce(max, ...), reduce(+, ...) etc. if possible. So in that example, I agree that maximum(abs2, v) is more intuitive.
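
As a rough illustration of how the same mapreduce pattern might carry over to the original question, the sketch below applies it componentwise to tuples, which stand in for points here. The names `pts` and `largest_abs` are made up for the example.

```julia
v = [1, 2, 3, 4]

# Map first, then reduce: abs2 is applied exactly once per element.
@assert mapreduce(abs2, max, v) == maximum(abs2, v) == 16

# Same pattern, componentwise, on tuples standing in for points.
pts = [(1.0, -4.0), (-3.0, 2.0)]
largest_abs = mapreduce(p -> abs.(p), (a, b) -> max.(a, b), pts)

println(largest_abs)  # (3.0, 4.0)
```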

xor0110 · November 20, 2025, 6:59pm

Apologies that my first message was not too clear. These examples should better illustrate what happens.


julia> data = randn(Point3f,111);

julia> mean(data)
3-element Point{3, Float32} with indices SOneTo(3):
 0.09946571
 0.043787897
 0.029109783

julia> median(data)
-0.20733127f0

julia> maximum(data)
3-element Point{3, Float32} with indices SOneTo(3):
  2.6677918
  1.3926599
 -0.09724049

The challenge is that Point has specific semantics associated with each of these operations I’m interested in. For mean or sum, it works as I want. For the other operations, it does not: I want to apply the computation across each dimension separately. This would take some kind of specialized functor for Points. It would be the same as stacking the vector of points, doing maximum(xx, dims=2), and returning the result as a Point. The whole point of this is that I’m trying to use Point more often instead of vectors, so I’m trying to make operations that I commonly use with vectors and arrays easier to perform when my data is represented as a collection of Points.
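
The stacking route described above could look roughly like this. It is only a sketch: tuples stand in for Points, `stack` assumes Julia 1.9 or later, and the final conversion would be to a Point rather than a tuple in real code.

```julia
# Four 3-component "points", represented as tuples for this sketch.
pts = [(1.0, 5.0, -2.0), (3.0, 0.0, 4.0), (2.0, 7.0, 1.0), (0.0, 1.0, 9.0)]

xx = stack(pts)            # 3×4 matrix, one point per column (needs Julia 1.9+)
m = maximum(xx, dims=2)    # 3×1 matrix of per-coordinate maxima
result = Tuple(m)          # back to a point-like tuple

println(result)  # (3.0, 7.0, 9.0)
```

Note that `stack` copies the data into a new matrix, which is simple but not free.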

eldee · November 20, 2025, 7:52pm

Addition and scaling of Point3fs indeed behave as you expect, i.e. componentwise, and therefore sum and mean work for you without issue. But Point3fs are sorted lexicographically, so maximum(data), which should return the largest Point3f in data according to this order, typically simply yields the one with the largest first component.

I’m not completely sure what the rationale is for median, but we get the middle of the median point (the 56th of the 111 points in sorted order): mean(extrema(sort(data)[56])). In fact, as median(rand(Point3f, 2)) throws, this is probably not intended. Personally, I would expect to get a Point3f back: the (mean of the) middle (two) Point3f(s), where we again use the default lexicographic order.
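
To make the two behaviours above concrete, here is a small sketch using tuples, which also compare lexicographically in Julia, as a stand-in for Point3f. It shows lexicographic maximum, and `middle` from the Statistics standard library collapsing an array to the midpoint of its extrema.

```julia
using Statistics  # standard library; provides `middle`

# Tuples compare lexicographically, so `maximum` picks the element with the
# largest first component and only consults later components to break ties.
pts = [(1.0, 9.0), (2.0, 0.0), (2.0, -1.0)]
println(maximum(pts))  # (2.0, 0.0)

# `middle` of an array is the midpoint of its extrema, which collapses
# a whole array to a single number.
println(middle([1.0, 2.0, 6.0]))  # 3.5
```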

In any case, my point is that Vector{Point3f}s are not the type to use if you want e.g. componentwise maximum. But you can always just ‘convert’ it to a Matrix for free using reinterpret and then use the maximum(..., dims=2) you mentioned:

julia> data = randn(Point3f, 3)
3-element Vector{Point{3, Float32}}:
 [1.0567589, 0.5325625, -0.87686133]
 [1.8618298, 0.081127204, 0.8937693]
 [1.2074091, 0.44339427, -0.09087456]

julia> maximum(data)
3-element Point{3, Float32} with indices SOneTo(3):
 1.8618298
 0.081127204
 0.8937693

julia> maximum(reinterpret(reshape, Float32, data), dims=2)
3×1 Matrix{Float32}:
 1.8618298
 0.5325625
 0.8937693

xor0110 · November 21, 2025, 8:47am

Correct! The point is, I would like a functor that does this job generically. Not sure what to call it or what the best implementation would be… pointreduce(F, mypoints) = F(reinterpret(mypoints ,...), dims=2) something like this…
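
One way the idea above might be fleshed out, as a sketch only. The name `pointreduce` is hypothetical and not part of any package; tuples stand in for Points; and it assumes at least two components per point and that `f` accepts a `dims` keyword, as maximum and median do.

```julia
using Statistics  # standard library; provides `median`

# Hypothetical helper, not part of any package. Tuples stand in for points.
# Assumes N >= 2 and that `f` accepts a `dims` keyword.
function pointreduce(f, pts::AbstractVector{NTuple{N,T}}) where {N,T}
    mat = reinterpret(reshape, T, pts)   # N × length(pts) view, no copy
    col = f(mat; dims=2)                 # N × 1 result, one entry per coordinate
    return ntuple(i -> col[i], Val(N))   # convert back to a point-like tuple
end

pts = [(1f0, 5f0, -2f0), (3f0, 0f0, 4f0), (2f0, 7f0, 1f0)]
println(pointreduce(maximum, pts))  # (3.0f0, 7.0f0, 4.0f0)
println(pointreduce(median, pts))   # (2.0f0, 5.0f0, 1.0f0)
```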

eldee · November 22, 2025, 10:27am

You could use something like

componentwise(pts::AbstractVector{Point3f}, f, args...; kwargs...) = Point3f(ntuple(i -> f(args..., view(reinterpret(reshape, Float32, pts), i, :); kwargs...), Val(3)))

julia> data = randn(Point3f, 3)
3-element Vector{Point{3, Float32}}:
 [1.5850062, 0.63555634, -2.3332644]
 [-0.85313797, 0.6584275, -0.274162]
 [-0.47674105, -0.7140366, -1.309917]

julia> componentwise(data, median)
3-element Point{3, Float32} with indices SOneTo(3):
 -0.47674105
  0.63555634
 -1.309917

julia> componentwise(data, reduce, max; init=1.f0)
3-element Point{3, Float32} with indices SOneTo(3):
 1.5850062
 1.0
 1.0

I’m not sure if this is better than e.g. maximum(reinterpret(reshape, Float32, data), dims=2), but note that in componentwise(data, maximum) we are certainly not accessing the entries in memory order. For Point3f this is probably not too much of an issue though. Also, for some reason (related to the splatting and maybe the anonymous function capture) we have two allocations even though componentwise feels like it should be allocation-free if f is so.

If you want to squeeze out every last bit of performance, this approach is then not fully optimal, but if not, it could be convenient.
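
For reductions built from a two-argument operator (max, min, +), a single pass in memory order is possible with the broadcasted reduce shown earlier in the thread. The sketch below, with tuples standing in for Point3f, checks that it agrees with the row-by-row approach; it does not cover functions like median, which need a whole coordinate at once, and it makes no claim about which variant is faster.

```julia
pts = [(1.5f0, 0.6f0, -2.3f0), (-0.8f0, 0.7f0, -0.3f0), (-0.5f0, -0.7f0, -1.3f0)]

# One pass over the points in memory order, combining all components at each step.
onepass = reduce((a, b) -> max.(a, b), pts)

# Row by row over a 3×N view: each row is a strided walk through memory.
mat = reinterpret(reshape, Float32, pts)
rowwise = ntuple(i -> maximum(view(mat, i, :)), Val(3))

println(onepass == rowwise)  # true
```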
