Package for lazy hcat/vcat of a large number of vectors

Tamas_Papp · August 31, 2022, 10:50am

I am looking for the lazy equivalent of

v = [rand(500) for _ in 1:5000];
mapreduce(permutedims, vcat, v)

where both dimensions may be large (up to 10⁵). This is trivial to code up but I want to avoid duplication.

Is there a package which has such functionality?
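
For reference, the hand-rolled version that the post calls trivial to code up might look roughly like the sketch below. It uses only Base Julia. The type name `LazyRows` and its fields are hypothetical and do not come from any package, and the sketch assumes ordinary 1-based vectors of equal length.

```julia
# Hypothetical type: presents a vector of vectors as a matrix without copying.
struct LazyRows{T,V<:AbstractVector{<:AbstractVector{T}}} <: AbstractMatrix{T}
    rows::V      # rows[i] is shown as row i of the matrix
    ncols::Int   # common length of the inner vectors
end

function LazyRows(rows::AbstractVector{<:AbstractVector{T}}) where {T}
    isempty(rows) && throw(ArgumentError("need at least one vector"))
    n = length(first(rows))
    all(r -> length(r) == n, rows) ||
        throw(DimensionMismatch("all vectors must have the same length"))
    return LazyRows{T,typeof(rows)}(rows, n)
end

# The two methods needed for a read-only AbstractMatrix.
Base.size(A::LazyRows) = (length(A.rows), A.ncols)
Base.getindex(A::LazyRows, i::Int, j::Int) = A.rows[i][j]

v = [rand(500) for _ in 1:5000]
A = LazyRows(v)                          # no data is copied here
A == mapreduce(permutedims, vcat, v)     # compares against the eager result
```

Because the wrapper only stores a reference to `v`, construction cost does not depend on the element count beyond the length check.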

I thought of LazyArrays.jl but it only has vararg syntax, cf

fabiangans · August 31, 2022, 10:56am

This one from RecursiveArrayTools usually works quite well:

https://recursivearraytools.sciml.ai/stable/array_types/#RecursiveArrayTools.VectorOfArray

Tamas_Papp · August 31, 2022, 10:57am

Thanks! After I pressed submit, I also realized that

using JuliennedArrays
Align(v, False(), True())

works too.

aplavin · August 31, 2022, 11:04am

SplitApplyCombine.jl has functions specifically for this, both lazy and eager.

julia> using SplitApplyCombine

# eager - combinedims
julia> combinedims(v)
500×5000 Matrix{Float64}:
...

julia> combinedims(v, 1)
5000×500 Matrix{Float64}:
...

# lazy - just change to combinedimsview
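
A short sketch of the lazy variant, assuming `combinedimsview` accepts the same vector of vectors as `combinedims` above. The `transpose` call is Base Julia and is used here only to obtain the 5000×500 orientation from the original question without copying.

```julia
using SplitApplyCombine

v = [rand(500) for _ in 1:5000]

m = combinedimsview(v)     # lazy view with the same shape as combinedims(v)
size(m)                    # same size as the eager result above
m[3, 7] == v[7][3]         # element (i, j) is read from v[j][i]

mt = transpose(m)          # lazy wrapper with rows and columns swapped
mt[7, 3] == v[7][3]
```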

The inverse operation is there as well - splitdims.

rafael.guerra · August 31, 2022, 11:18am

You could also use TensorCast, which has the most intuitive syntax:

using TensorCast
@cast m[i,j] := v[i][j]

Tamas_Papp · August 31, 2022, 1:58pm

This led me to LazyArrays.stack, which I ended up using.

Thanks for all the great replies.

Tamas_Papp · August 31, 2022, 2:01pm

A related question: what if the elements are matrices, as in

v = [rand(500, 50) for _ in 1:5];
reduce(vcat, v) # need lazy version

rafael.guerra · August 31, 2022, 2:22pm

To combine two indices into one, you can use TensorCast’s operator ⊗:

@cast m[j⊗i, k] := v[i][j,k]  (i in 1:5)

quinnj · August 31, 2022, 4:59pm

There is also the SentinelArrays.ChainedVector type, specifically for vectors. It uses a vector internally, so it won’t have the same StackOverflow problem.
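
The idea of keeping the pieces in a vector (rather than a tuple or varargs) can also be applied to the matrix question above. The following is a minimal sketch in plain Julia, not the implementation of SentinelArrays or any other package. The name `LazyVcat` and its fields are hypothetical, and 1-based matrices with equal column counts are assumed.

```julia
# Hypothetical type: a lazy vertical concatenation of matrices stored in a Vector.
struct LazyVcat{T,M<:AbstractMatrix{T}} <: AbstractMatrix{T}
    blocks::Vector{M}      # the original matrices, not copied
    offsets::Vector{Int}   # offsets[b] = number of rows before block b
    nrows::Int             # total number of rows
end

function LazyVcat(blocks::Vector{M}) where {T,M<:AbstractMatrix{T}}
    isempty(blocks) && throw(ArgumentError("need at least one matrix"))
    ncols = size(first(blocks), 2)
    all(B -> size(B, 2) == ncols, blocks) ||
        throw(DimensionMismatch("all matrices must have the same number of columns"))
    heights = [size(B, 1) for B in blocks]
    offsets = cumsum(heights) .- heights   # exclusive cumulative sum
    return LazyVcat{T,M}(blocks, offsets, sum(heights))
end

Base.size(A::LazyVcat) = (A.nrows, size(first(A.blocks), 2))

function Base.getindex(A::LazyVcat, i::Int, j::Int)
    @boundscheck checkbounds(A, i, j)
    b = searchsortedlast(A.offsets, i - 1)   # last block starting before row i
    return A.blocks[b][i - A.offsets[b], j]
end

v = [rand(500, 50) for _ in 1:5]
A = LazyVcat(v)
A == reduce(vcat, v)    # compares against the eager result
```

The binary search over `offsets` makes each lookup logarithmic in the number of blocks, and since the blocks live in a `Vector` nothing is splatted, whatever the number of matrices.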

aplavin · September 1, 2022, 10:11am

Exactly the same solution with combinedimsview (:
