Deepsimilar()?

nicolas · August 28, 2022, 6:34pm

I am writing an optimization package where users can input arrays of arrays. For simple arrays, initialization can be done using similar() without copying values. For arrays of arrays however, I haven’t found anything other than deepcopy() to initialize them. Since I only need the memory allocated with the appropriate structure, something like deepsimilar() would seem cleaner and save some time. Does it already exist? Is it too specific to be useful?

ToucheSir · August 28, 2022, 6:40pm

Depending on how many levels of nesting your array has, map(similar, array_of_arrays) does the trick. This assumes none of the inner arrays are the same, e.g. (x = [...]; array_of_arrays = [x, x]).

On that note, many packages which expose specialized types for arrays of arrays also implement this for you. Here is how RecursiveArrayTools does it.

nicolas · August 28, 2022, 6:47pm

Thanks that works! Always amazed seeing answers coming in faster than the time it took to write my question .

jling · August 28, 2022, 7:14pm

why does this limitation exist? I think even there’s aliasing in the inner arrays, when you “deep similar”, you probably don’t want the new array to keep the aliasing structure anyway?

nicolas · August 29, 2022, 1:24am

As you pointed out, it depends on how many levels x has. map(similar, x) works for arrays of arrays, but not for arrays of arrays of arrays (and deeper). And RecursiveArrayTools works for vectors of arrays. I think your solution can be made to work for an arbitrary number of levels this way:

deepsimilar(x) = eltype(x) == eltype(eltype(x)) ? similar(x) : map(deepsimilar, x)

nicolas · August 29, 2022, 8:08pm

By the way, the code for deepsimilar will create independent elements even when the initial arrays all pointed to the same object:

julia> deepsimilar(x) = eltype(x) == eltype(eltype(x)) ? similar(x) : map(deepsimilar, x)
deepsimilar (generic function with 1 method)

julia> x = [1,2];

julia> y = [x,x];

julia> z = [y,y]
2-element Vector{Vector{Vector{Int64}}}:
 [[1, 2], [1, 2]]
 [[1, 2], [1, 2]]

julia> deepsimilar(z)
2-element Vector{Vector{Vector{Int64}}}:
 [[234558320, 234558208], [287367152, 1976577520]]
 [[287367152, 2094960320], [287367152, 1976577520]]

jling · August 29, 2022, 8:10pm

that’s what I want to know, is there any reason to NOT do this?

ToucheSir · August 29, 2022, 10:56pm

There are cases where you want structural sharing in a collection of arrays (e.g. tied weights in ML models), but I’m not aware of any for arrays of arrays. The note about inner arrays was mostly added for completeness.

Topic		Replies	Views
Deepsimilar Internals & Design proposal	3	836	February 25, 2017
A function like copyto! that acts recursively as deepcopy General Usage	12	549	March 27, 2021
Why does `similar(::SharedArray)` create an `Array`? General Usage question	2	773	October 25, 2017
`similar` does not produce a SharedArray when fed one Internals & Design parallel	1	544	April 19, 2019
Deepcopying struct with array fields General Usage question , deepcopy	14	1551	August 12, 2019

Deepsimilar()?

Related topics