Avoid getting a ReshapedArray when calling reshape?

DrChainsaw · October 29, 2020, 6:16pm

Flux has the clever array type Zeros for which arithmetic operations are implemented as basically noops (well, I guess you get the idea).

The main use case is to turn off the bias term, but the problem is that some ops need to reshape it, and then the magic disappears:

julia> using Flux

julia> z = zeros(1000,1000);

julia> zz = Flux.Zeros(1000,1000);

julia> using BenchmarkTools

julia> @btime z+z;
  3.947 ms (2 allocations: 7.63 MiB)

julia> @btime z+zz;
  64.287 ns (1 allocation: 32 bytes)

julia> @btime z+reshape(zz,1000,:);
  3.987 ms (11 allocations: 7.63 MiB)

julia> typeof(reshape(zz,1000,:))
Base.ReshapedArray{Bool,2,Flux.Zeros{Bool,2},Tuple{}}

What is the best way to prevent this from happening?

Implementing reshape for Zeros is an option, of course. But reshape has more than a handful of methods, all of which specialize on the second argument, so one must implement all of them or else there will be ambiguities, and this feels a bit brute-force-ish. I guess another option is to implement methods for ReshapedArray of Zeros, but this is probably even worse in this respect.
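To make the first option concrete, here is a minimal sketch using a hypothetical stand-in type, MyZeros, rather than Flux's actual Zeros. It assumes that Base's varargs and Colon methods of reshape all forward to the (array, Dims) method, so only that one is overloaded. I have not checked this assumption against every Julia version, so treat it as an experiment, not as a confirmed fix.

```julia
# Hypothetical stand-in for Flux.Zeros (not Flux's actual implementation).
struct MyZeros{T,N} <: AbstractArray{T,N}
    dims::NTuple{N,Int}
end
MyZeros(dims::Int...) = MyZeros{Bool,length(dims)}(dims)

Base.size(z::MyZeros) = z.dims
# Every element is zero; bounds checking is omitted to keep the sketch short.
Base.getindex(z::MyZeros{T,N}, I::Vararg{Int,N}) where {T,N} = zero(T)

# Assumption: reshape(z, 8, :), reshape(z, 24), etc. end up calling this
# method after Base has resolved the Colon and packed the varargs into a tuple.
function Base.reshape(z::MyZeros{T}, dims::Dims{N}) where {T,N}
    prod(dims) == length(z) ||
        throw(DimensionMismatch("new dimensions $dims must match array length $(length(z))"))
    return MyZeros{T,N}(dims)
end

zz = MyZeros(4, 6)
r = reshape(zz, 8, :)   # intended to stay a MyZeros instead of a ReshapedArray
```

If this holds, typeof(r) should be a MyZeros and not a Base.ReshapedArray, so any methods specialized on the zeros type would still apply after the reshape.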

Is there a more elegant way?
