Why does `Flux.stack` use splatting?

bad_at_math · July 19, 2021, 12:45pm

The `Flux.stack` function is defined here: https://github.com/FluxML/Flux.jl/blob/b78a27b01c9629099adb059a98657b995760b617/src/utils.jl#L476, and it is very simple:
`stack(xs, dim) = cat(unsqueeze.(xs, dim)..., dims=dim)`

However, its implementation seems contrary to the “Performance Tips” page of the Flux documentation, specifically: "When doing this kind of concatenation use `reduce(hcat, xs)` rather than `hcat(xs...)`. This will avoid the splatting penalty, and will hit the optimised reduce method."

Is there a reason why `Flux.stack` uses splatting rather than `reduce`?
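
For reference, the two forms contrasted in the quoted tip can be sketched in plain Base Julia as below. The input `xs` is made-up illustration data, not anything from Flux, and the snippet only checks that both forms give the same result; it does not measure the splatting penalty.

```julia
# Plain Base Julia; Flux is not needed for this comparison.
# `xs` is hypothetical sample data: four length-3 vectors.
xs = [fill(Float64(i), 3) for i in 1:4]

splatted = hcat(xs...)        # every element becomes a separate argument
reduced  = reduce(hcat, xs)   # single call on the collection, no splatting

@assert splatted == reduced
@assert size(reduced) == (3, 4)
```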

baggepinnen · July 19, 2021, 1:12pm

`hcat` fixes the dimension to 2, whereas `stack` is generic with respect to the dimension.
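
A minimal sketch of what this means in practice, using only Base Julia. The names `unsqueeze_dim`, `stack_splat` and `stack_reduce` are hypothetical stand-ins written for this illustration and are not Flux's API. As far as I know, Base only provides specialised `reduce` methods for `hcat` and `vcat`, so a `reduce` over a closure around `cat` falls back to generic pairwise reduction and does not hit an optimised method.

```julia
# Hypothetical stand-in for Flux's unsqueeze: insert a singleton dimension at `dim`.
unsqueeze_dim(x, dim) = reshape(x, (size(x)[1:dim-1]..., 1, size(x)[dim:end]...))

# Same shape as the definition quoted in the question: splat into cat.
stack_splat(xs, dim) = cat(unsqueeze_dim.(xs, dim)...; dims=dim)

# Reduce-based variant. The closure is an ordinary binary function, so this
# concatenates pairwise and copies the accumulated array at every step.
stack_reduce(xs, dim) = reduce((a, b) -> cat(a, b; dims=dim), unsqueeze_dim.(xs, dim))

# Hypothetical sample data: four length-3 vectors.
xs = [fill(Float64(i), 3) for i in 1:4]

@assert stack_splat(xs, 1) == stack_reduce(xs, 1)
@assert size(stack_splat(xs, 1)) == (4, 3)

# Only for dim == 2 does the generic version coincide with reduce(hcat, xs).
@assert stack_splat(xs, 2) == reduce(hcat, xs)
```

Both variants agree on the result; the difference is that the `reduce` version gives up the single-call concatenation without gaining a specialised method in return, which is consistent with the point above that the tip applies to `hcat` specifically.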
