Normal vs broadcasted slice assignment

sijo · February 16, 2024, 10:24am

This must have been discussed a dozen times but I couldn’t find a thread about this precise issue:

using BenchmarkTools

f1(v, x) = v[1:length(x)] = x
f2(v, x) = v[1:length(x)] .= x

julia> @btime f1($(rand(1000)), $(rand(100)));
  10.268 ns (0 allocations: 0 bytes)

julia> @btime f2($(rand(1000)), $(rand(100)));
  25.415 ns (0 allocations: 0 bytes)

Is this expected? Does it have to do with unaliasing? I wonder what exactly is causing the slowdown, since there are 0 allocations in both cases.

mbauman · February 16, 2024, 5:49pm

No, I think this is just the difference between a highly specialized memcpy that hits the Vector’s memory directly and a hand-written for loop that works with all abstract arrays.
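To make that distinction concrete, here is a minimal sketch of the two styles of copy. The names `copy_loop!` and `copy_bulk!` are hypothetical helpers made up for illustration; neither is what Base actually runs for `f1` or `f2`. The five-argument `copyto!` is the standard offset-based bulk copy.

```julia
# Hypothetical helpers for illustration only; not Base's implementations.

# Generic element-by-element copy: works for any pair of abstract vectors.
function copy_loop!(v::AbstractVector, x::AbstractVector)
    for (i, xi) in zip(eachindex(v), x)
        v[i] = xi          # one setindex! per element
    end
    return v
end

# Bulk copy restricted to Vector: copies length(x) elements from x
# (starting at index 1) into v (starting at index 1) in a single call.
copy_bulk!(v::Vector, x::Vector) = copyto!(v, 1, x, 1, length(x))

v1 = zeros(6)
v2 = zeros(6)
x = [1.0, 2.0, 3.0]
copy_loop!(v1, x)
copy_bulk!(v2, x)
```

Both calls leave the same contents in the destination; whether they differ in speed depends on what the compiler makes of the loop.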

Oscar_Smith · February 16, 2024, 6:19pm

These both should turn into memcpy. IMO this is unexpected.

Oscar_Smith · February 16, 2024, 6:34pm

So the difference is that f2 turns into a copyto!(view(a, 1:100), b) while f1 turns into a setindex!. So the problem is just that we don’t have an optimized method for copying an Array into a view of another Array.
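For anyone who wants to see the two code paths side by side, here is a rough sketch of what each syntax expands to. It assumes the lowering stays in this shape across Julia versions, which may not hold exactly; `Meta.@lower v[1:3] .= x` will show what your version does. The variable names `v`, `w`, `x` and `dest` are just for illustration.

```julia
v = zeros(5)
w = zeros(5)
x = [1.0, 2.0, 3.0]

# `v[1:length(x)] = x` is syntax for a single setindex! call.
setindex!(v, x, 1:length(x))

# `w[1:length(x)] .= x` first builds a view of the destination ...
dest = Base.Broadcast.dotview(w, 1:length(x))

# ... then materializes the (trivial, identity) broadcast into that view.
Base.Broadcast.materialize!(dest, Base.Broadcast.broadcasted(identity, x))
```

For a plain `Array` destination, `dest` is a `SubArray`, so the broadcasted version ends up copying into a view rather than into the array itself.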

danielwe · February 16, 2024, 7:25pm

Another difference is that f2 returns a view of v, while f1 returns x. Changing both functions to return nothing improves the performance of f2 somewhat, although not enough to make up the difference.
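A small check of the return values, plus variants that return nothing. The names `g1` and `g2` are hypothetical, introduced only for this sketch.

```julia
f1(v, x) = v[1:length(x)] = x
f2(v, x) = v[1:length(x)] .= x

v = zeros(5)
x = [1.0, 2.0]

r1 = f1(v, x)   # an assignment expression evaluates to its right-hand side
r2 = f2(v, x)   # a broadcasted assignment returns the destination view

# Hypothetical variants that discard the result, for a like-for-like benchmark.
function g1(v, x)
    v[1:length(x)] = x
    return nothing
end

function g2(v, x)
    v[1:length(x)] .= x
    return nothing
end
```

Here `r1 === x` holds, while `r2` is a `SubArray` whose parent is `v`.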

jishnub · February 16, 2024, 8:10pm

In another thread, Chris Elrod had suggested that the differences arise from non-temporal stores, and that LoopVectorization provides a Julia equivalent.
