Correct, here is what I see using @btime:
27.400 μs (1023 allocations: 43.67 KiB)
5.283 μs (2 allocations: 64 bytes)
5.267 μs (1 allocation: 16 bytes)
If you need more speed than this, then perhaps this MWE isn’t representative of your real problem? I can’t tell whether your actual problem size is larger, or whether this is a hot loop that gets executed many times.