, sorry, this did not come across as I intended and I didn’t even think that Julia was not tested enough. I only wanted – and still want – to see if this is a common problem.
Look, @tisztamo’s result is way better than mine but still it takes 3.863/(2.331+2.372+2.293)*3 = 1.656
times longer on thread 1 than on his other ones. This seems significant enough to me. Maybe @tisztamo, you can repeat with @btime startonthread(x, start, 100_000)
to see if this is no statistical fluke.
Are we yet so far? I didn’t think yet. Where should the issue go?