SIMD struggles, seeking solutions (with KangarooTwelve.jl)

SIMD.jl has some checks that are supposed to fix the undefined behavior:

But perhaps it is buggy?

That’s interesting Julius. Thanks for that analysis.

I might consider that hacky approach in future, though I’m also interested to see if https://github.com/JuliaLang/julia/pull/44186 (kindly pointed out to me by Sukera) might remove the overhead that’s being such a pain here?

Likely! Shouldn’t these be y >= sizeof(T1)*8?

Indeed, fix some shift operations to agree with Base (and prevent undefined behavior) by KristofferC · Pull Request #119 · eschnett/SIMD.jl · GitHub should fix that I think

3 Likes