Investigating numerical change in function return value between v1.4 vs v1.5

Elrod · June 13, 2020, 9:53pm

I’m curious about this as well.
FWIW, you don’t need -O3, all you need is -instcombine to get the fdiv fast. So it seems to have been a change with the instcombine pass between versions.

I also recently encountered an issue with fdiv and instcombine that started with LLVM 9 where the instcombine moves an fdiv inside a loop, dramatically worsening performance.
My issue on LLVM was closed because that was intended behavior; you’re supposed to place a licm at some point after the last instcombine to move the division back out of the loop.

Maybe you could file this as an instcombine issue with LLVM. It seems reasonably likely they’re connected (instcombine getting more aggressive with divisions from LLVM 8 to 9), but your example seems harder to close as a Julia issue / shows up with the default -O3 optimization pipeline.

Topic		Replies	Views
Output for @code_llvm changed from 0.6 to 0.7, 1.0 General Usage	17	1317	August 31, 2018
How to avoid ForwardDiff.jl generating a second-order derivative that wastes flops by eventually multiplying by zero Performance llvm , differentiation , forwarddiff	10	1084	January 25, 2021
Trivial code change causes vectorization failure Performance	3	681	February 1, 2019
Different `@code_llvm` output on macos and x86 Performance simd	4	314	December 8, 2023
Significant difference in execution speed between similar functions with nested loops Performance question	4	247	March 8, 2023

Investigating numerical change in function return value between v1.4 vs v1.5

Related topics