Fast math in NASA benchmark

PetrKryslUCSD · July 23, 2020, 7:28pm

I noticed in this benchmark that the --fast-math GCC option was far more effective in reducing the runtime then the --math-mode=fast Julia option. What do you think is the reason for that?

simonbyrne · July 23, 2020, 7:41pm

I think it comes down to being fundamentally bad idea, hence:

https://github.com/JuliaLang/julia/issues/36246

simonbyrne · July 23, 2020, 7:48pm

But to give you a better answer: the biggest speed-up in gcc is in the matrix multiplication benchmark: using -Ofast here would let them reassociate arithmetic operations (to exploit SIMD instructions), and use FMA instructions if they’re available. Julia’s @simd macro and muladd functions will do something similar, without the blatant unsafe-ness of fastmath.

evanfields · July 23, 2020, 8:41pm

Is there a “reader’s digest” version of why @fastmath is bad?

mbauman · July 23, 2020, 8:45pm

It’s an under-specified grab-bag of math-breaking the compiler can do everywhere. Some algorithms actually depend upon math working correctly.

Oscar_Smith · July 23, 2020, 8:47pm

One specific problem with it is that it’s recursive. While at first this feels like a good thing (I want all my code fast), the problem is it can end up applying to stuff in Base or other libraries where it is specifically broken.

tbeason · July 23, 2020, 8:48pm

Relevant issue https://github.com/JuliaLang/julia/issues/36246 [oops… this was already linked]

mbauman · July 23, 2020, 8:55pm

There was once a very long (non-readers-digest) discussion about this, but this post is fairly self-contained:

Especially if you combine it with the fact that you can pick a collection of 2046 numbers that can sum to just about any number depending upon their ordering:

Base’s code actually depends upon smartly re-ordering summations to improve precision that fast math might break.

PetrKryslUCSD · July 23, 2020, 9:16pm

I filed an issue to alert developers responsible for the NASA software comparison benchmark suite to the inappropriateness of including fast math in the results.
https://github.com/JulesKouatchou/basic_language_comparison/issues/9

When fast math option is not used, in the Jacobi iteration benchmark
Julia is faster than the C-language implementation. 6.96 s versus 7.51 s.
Tested with WSL 2, Julia 1.6, generic Linux.

Topic		Replies	Views
Floating point optimizations General Usage question , float	5	474	November 1, 2022
Does the julia intrinsic sum() apply fastmath by default? New to Julia fast-math , sum	15	766	February 21, 2023
Why I love Julia Community announcement	34	4817	October 11, 2019
Performance challenge: can you write a faster sum? Performance simd	25	1167	July 8, 2025
[ANN] New benchmark comparison General Usage benchmark	5	1038	March 13, 2018

Fast math in NASA benchmark

Related topics