A subtle bug?

I got a Float64 number as the p-value of a GLM model. When I tried to display the fitted model, it failed with the following error:

ERROR: p-values must be in [0; 1]
Stacktrace:
 [1] error(::String) at ./error.jl:33
 [2] StatsBase.PValue(::Float64) at /Users/Thomas/.julia/packages/StatsBase/EA8Mh/src/statmodels.jl:430

After some investigation, I found that some p-value floats behave strangely:

julia> p = (GLM.coeftable(mod).cols[4])[5]    # mod is a model from GLM.lm()

julia> typeof(p)
Float64

julia> p == 0
true

julia> p >= 0
true

julia> 1 >= p >= 0
true

julia> 0 <= p <= 1    # strange !!!
false

julia> p == 0.0    # strange !!!
false

In particular, that strange false from 0 <= p <= 1 is what triggers the error message inside StatsBase.PValue().

I don’t know whether this is a bug in GLM or a more general, subtle bug in Julia. I’m using v1.5.3, thanks.

Without an MWE, I think it will be very difficult to debug your code. Can you please post the result of dump(p)?

julia> dump(p)
Float64 0.0

… there’s a long process to produce the data needed for the linear model… I don’t know how to give an MWE …

I can’t reproduce

julia> using StatsBase

julia> StatsBase.PValue(1e-100)
<1e-99

Would you mind reinstalling Julia using a newer version?

This is so far from reasonable that you don’t need to worry about the M (the “minimal” in MWE). Any complete, working example that reliably reproduces the output from your first message would be good enough.

Pretty much the only ways to explain those results are:

  1. It’s not actually the same p everywhere.
  2. Someone has committed an absolutely heinous act of type piracy and redefined floating point comparisons.

I just installed v1.6.1 and the problem persists.

Is it possible for me to save the data as a file and attach it here?

It IS the same p everywhere. That’s why it’s “strange”.

===============================================
I saved the design matrix X and response vector y into a .jld (by using JLD).
Then in a fresh session I loaded them and called mod = GLM.lm(X, y, true); and now the problem is gone?! That means I can’t reproduce the bug by uploading the data…

Can you check the following?

julia> using StatsBase

julia> StatsBase.PValue(0.0)
<1e-99

first on a fresh session and then after running your code.

Both in a fresh session and after running my code:

julia> StatsBase.PValue(0.0)
<1e-99

My workflow is:

julia> mod = GLM.lm(X, y, true);
julia> pvalues = GLM.coeftable(mod).cols[4];
julia> [println(i, " ", pvalues[i], " ", StatsBase.PValue(pvalues[i]) ) for i in 1:length(pvalues)]
1 0.0 <1e-99
2 2.83800947839654e-159 <1e-99
3 0.17914739722604017 0.1791
4 0.00728418728531684 0.0073
ERROR: p-values must be in [0; 1]
Stacktrace:
 [1] error(s::String)
   @ Base ./error.jl:33
 [2] StatsBase.PValue(v::Float64)
   @ StatsBase ~/.julia/packages/StatsBase/DU1bT/src/statmodels.jl:485


julia> p = pvalues[5]  # 1.6.1 shows NaN, 1.5.3 shows 0.0
NaN

julia> p == 0
true

julia> p == 0.0
false

julia> isnan(p)
false

julia> isfinite(p)
true

julia> 0 <= p <= 1
false

julia> 1 >= p >= 0
true
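For reference, under standard IEEE-754 semantics (no fastmath) every ordered comparison involving NaN is false, so a genuine NaN could never produce the mixed true/false pattern above in a normal session:

```julia
p = NaN            # a genuine NaN under normal IEEE semantics

p == 0             # false: any comparison with NaN is false
p >= 0             # false
0 <= p <= 1        # false
1 >= p >= 0        # false

isnan(p)           # true: NaN is correctly detected
```

The fact that some of these comparisons returned true above is itself evidence that something non-IEEE was going on.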

I have no idea; I could not reproduce it here… so I will start saying random things :smiley:

Could you try changing the name of mod to something else? mod is a function in Base. It shouldn’t cause any problems, but I’m running out of ideas…

EDIT: Wait, I don’t understand: p is NaN in 1.6 and yet isnan(p) is false?

Yeah, unfortunately it seems p-value calculations are not quite right in a few key packages; we get the same error with:

using HypothesisTests

a = [12, 10, 7, 6, 3, 1]
b = [11, 9, 8, 5, 4, 2]
MannWhitneyUTest(a, b)

Error showing value of type ExactMannWhitneyUTest{Float64}:
ERROR: p-values must be in [0; 1]
Stacktrace:
...

And this is an old problem: p val > 1 in ExactMannWhitneyUTest · Issue #126 · JuliaStats/HypothesisTests.jl · GitHub

Can you post the output of bitstring(p) for this weird p?

julia> bitstring(p)
"0111111111111000000000000000000000000000000000000000000000000000"
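That bit pattern is exactly the canonical NaN: sign bit 0, exponent all ones, and the quiet bit set in the mantissa. It can be decoded directly to confirm (variable names here are just for illustration):

```julia
# Decode the reported 64-bit pattern back into a Float64
bits = parse(UInt64, "0111111111111000000000000000000000000000000000000000000000000000"; base = 2)
x = reinterpret(Float64, bits)

bitstring(x) == bitstring(NaN)   # true: identical to Julia's canonical NaN
isnan(x)                         # true (under normal IEEE semantics)
```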

Now I know the cause of the problem: --math-mode=fast

I understand that any operation on NaN is unpredictable in fastmath mode, but we need one important exception: isnan(NaN).

Now, in fastmath mode:

julia> isnan(NaN)
false

and this failure to detect NaN is the cause of all the confusion. In this case, the isnan(v) check below fails to catch the NaN, so the constructor throws an error:

struct PValue <: Real
    v::Real
    function PValue(v::Real)
        0 <= v <= 1 || isnan(v) || error("p-values must be in [0; 1]")
        new(v)
    end
end
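For reference, under normal IEEE semantics the isnan(v) branch of that guard accepts NaN, and only genuinely out-of-range finite values raise the error. A standalone sketch of the same check (check_pvalue is a hypothetical name for experimenting, not the StatsBase API):

```julia
# Hypothetical standalone version of the guard above
function check_pvalue(v::Real)
    0 <= v <= 1 || isnan(v) || error("p-values must be in [0; 1]")
    return v
end

check_pvalue(0.5)   # returns 0.5
check_pvalue(NaN)   # returns NaN: isnan(NaN) is true without fastmath
```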

We often have no control over whether a package produces NaN, and on the other hand, a package has no control over whether the user enables fastmath. I strongly recommend that isnan(NaN) return true even in fastmath mode!

Exactly, which is why you should never use global "fast"math in real applications.


But why don’t we allow isnan(NaN) == true in fastmath mode?

“Asking isnan correctly is computationally expensive and slow”


Next thing, you will get some other error somewhere else because of another assumption of IEEE math in another package. It’s just completely unsafe to use this globally when running code you do not control 100% yourself.
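If fast math is wanted at all, the local @fastmath macro is the safer route: it only rewrites arithmetic in code you wrote yourself, leaving packages such as StatsBase and GLM untouched. A minimal sketch:

```julia
# Opt in locally instead of globally with --math-mode=fast:
# only this loop's arithmetic gets fast-math flags.
function fastsum(xs)
    s = 0.0
    @fastmath for x in xs
        s += x
    end
    return s
end

fastsum([1.0, 2.0, 3.0])   # 6.0
```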


…it is just sad to hear that … :cold_sweat:

Well… it seems I can only override my own isnan() to cope with the issue…
One question: how can I detect whether the current session was started with --math-mode=fast? Thanks.
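Two possible approaches, both sketches under stated assumptions: a NaN test based on the bit pattern (which fast math cannot fold away, unlike the x != x self-comparison that isnan compiles to), and inspecting the unexported Base.JLOptions() struct, whose fast_math field reflects the --math-mode flag (an internal detail that may change between Julia versions):

```julia
# NaN by bit pattern: exponent all ones plus a nonzero mantissa.
# Masking off the sign bit, any value above Inf's bits is a NaN.
bitnan(x::Float64) = (reinterpret(UInt64, x) & 0x7fffffffffffffff) > 0x7ff0000000000000

bitnan(NaN)   # true
bitnan(Inf)   # false
bitnan(1.0)   # false

# Unexported internal: fast_math == 1 when started with --math-mode=fast
fastmath_on() = Base.JLOptions().fast_math == 1
```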