Optim returns slightly different result in unit testing environment

I’ve been scratching my head for the past few hours with no answer. I hope someone has a clue.

I added some unit test cases for the BoxCoxTrans.jl package (unregistered), but the test fails only in the unit testing environment, due to a numerical precision problem. If I run the exact same code in the REPL, the results match the test code exactly.

If anyone wants to replicate the problem, this branch exhibits the problem with atol=1e-9.

(v0.7) pkg> add https://github.com/tk3369/BoxCoxTrans.jl#tk/precision-issue

Are you running into this (from NEWS.md)?

  • isapprox(x,y) now tests norm(x-y) <= max(atol, rtol*max(norm(x), norm(y))) rather than norm(x-y) <= atol + ... , and rtol defaults to zero if an atol > 0 is specified (#22742).
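
A quick illustration of that change (made-up numbers, just to show the effect of specifying atol):

julia> isapprox(1.0, 1.0 + 1e-8)                # default rtol = √eps() ≈ 1.5e-8
true

julia> isapprox(1.0, 1.0 + 1e-8; atol = 1e-9)   # specifying atol sets rtol to zero
false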

This bit me a few days ago.

EDIT: maybe not, since this only surprised me when updating from Julia 0.6 to 0.7, and it looks like your package used 0.7 from the get-go.

No… A little more information below.

The log shows that Base.Test calculated a lambda of -0.9917203620435803.

Test Failed at /Users/tomkwong/.julia/dev/BoxCoxTrans/test/runtests.jl:32
  Expression: ≈(λ, -0.99172, atol=precision)
   Evaluated: -0.9917203620435803 ≈ -0.99172 (atol=1.0e-9) 

If I run it from the REPL, I get:

julia> lambda(𝐱)
-0.9917203225477127
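
For what it’s worth, the two computed values differ by only about 4e-8, but that is still well above the 1e-9 tolerance (and each of them is a few times 1e-7 away from the hard-coded -0.99172):

julia> abs(-0.9917203620435803 - (-0.9917203225477127)) > 1e-9
true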

I can confirm what you’re seeing, both the test failure and the REPL result. But maybe (probably) I’m missing something: wouldn’t the test also fail if the REPL result were used?

Yes, it fails when I use the REPL result in the test script.

For now, I’m working around the problem by using atol=1e-4 (see here), which is loose enough for the test to pass. But it bothers me, because the calculation shouldn’t differ depending on whether I run it from the REPL or not…
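
The workaround is simply a looser tolerance in the test itself; roughly this (a sketch of the relevant line, the real one lives in test/runtests.jl):

@test λ ≈ -0.99172 atol=1e-4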

Bounds checking being on or off might cause, e.g., SIMD to be used or not used, which can slightly change the answer.
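
(Pkg.test runs the test script with --check-bounds=yes, so @inbounds annotations are ignored there, which can be enough to block vectorization.) A generic illustration, not specific to this package, of how a different summation order moves the last bits:

x = rand(10_000)
sum(x) == foldl(+, x)   # usually false: pairwise/SIMD order vs strict left-to-right
sum(x) ≈ foldl(+, x)    # true: equal to within the default relative tolerance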


I agree with @kristoffer.carlsson: you should not expect exactly identical results in a different environment. Pick the tolerance based on what you asked your algorithm to achieve.

Specifically, consider passing an explicit tolerance in the call to optimize, possibly exposed to the user (with a default), and just test that the result is within that tolerance.
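
Something along these lines (a rough sketch, not BoxCoxTrans’ actual code; the function name estimate_lambda, the search interval, and the sample data are made up):

using Optim, Test

# Estimate the Box-Cox λ by maximizing the profile log-likelihood,
# with the optimizer tolerance exposed as a keyword argument.
function estimate_lambda(x; lower = -3.0, upper = 3.0, abs_tol = 1e-8)
    logx = log.(x)
    n = length(x)
    function negloglik(λ)
        y = λ == 0 ? logx : (x .^ λ .- 1) ./ λ
        σ² = sum(abs2, y .- sum(y) / n) / n
        -(-n / 2 * log(σ²) + (λ - 1) * sum(logx))   # negated for minimization
    end
    res = optimize(negloglik, lower, upper, Brent(); abs_tol = abs_tol)
    Optim.minimizer(res)
end

# Pick the test tolerance from the tolerance the optimizer was asked for,
# not from whatever digits a particular run happened to print.
x = exp.(randn(200))                         # positive sample data
λ_loose = estimate_lambda(x; abs_tol = 1e-6)
λ_tight = estimate_lambda(x; abs_tol = 1e-12)
@test λ_loose ≈ λ_tight atol=1e-5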