Why is it reliable to use open source packages for research?

GregVernon · May 9, 2024, 5:33pm

I will say, however, that Matlab has built up a lot of trust. They have incredibly responsive customer service, post known issues publicly, provide regular bug fixes, etc. Other closed-source packages have earned similar amounts of trust: LS-DYNA, Abaqus, COMSOL, etc.

WuSiren · May 9, 2024, 5:42pm

I myself feel safer using commercial software like MATLAB than any open-source software, although the latter is indeed more transparent and appealing. Don’t know why.

cjdoris · May 9, 2024, 5:53pm

To quote Douglas Adams: because it’s reassuringly expensive.

ufechner7 · May 9, 2024, 6:24pm

I encountered very bad, undocumented bugs in Matlab/Simulink… Example: if you try to linearize a Simulink system that has two or more algebraic loops, you get completely wrong results without any warning. With ModelingToolkit.jl it just works. I would say your feeling is misleading…

jar1 · May 9, 2024, 6:40pm

I (like some, unlike others) feel that Julia usage has higher risk of producing incorrect results than I’d like. Given that, I think it’s good to calibrate fear/uncertainty/doubt levels so that (1) new users know they should be extra careful if they can afford that risk and go elsewhere if they can’t and (2) developers see there’s still concern about this and know that there is user demand for further investment in quality assurance.

But since FUD is also annoying, I expect there’s a balance to be found between too much and too little griping.

oheil · May 9, 2024, 6:48pm

This reminds me of the Pentium FDIV bug (see Wikipedia).

I don’t think that Julia has a higher risk of incorrect results. Not more than R or Python. How is this higher risk argued? In the end it’s native code produced by LLVM. I can’t imagine Julia being any different from other things compiled by LLVM. Can someone enlighten me as to where this higher risk comes from?

mbauman · May 9, 2024, 6:58pm

Laying out concrete issues and appropriately setting expectations is not FUD — it’s the exact opposite.

The Julia language is highly permissive. It allows you to combine packages that have never been combined before. That might work and be amazing. It might error. Or it might silently do something wrong. I don’t see a higher risk with individual packages as compared to other languages, but the ecosystem’s high composability poses a more distinctive risk (and benefit!). It is something you need to be aware of, and you should test/validate your use case if it isn’t already covered by the test suites.
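A minimal illustration of the "silently do something wrong" case, using only Base. The function `naive_mean` is hypothetical and does not come from any package mentioned in this thread: it stands in for generic code that was only ever exercised with floating-point input and is then combined with an element type nobody tested.

```julia
# Hypothetical generic function, written and tested with Float64 input in mind.
function naive_mean(xs)
    acc = zero(eltype(xs))   # accumulator has the element type of the input
    for x in xs
        acc += x             # for Int8 this addition wraps around on overflow
    end
    return acc / length(xs)
end

float_result = naive_mean([100.0, 100.0])   # the case the author tested
int8_result  = naive_mean(Int8[100, 100])   # a combination nobody tested

# Validate against an independent reference instead of trusting the result.
reference = sum(Int, Int8[100, 100]) / 2

println((float_result, int8_result, reference))  # prints (100.0, -28.0, 100.0)
```

No error or warning is raised for the `Int8` input; only the comparison against a reference reveals the problem.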

oheil · May 9, 2024, 7:02pm

And that’s what I strongly argue for here, for any software you use as a researcher.

liuyxpp · May 10, 2024, 1:15am

Where does your conclusion come from? This kind of baseless conclusion about Julia is so harmful. Please don’t say it again unless you have concrete evidence when comparing to at least one other language in a massive way.

yurivish · May 10, 2024, 3:16am

Is it FUD if it’s true?

A few months ago I happened to run into a bug that’s the perfect example of exactly the scenario @eweiss outlined in his post.

I was using Julia for extremely basic data analysis – finding the maximum value in a column of a CSV file – and discovered that the answer was simply wrong. I only spotted the bug because I manually inspected the results and the number seemed to me to be too low.

I filed an issue that outlined the bug, identified its cause, and contained a reproducible test case. In it, I wrote:

The implementation is here and is tested, though the test only tests a simple special case for which the values and indices are exactly equal.

jling · May 10, 2024, 3:45am

github.com/JuliaData/SentinelArrays.jl

`argmax` returns incorrect results

opened 08:48PM - 06 Apr 24 UTC

closed 12:22PM - 10 May 24 UTC

yurivish

This package incorrectly implements [`argmax`](https://docs.julialang.org/en/v1/…base/collections/#Base.argmax) for `ChainedVector`s. It returns the maximum value, rather than its index.

```julia
julia> using SentinelArrays

julia> arrays = [ [18, 70, 92, 15, 65], [25, 14, 95, 54, 57] ];

julia> cv = ChainedVector(arrays);

julia> argmax(cv)
95

julia> argmax(collect(cv))
8
```

The implementation is [here](https://github.com/JuliaData/SentinelArrays.jl/blob/fa840f994ae821d921a9973fbd5e244d35102b1c/src/chainedvector.jl#L838) and is [tested](https://github.com/JuliaData/SentinelArrays.jl/blob/fa840f994ae821d921a9973fbd5e244d35102b1c/test/chainedvector.jl#L253), though the test only tests a simple special case for which the values and indices are exactly equal.

This issue was initially reported [upstream](https://github.com/JuliaData/CSV.jl/issues/1128).

It looks like it’s worse than that: not only are argmax/argmin taking the wrong element from findmax/findmin, but findmin itself looks completely wrong (findmax seems to be fine).

This happens to the best of us, but yeah: this particular test file contains many tests, yet most of them are basically useless checks, primarily because of this value = index property and partly because the tests are just super redundant…
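To make the value = index problem concrete, here is a small sketch in plain Base Julia. `buggy_argmax` is a hypothetical stand-in that mimics the reported behavior (returning the maximum value instead of its index); it is not the SentinelArrays.jl code.

```julia
# Hypothetical buggy implementation: returns the maximum value, not its index.
buggy_argmax(v) = maximum(v)

# Test data where every value equals its own index: the bug is invisible.
special = [1, 2, 3, 4, 5]
passes_special = buggy_argmax(special) == argmax(special)

# Data where values and indices differ exposes it immediately.
general = [18, 70, 92, 15, 65]
passes_general = buggy_argmax(general) == argmax(general)

println((passes_special, passes_general))  # prints (true, false)
```

A test suite built only on inputs like `special` can contain any number of checks and still never distinguish the value from its index.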

now I’m trying to recall if I ever used argmin/findmin/argmax/findmax for publications since it’s definitely used in our data I/O

nilshg · May 10, 2024, 7:49am

Not to derail this thread further (can a broad question in Offtopic be derailed?), but @Sukera, somewhat following on from our discussion on Zulip a few weeks ago, is this something that Supposition.jl would have helped with?

Sukera · May 10, 2024, 8:01am

Yes:

```julia
julia> using Supposition, SentinelArrays

# generate `Vector{UInt8}` with at most 100 elements
julia> data_vector = Data.Vectors(Data.Integers{UInt8}(); max_size=100);

# generate a vector of vectors, with 1 to 10 vectors
julia> vecs = Data.Vectors(data_vector; min_size=1, max_size=10);

# create a ChainedVector
julia> chainedvecs = map(vecs) do vs
           # can't reduce over empty arrays, so ignore them
           all(isempty, vs) && reject!()
           ChainedVector(vs)
       end;

# Does the found index actually refer to the maximum?
julia> isargmax_maximum(cv) = cv[argmax(cv)] == maximum(cv)

# Run the fuzzing/property test
julia> @check db=false isargmax_maximum(chainedvecs);
┌ Error: Property errored!
│   Description = "isargmax_maximum"
│   arg_1 =
│    1-element ChainedVector{UInt8, Vector{UInt8}}:
│     0x00
│   exception =
│    BoundsError: attempt to access 1-element ChainedVector{UInt8, Vector{UInt8}} at index [0x00]
│    Stacktrace:
│     [1] throw_boundserror(A::ChainedVector{UInt8, Vector{UInt8}}, I::Tuple{UInt8})
│       @ Base ./essentials.jl:14
│     [2] checkbounds
│       @ ./abstractarray.jl:699 [inlined]
│     [3] getindex
│       @ ~/.julia/packages/SentinelArrays/1kRo4/src/chainedvector.jl:94 [inlined]
│     [4] isargmax_maximum(cv::ChainedVector{UInt8, Vector{UInt8}})
│       @ Main ./REPL[39]:1
└ @ Supposition ~/Documents/projects/Supposition.jl/src/testset.jl:287
Test Summary:    | Error  Total  Time
isargmax_maximum |     1      1  0.7s
```

The important part is checking the correct/expected properties (in this case that the return value of argmax is the index of the maximum element). It’s possible to do that with just naively generated random data; Supposition.jl “just” does the job of reducing a found problem to a minimal example (here, ChainedVector([[0]])).
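For comparison, a rough sketch of the "naively generated random data" variant using only the `Random` standard library. The helper `find_counterexample` is a hypothetical name introduced here, and the check runs against a plain `Vector` as a stand-in; to test another array type, one would wrap `v` in it before calling the property. Unlike Supposition.jl, this does no shrinking, so any counterexample it returns may be large.

```julia
using Random

# Property: the index returned by argmax must point at the maximum element.
isargmax_maximum(v) = v[argmax(v)] == maximum(v)

# Return the first randomly generated counterexample, or `nothing` if none is found.
function find_counterexample(prop; trials=1_000, rng=Random.default_rng())
    for _ in 1:trials
        v = rand(rng, UInt8, rand(rng, 1:100))  # non-empty vector of random length
        prop(v) || return v
    end
    return nothing
end

rng = MersenneTwister(1234)
counterexample = find_counterexample(isargmax_maximum; rng=rng)
println(counterexample === nothing ? "no counterexample found" : counterexample)
```

For Base's `Vector` the property holds, so no counterexample is found; the value of the approach lies in running the same property against less-tested array types.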

This is very off topic now, but I think part of the problem with argmax in particular is that it does return the element, rather than the index, in the argmax(f, col) case. See "Unintuitive findmin and findmax" (Issue #39203, JuliaLang/julia) and "`argmax` behaves differently from other higher-order functions" (Issue #48502, JuliaLang/julia) for discussion, and "Add `indmin`/`indmax`" by Seelengrab (Pull Request #41339, JuliaLang/julia) for my (old, non-mergeable) attempt at clarifying the situation.
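A short sketch of the difference in plain Base Julia, assuming Julia 1.7 or later (where the two-argument methods are available). The variable names are only for illustration.

```julia
x = [-3, 1, 2]

idx_of_max      = argmax(x)        # 3: index of the largest element (2)
elem_max_abs    = argmax(abs, x)   # -3: the element that maximizes abs, not an index
val_and_idx     = findmax(abs, x)  # (3, 1): maximal abs value and the index where it occurs
idx_via_mapping = argmax(abs.(x))  # 1: index of the largest absolute value

println((idx_of_max, elem_max_abs, val_and_idx, idx_via_mapping))
```

So the one-argument form answers "where?", while the two-argument form answers "which element?", which makes it easy to mix up the two when writing a custom method.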

mbauman · May 10, 2024, 12:11pm

Great to have you back! Obviously that’s not FUD — it’s a concrete issue that is clearly a bug. Thanks for filing the issues!

Tamas_Papp · May 10, 2024, 2:37pm

OTOH, Julia is outstanding when it comes to relatively small packages of generic code, which can be very thoroughly tested and (ideally) expose a simple interface that the user can grasp in its entirety. You make a very valid point, but it is unclear whether the net effect would be more or fewer bugs.

In any case, I think the license for the code (ie whether a library is FOSS) is just one dimension for evaluating its reliability. Code maturity is another, and closely related, the number of users. Personally, I also put a lot of weight on the principal authors and maintainers, as there are quite a few people in the Julia community whose code I would trust a lot because I have seen their coding style.

One cannot make blanket statements about either closed or open source software. I agree with @oheil that the only way to build trust in software is to evaluate it, so if I end up using a package for more than exploratory work I usually look at the source code, test coverage, testing style, and in some cases end up adding tests in PRs. This way the effort I have put into evaluating code ends up benefiting others.

mihalybaci · May 10, 2024, 5:32pm

This is why I like open source software, and following issues on Discourse. Here we have an example of a bug in argmax: I can go look at the GitHub issue, then follow that to the PR that fixes it, and see that it was merged ~5 hours ago. I can actually watch the process work. Sometimes it doesn’t work as fast as wanted, but closed source doesn’t necessarily work fast either.

Things improve over time, and it is reassuring to watch it happen in the open. I mean, how many bugs were in Numeric before it became NumPy? (that was a rhetorical question)

mkitti · May 11, 2024, 10:59pm

I worked on generating and adding additional tests for SentinelArrays.jl here:

In the process I found another bug with findmax:

While I am glad the original issue was fixed, I’m wondering why the attention to the package was so narrow as to ignore the other pull request addressing perhaps the deeper issue: insufficient testing. I’m sure it was an oversight due to limited time.

Certainly when presented with a concrete bug the focus must not only be on fixing that bug but addressing why that bug was able to exist in the first place without being caught by CI.

tecosaur · May 12, 2024, 2:00am

It would also be good if tools like Supposition.jl could be more widely adopted and so help avoid “oops, we tested a special case where the bug doesn’t occur” cases.

xlxs4 · May 12, 2024, 1:30pm

FWIW, I recall an old blog post trying AFL with Julia: Bugs in Julia with AFL and C-Reduce · maleadt

As also highlighted in the post, a possible avenue could be a Julia package wrapping AFL++

o314 · May 12, 2024, 5:03pm

In civil engineering, it is common practice to use two equivalent software packages from different vendors and check their results against each other when doing quantitative mechanical design.

It is not a question of open or closed source, but of consistent and regular checking.
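The same idea can be sketched in code as a cross-check between two independently written implementations. The two functions below are hypothetical stand-ins for the "two vendors": a two-pass sample variance and Welford's single-pass algorithm, compared within a tolerance chosen here only for illustration.

```julia
# Implementation A: two-pass sample variance.
function variance_two_pass(xs)
    n = length(xs)
    m = sum(xs) / n
    return sum(x -> (x - m)^2, xs) / (n - 1)
end

# Implementation B: Welford's single-pass algorithm.
function variance_welford(xs)
    n = 0
    running_mean = 0.0
    m2 = 0.0
    for x in xs
        n += 1
        delta = x - running_mean
        running_mean += delta / n
        m2 += delta * (x - running_mean)
    end
    return m2 / (n - 1)
end

data = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]
a = variance_two_pass(data)
b = variance_welford(data)

# Agreement does not prove correctness, but disagreement is a clear signal to investigate.
agree = isapprox(a, b; rtol=1e-12)
println(agree)
```

Because the two implementations share no code, a bug would have to appear in both, in the same way, to go unnoticed.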
