Did Julia community do something to improve its correctness?

jlapeyre · August 7, 2023, 12:01am

At first wasn’t very concerned about this example. It’s typical of examples meant to illustrate the rampant incorrectness.

In general the function is called like map!(c, a, b). In the examples in the doc string, the input and output are clearly not aliased. In fact c is freshly initialized. Many numerical libraries work like this. It is very common to read something like “output must not be aliased to input”. So if I see the doc to map!(c, a, b) and thought, “well it didnt say I cant alias, I guess I just do it and move on”, I’m asking for trouble. If I didn’t read the doc string, I’d never assume it. I’d check it first or ask or something.

If this were the end of the story, I’d say it’s not such a big deal.

The bigger problem, apparently, is that

behavior with aliasing is not documented.
behavior with aliasing was not properly tested.
behavior with aliasing changed in a (non-breaking) release.

Now there’s nothing I can do to avoid a bug in my code, even if I’m careful.

It’s possible that documenting it more carefully may have avoided this. It would have made it less likely that adequate tests were overlooked, and that a breaking change would be introduced.

Note that although the previous behavior has been restored, it’s still not documented.

More generally: I won’t put much stock in the idea that Julia is really buggy based on anecdotes. I’d have to see some attempt at an analysis. And discussion of which languages you’re comparing to.

EDIT:

github.com/JuliaLang/julia

Document aliasing in `map!`

opened 12:24AM - 07 Aug 23 UTC

jlapeyre

doc

`map!` should either support aliasing or not and the choice should be documented… and tested. (It could be documented that it's undefined, but probably a bad idea) It may be better tested after the PR below merged, but the doc string should be updated. Maybe auditing a bit for other similar functions that lack this kind of documentation would be a good idea. I'll add them here if I find them. See * #46352 * #50780 https://discourse.julialang.org/t/did-julia-community-do-something-to-improve-its-correctness/102515/86

ryofurue · August 7, 2023, 4:53am

I have long had a question related to this. I’m not in a position to let other people to use my code. All I write are very small and ad-hoc. But, some of my tools may grow enough to be interesting to other people in the future.

So, I wonder: Why am I writing for j in 1:size(arr,2); for i in 1:size(arr,1), assuming that the array index starts from 1 and is contiguous, even when I don’t need to assume that.

So, my question is, why don’t you recommend writing everything as general as possible if that doesn’t increase the size of the code too much and if that doesn’t reduce the capability of the functions you are writing?

So,

Using abstract types requires significantly more testing than using concrete types.

I wonder if you can elaborate on that? Do you mean that testing is more difficult even if you test only on the few concrete types that you would be anyway using if you wrote a concrete interface?

. . . Anyway, we need some textbook or document that describes “the best practices”. Everything about generality seems confusing.

tim.holy · August 7, 2023, 8:02am

This is a great list @adienes, thanks. ~~Care to try tackling some of these yourself? Those who notice problems from such issues are usually best-motivated to start fixing them.~~ And thanks for the several pull requests, too!

ufechner7 · August 7, 2023, 6:44pm

It was just removed from the 1.10 milestone (see: sum!, prod!, any!, and all! may silently return incorrect results · Issue #39385 · JuliaLang/julia · GitHub), which I see as a sign of ignorance of leading team members for correctness issues…

mbauman · August 7, 2023, 7:19pm

I understand your frustration, but these are internal organizational tools that have very clearly defined processes and requirements. This does not mean the issue is not valued. It doesn’t mean that leading team members don’t care about correctness. It doesn’t mean it won’t get fixed. It doesn’t mean that its fix (or mitigation or …) won’t get into 1.10, either.

greatpet · August 7, 2023, 7:26pm

Stroustrup said that C++ protects you from accidents not sabotage. That should be applicable to most other languages, too.

ufechner7 · August 7, 2023, 7:27pm

There are currently 25 bugs in the issue tracker that have the label “correctness issue”. Two of them are on the list of the bugs to be fixed for 1.10., which is nice… But it also means we probably need to wait another 6 years or 12 releases until all of them are fixed…

aplavin · August 7, 2023, 7:30pm

An easy, and pretty reasonable, solution to the aliasing “bug” would be to add notes to all such functions that “The mutated argument must not alias any other argument”. I personally wouldn’t really expect such a call that involves aliasing between arguments to behave in any particular way, unless explicitly stated in the docsting.

algunion · August 7, 2023, 7:33pm

It seems that the issue has been open since January '21 - and the behavior was observed even in 1.5.3.

I wouldn’t expect the issue suddenly becoming a release-blocking issue.

However, some low-hanging fruits are more worrying: I understand the bug fix not being a release-blocking priority, but why not add the minimal documentation to help people avoid the issue in the meantime?

On the same note - it has been known for a while that @async is to be avoided in favor of @spawn to the extent that we have a clear warning against using @async: however, the new users reading the manual are clearly presented with learning material that goes against the documentation.

These easy-to-achieve goals that seem to reflect caring about new users are things that get me worried: the experienced users can get along and just avoid these traps (having access to a kind of special lore or simply being familiar to a larger extent with the up-to-date documentation), while new Julia users (who are primarily reading the manual before going into more depth) are actually taught to use the language in the wrong way.

ufechner7 · August 7, 2023, 7:40pm

Well, if it is open for 2.5 years it should get fixed finally, don’t you think?
If we are still not doing it we are ignorant for correctness issues, that is all I say…

mbauman · August 7, 2023, 7:42pm

Again, you’re misreading what it means to be (or not be) on a particular milestone release. Being placed on a milestone means that it’s a release blocker — that it must be fixed for the release. Not being on a milestone does not mean the converse.

ufechner7 · August 7, 2023, 7:43pm

So why was it then added to the milestone in the first place?

mbauman · August 7, 2023, 7:54pm

Because folks do want to see it fixed. People do care. And it’s very tempting to coopt the milestone for that sort of thing. This happens occasionally.

algunion · August 7, 2023, 7:58pm

I understand how it feels, especially when we encounter issues that are messing up our use cases.

My personal pain point: I don’t feel at all comfortable with the idea of having scenarios where the responsivity of tasks running HTTP listeners (on their own dedicated thread) are actually dependent on the main thread not doing work - it is somehow mindblowing.

However, I wouldn’t say that the leading team doesn’t care about the issue (I am more concerned that I didn’t do a good enough job to convey the practical implications of the issue). I have no idea if there are plans to fix that in the near future - the only clear thing is there are signs that awareness exists, and the root cause is known.

In fact, I am convinced that, even if only as a matter of being proud of their work, the leading team is clearly focused on doing a great job - but we cannot expect their priorities to be perfectly aligned with those of individual language users (and where there is such alignment, we cannot expect them to assign the same level of urgency to individual issues).

gdalle · August 7, 2023, 8:04pm

Many people here are pointing out that there is an easy (partial) fix for some of these issues, namely adding doc warnings. In fact, the fix is so easy that it doesn’t take being a “leading team member”:

github.com/JuliaLang/julia

Add some aliasing warnings to docstrings for mutating functions

JuliaLang:master ← gdalle:doc_aliasing_warnings

opened 07:55PM - 07 Aug 23 UTC

gdalle

+65 -5

Functions like `sum!(B, A)` have undefined behavior when `A` and `B` share memor…y. We might fix that in the long run, but in the short run, doc warnings are better than nothing. Related issues: - #39385 - #50814 See also: https://discourse.julialang.org/t/did-julia-community-do-something-to-improve-its-correctness/102515/ - [x] `accumulate!` - [x] `all!` - [x] `any!` - [ ] `append!` (unsure) - [x] `asyncmap!` - [ ] `broadcast!` - [x] `circcopy!` - [x] `circshift!` - [ ] `clamp!` - [ ] `conj!` - [ ] `copy!` - [x] `copyto!` - [x] `count!` - [x] `cumprod!` - [x] `cumsum!` - [ ] `delete!` - [ ] `deleteat!` - [ ] `digits!` - [ ] `empty!` - [x] `extrema!` - [ ] `fill!` - [ ] `filter!` - [x] `findmax!` - [x] `findmin!` - [ ] `get!` - [ ] `hex2bytes!` - [ ] `insert!` - [x] `intersect!` - [x] `invpermute!` - [x] `keepat!` - [x] `kron!` - [x] `map!` - [x] `maximum!` - [ ] `merge!` (unsure) - [ ] `mergewith!` (unsure) - [x] `minimum!` - [ ] `modifyproperty!` - [ ] `partialsort!` - [x] `partialsortperm!` - [x] `permute!` - [ ] `permutedims!` - [ ] `pop!` - [ ] `popat!` - [ ] `popfirst!` - [ ] `prepend!` (unsure) - [x] `prod!` - [ ] `push!` - [ ] `pushfirst!` - [ ] `put!` - [ ] `read!` - [ ] `readbytes!` - [ ] `replace!` - [ ] `replaceproperty!` - [ ] `resize!` - [ ] `reverse!` - [x] `setdiff!` - [x] `setindex!` - [ ] `setproperty!` - [ ] `sizehint!` - [ ] `sort!` - [x] `sortperm!` - [x] `splice!` - [x] `sum!` - [ ] `swapproperty!` - [x] `symdiff!` - [ ] `take!` - [x] `union!` - [ ] `unique!` - [x] `unsafe_copyto!` - [ ] `unsafe_store!`

My two cents is that we would all be much better off if we poured our frustrations into PRs, however small, rather than just complaints.

algunion · August 7, 2023, 8:17pm

I agree - but there is no clear threshold between severe matters and the simple PRs you mention. For example, I understand that there is somebody actively involved in rewriting the manual (sorry, I don’t have a reference at hand). At that point, if I want to contribute and invest some time in fixing obvious outdated stuff in the manual, I have no idea if that work is for nothing.

Also, although I feel pretty confident at this point in a few areas of the language, I feel that the manual should reflect the very best practices that are vetted by those who can add a kind of official expert stamp to that work.

And this is adding even more weight to my point - the manual, which should actually reflect the best practices and put the newcomers on the right track is outdated and recommends practices prohibited by the documentation.

So it is not very clear when we should just push for changes as outsiders or we should become part of some active force for change. There is no clear guideline on that.

However - I think complaining should be welcomed: if this language is going to grow and (also) be used in areas that are going to depart from technical/mathematical computing - we should expect an increase in the complaints/PR ratio. And I would say that would be a very good sign

gdalle · August 7, 2023, 8:24pm

The point I was trying to make is that most of the people in this conversation are not “outsiders”, far from it. We may not have merge rights for the Julia repo, but I think we often underestimate our ability to help. And by “we” I mean the active Julia users that like the language enough to keep reading when a thread gets past 50 messages.

To me, the vetting is the PR review. If my contribution is not worthy, it will be rejected or just languish in the endless pit of despair where code goes to die. So usually when I open a PR, my primary concern is not complete exactness: I mostly try to get things moving.

If someone is rewriting the manual that’s amazing news, but it doesn’t mean development has to pause. And it’s probably someone careful enough to merge the latest changes done by other people into their branch. At least I would suppose so.

gdalle · August 7, 2023, 8:26pm

Me when I fool myself into thinking the 2 lines of docs I wrote will make the life of core devs easier:

samwise-gamgee-i-cant-carry-it-for-you-but-i-can-carry-you

algunion · August 7, 2023, 8:49pm

Maybe I didn’t use the right word there - I meant a kind of Julia repo outsiders.

It would be nice to see this kind of message delivered by the core developers - and that doesn’t imply I don’t appreciate this coming from you.

Not sure if this holds for many other Julia users, but my admiration for these people is also somehow intimidating when comes to starting PRs.

Obviously, the PR review is literally a vetting process. But specifically when it comes to writing manual stuff, I feel it is somehow special - I mean, it would be a great learning experience (because I feel good enough is not enough, and the one making the contribution(s) should be somehow morally obligated to ensure both complete exactness and not leaving out relevant information).

Imagine that the manual might be the first contact with the language for many new users.

But enough about this - I think we are on the same page, and even more - an unsuccessful PR would still be a learning experience (if not just ignored or rejected without a stated reason).

This thread already spawned a topic (the reverse branch). I don’t insist on spawning the “how to contribute to Julia repo if you suffer from acute impostor syndrome” topic.

JeffreySarnoff · August 7, 2023, 8:58pm

I learned important aspects of Julia by [hesitantly] submitting a PR and then working with the helping hands that be in PR-land to take that through to acceptance. It took some determination; at that time, eagerness worked best.

Topic		Replies	Views
Julia 1.11.1 gives different results from 1.10.5 General Usage	56	2133	December 7, 2024
Correctness and Multiple Dispatch (Help explain to a julia noob) Community question	13	1033	April 30, 2025
Correctness bugs Offtopic gripes , griping	8	1009	November 29, 2024
Julia v0.6 to v1.10 appreciation post Community	3	679	October 7, 2024
Discussion on "Why I no longer recommend Julia" by Yuri Vishnevsky Community discussion	298	46714	September 9, 2022

Did Julia community do something to improve its correctness?

Related topics