Why do packages run continuous integration tests on Julia nightly?

I am kind of confused here. To my understanding, the main argument for testing against Julia nightly is to find bugs in the Julia language itself (or some other core dependencies), not to find bugs in your own package. Any nightly failure that is truly caused by my package must also show up in all the other builds. And since I already check every failure in the normal builds, the nightly tests add no value there.

Within this whole discussion I am focused on testing non-breaking nightly releases, i.e., not going from 1.x to 2.x. In this scenario of non-officially-breaking changes, we are hoping that Julia's own tests will catch any unintended breakage. The point of testing nightly is just to further help ensure that, and to further help the main Julia development team. At least, that's the only reason I have ever found for running nightly tests.

As I personally do not have the capacity to go through the many false positives of the nightlies, I cannot justify the computing cost of running them. In fact, even if I did have the time to go through the nightly failures, I am not sure that would justify the extra computing cost, as my packages only use the formally exported, and likely very well tested, API. This means that in the overwhelming majority of cases any Julia bug I could detect would have already been detected by the main language repo's tests.

And to reiterate, this argument holds for the front-end packages that I am involved with, which means that:

This is not at all tricky in my eyes. Packages that believe they are not sticking to the well-tested, exported API should run nightly. The majority of packages don't have to, though.

I agree, and would never argue that there is zero value. However, I am weighing this value against the cost it comes with, and I find the value much too low to justify that cost.

3 Likes

No. Julia 1.10 moved to libcurl 8.0.1. Now all packages that have a 7.x compat bound on it (almost all, I would guess) will fail on Julia 1.10 but run on Julia 1.9.
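
In case it helps to see the mechanics, here is a minimal sketch of why a caret-style compat entry such as LibCURL_jll = "7.73.0" cannot be satisfied once libcurl 8.x ships with Julia. It pokes at Pkg internals (semver_spec), so treat it as illustrative only; the exact location of that function may differ between Pkg versions.

using Pkg
# Caret compat: "7.73.0" means the half-open range [7.73.0, 8.0.0).
spec = Pkg.Types.semver_spec("7.73.0")
v"7.88.1" in spec   # true:  any 7.x release at or above 7.73.0 is accepted
v"8.0.1"  in spec   # false: the libcurl that Julia 1.10 ships is excluded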

If breakages due to using internals are known unknowns, we also have to account for unknown unknowns. Aside from @joa-quim's examples, there are many instances of packages making relatively benign assumptions about functionality that break with a minor Julia update. "failed to start primary task" with Julia 1.9 and nthreads(:interactive) > 0 · Issue #21 · JuliaFolds/FoldsThreads.jl · GitHub is a recent example that comes to mind. Packages that are somewhat sensitive to inference quality (e.g. StaticArrays), subtyping, inlining, etc. are another.

Now, does that mean everyone needs to test on nightly? No, I think we're in agreement there. But testing alphas/betas/release candidates when they come out (which, to your point, would have a much smaller impact and fewer false positives to look through)? I think packages which are at risk of breakage for one or more of the aforementioned reasons should do that. My question was whether doing so is even possible at the moment, because my understanding is that GHA will happily run jobs with 1.9rc well after 1.9 stable is released, and the YAML format doesn't provide a lot of flexibility to say "only run this job if there's a newer pre-release than the current stable release".

Switching gears, another concern that was touched on above is that the mechanism for checking for bugs in the language (PkgEval) has its own "boy who cried wolf" problem. If the last line of defense is not extremely reliable or frequently kicks in too late in the release/dev cycle, the pragmatic solution is a defense-in-depth approach. That has generally taken the shape of packages running CI on nightly. It would be great to have a more sustainable alternative to this, and I'd love to hear how PkgEval could be improved to avoid some of the issues noted in this thread, but my (perhaps incorrect) impression is that there are no silver bullets there.

2 Likes

Besides the failure modes already mentioned, your package may also unknowingly rely on a bug.
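
As a contrived sketch of a closely related pattern (depending on an implementation detail rather than a documented guarantee), here is a test that asserts on the wording of a Base error message. It passes today, but a minor Julia release is free to reword that message and break it, with no "breaking" change anywhere in the public API:

using Test
# Deliberately fragile: the exact DomainError text is not part of Julia's API.
err = try
    sqrt(-1.0)
catch e
    e
end
@test err isa DomainError
@test occursin("complex argument", sprint(showerror, err))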

No. Julia 1.10 moved to libcurl 8.0.1. Now all packages that have a 7.x compat bound on it (almost all, I would guess) will fail on Julia 1.10 but run on Julia 1.9.

(Re: this specific issue, unrelated to this thread, maybe someone could trigger a JuliaRegistrator PR fixing those compat issues on all matching repos? Similar to what Tim Holy did when he automatically generated PRs deprecating SnoopPrecompile => PrecompileTools)

I haven't looked at how the Docker images are built and hosted, but maybe we could change that infrastructure to follow Juliaup's conventions, so you could specify that you want the image for release, etc.?

1 Like

Unrelated? I found it exactly because I run CI on nightly.

Thought about that too.

I believe that exists in some form already. The problem is that stable and pre-release need to be run as separate jobs/steps, but if they end up with redundant Julia versions there's no way to communicate that the latter one should stop when running on GitHub Actions. I believe we'd need some way to conditionally run the pre-release step only if there is a new pre-release available, but that might go beyond what the declarative pipeline format in GitHub Actions supports.
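
As a rough illustration (not an existing feature of the Julia CI actions, just a sketch): the pre-release job could start with a small guard that exits early when the Julia it received is not actually newer than the stable version already covered by the matrix. The STABLE_JULIA environment variable here is hypothetical; something in the workflow would need to set it.

# Hypothetical guard for a "pre-release" CI job. STABLE_JULIA is assumed to
# hold the newest stable Julia that another job in the matrix already tests.
stable = VersionNumber(get(ENV, "STABLE_JULIA", "1.9.0"))
if VERSION <= stable
    @info "Pre-release Julia $VERSION is not newer than stable $stable; skipping tests."
    exit(0)
end
using Pkg
Pkg.test()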

2 Likes

Back when we had TravisCI and AppVeyor rather than GitHub Actions, we would run those jobs set to "Warn", and they wouldn't cause failure states in the badges; they'd just let you know. There has been a long-standing issue asking GitHub Actions to add this, but last I looked the GitHub Actions team seemed confused as to why anyone would want it.

3 Likes

That looks like an unambiguously breaking change in Julia, both technically and practically. Almost 2k packages depend on libcurl_jll; are they all going to be broken in their current state?

I don't know about the general situation, but two that I need to have GMT running on 1.10 are broken.

You may try it yourself by installing LibGEOS, which is not a big package.

(@v1.10) pkg> activate --temp
  Activating new project at `/tmp/jl_Ra4PYi`

(jl_Ra4PYi) pkg> add LibGEOS
   Resolving package versions...
   Installed GEOS_jll ──────────── v3.11.2+0
   Installed GeoInterfaceRecipes ─ v1.0.1
   Installed LibGEOS ───────────── v0.8.4
  Downloaded artifact: GEOS
    Updating `/tmp/jl_Ra4PYi/Project.toml`
  [a90b1aa1] + LibGEOS v0.8.4
    Updating `/tmp/jl_Ra4PYi/Manifest.toml`
  [fa961155] + CEnum v0.4.2
  [411431e0] + Extents v0.1.1
  [cf35fbd7] + GeoInterface v1.3.1
  [0329782f] + GeoInterfaceRecipes v1.0.1
  [692b3bcd] + JLLWrappers v1.4.1
  [a90b1aa1] + LibGEOS v0.8.4
  [aea7be01] + PrecompileTools v1.1.2
  [21216c6a] + Preferences v1.4.0
  [3cdcf5f2] + RecipesBase v1.3.4
  [d604d12d] + GEOS_jll v3.11.2+0
  [56f22d72] + Artifacts
  [ade2ca70] + Dates
  [8f399da3] + Libdl
  [de0858da] + Printf
  [fa267f1f] + TOML v1.0.3
  [4ec0a83e] + Unicode
Precompiling project...
  7 dependencies successfully precompiled in 3 seconds. 3 already precompiled.

julia> VERSION
v"1.10.0-alpha1"

1 Like

Good, that means the case is not so bad. But my screenshot was short and I don't understand it. It continues the dependency chain through GDAL,

and as we can see, the GDAL CI fails on 1.10-DEV exactly because of this libCURL error.

And then GMT.jl fails because its dependencies fail…

I think at least part of your problem is that a recent GDAL_jll has the compat entry NetCDF_jll = "400.902.5", and all such versions of NetCDF_jll have the compat entry LibCURL_jll = "7.73.0".
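
If it helps to trace where the old LibCURL_jll comes from in a given environment, recent Pkg versions (around Julia 1.9 and later) provide a why command; a minimal sketch:

# Print the dependency paths that pull LibCURL_jll into the active
# environment, e.g. a chain like GDAL_jll -> NetCDF_jll -> LibCURL_jll.
using Pkg
Pkg.why("LibCURL_jll")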

1 Like

Not at all.

Also, everyone who is annoyed by this situation, please direct your complaints to the Curl developer who released a major version of the package just to celebrate the project's 25th anniversary, with the only result being that that version was broken on its own.

The only thing we have to do is fix the compat bounds in the registry. Continuing to complain about this is not helpful; we're merely affected by bizarre, to say the least, decisions of upstream developers who don't follow semver. Also, Curl v8 completely broke compilation of R; what should they say?

1 Like

Well, there was one thing:

Changes

There is only one actual "change" in this release. This is the first curl release to drop support for building on systems that lack a working 64 bit data type. curl now requires that 'long long' or an equivalent exists.

Which could actually break builds on very old hardware, I suppose. But yes, it seems it was mostly a misfire.

Semver considers only changes to the API to be breaking. A change of ABI would also be breaking in practice. Neither of those changed here.

1 Like

I misread: I thought the code would not work on 32-bit machines, but it seems it will only stop compiling? Which is clearly not a semver-breaking change.

It's not even that: 32-bit platforms support 64-bit variables just fine. Off the top of my head, I can't remember a platform that explicitly doesn't support 64 bit on which one would use libcurl.

I agree that it's a bit silly to move from v7 to v8 for no reason; however, it's also silly to complain about "upstream developers who don't follow semver". At the end of the day, the maintainer is the one who makes and names releases, and there's not even a guarantee that the version will have anything resembling the format promoted by semver. This is especially true for a C project: since there's no widely used package-management solution for C, semver doesn't mean as much to C developers.

Furthermore, some are actually of the opinion that semver allows releasing non-breaking changes as breaking; the spec is not very clear.

Now that I think about it, even for Julia projects, which hopefully all try to follow semver, I’m not sure that blindly trusting the maintainers to follow it correctly is such a good idea. Semver bugs, or ambiguities, are inevitable.

3 Likes