In short is PkgEval/master in worse state than usual? Before people start panic, the process seems to be working as intended, except maybe the skipped packages. I’m still curious about some of the (other) failures I’m seeing.
I’m seeing this recently, may I just overlooked before, and this isn’t new. I like how 100% of the registered package system is tested (well we have over 9300 now, so seemingly about all non-JLLs packages), or at least attempted.
In total, 8013 packages were tested, out of which 4823 succeeded, 23 crashed, 200 failed and 2967 were skipped.
2955 packages were skipped on the previous version too.
Thereof:
Package was blacklisted (2952 packages):
Some of the most important are skipped (intentionally it seems “blacklisted”), e.g. DataStructures, HTTP, DataFrames, LoopVectorization, CSV, JLD2, ImageCore, CUDA, JSON3, JuMP.
I don’t see any obvious reason or something relating all of these packages, so what does make a package be skipped (is it not them blacklisted directly, rather related to some blacklisted dependency, Artifacts?)?
https://s3.amazonaws.com/julialang-reports/nanosoldier/pkgeval/by_hash/408b4ac_vs_b12ddca/SIMDMath.primary.log
complex: Test Failed at /home/pkgeval/.julia/packages/SIMDMath/Dn7RO/test/complex_test.jl:81
Expression: (s[1]).re == e.re
Evaluated: -2397.529479230523 == -2397.5294792305203
[…]
Evaluated: -257.8749055869241 == -257.87490558692
While:
julia> -2397.529479230523 ≈ -2397.5294792305203
truejulia> -2547.570474265955 ≈ -2547.5704742659523
true
Wouldn’t you rather want ≈ there in most tests, and other packages with float math? I think someone should make a PR, maybe even me. Only in rare cases you want exact comparison for floats, and not there.
14 packages crashed during testing on the previous version too.
e.g. important I want tested: Revise v3.5.2 and StaticCompiler v0.4.9
173 packages failed tests on the previous version too.
Package has test failures (83 packages):
Some of those there I want tested and working:
- StaticArrays v1.5.21
- Missings v1.1.0
- StatsBase v0.33.21
- StrideArraysCore v0.4.12
- PlotUtils v1.3.5
- OnlineStats v1.6.1
- ProtoBuf v1.0.9
- LibDeflate v0.4.1
- Cthulhu v2.8.14
- Reactive v0.8.3
- SnoopCompile v2.10.6
Also Nullables.
The process was aborted (6 packages):
A segmentation fault happened (3 packages):
Many of those are strange (why fail before, but not much of a worry since with the PR now works “ok”):
29 packages passed tests only on the current version.
MbedTLS v1.1.7: ok vs. fail
[…]
Ironically, testing the package Test failed (“became inactive”, was “ok” before):
Mock for testing retval of record(::DefaultTestSet, ::T <: Result) methods: Error During Test at An Error Mock:0
Mock for testing retval of record(::DefaultTestSet, ::T <: Result) methods: Test Failed at A Fail Mock:0
Expression: 1
Evaluated: 2Stacktrace:
[1] record(ts::Test.DefaultTestSet, t::Union{Test.Error, Test.Fail}; print_result::Bool)
The process was aborted (6 packages):
StaticTools v0.8.7: crash vs. ok
[…]
A segmentation fault happened (3 packages):
GPUCompiler v0.19.3: crash vs. ok
RData v1.0.0: crash vs. ok