Latest CUDA.jl version 5.8.3 fails to install on NVIDIA Jetson Orin with Jetpack 6.2.1+b38

Greetings,

I am trying to install CUDA.jl on an NVIDIA Jetson Orin. The installation
fails with the errors captured below. I do NOT want to use CUDA via
artifacts; I want to use the CUDA toolkit that is already installed on the
system. But when I add the package, CUDA.jl goes ahead and downloads the
wrong version of the CUDA toolkit anyway. The current version of CUDA.jl,
5.8.3, seems to want to install CUDA 12.9, which is not supported on the
Jetson. If I install CUDA.jl version “5.5” instead, everything mostly works,
but I get the error “ERROR: NVMLError: Not Found (code 6)”, which was fixed
after 5.5 was released. This could be user error or a bug. Thanks for the help!

I am running the latest Jetpack 6.2.1+b38, which ships CUDA 12.6, with Julia
version 1.11.6.
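
The versions above can be confirmed on the device with something like the following. This is only a sketch: it assumes the usual JetPack locations (/etc/nv_tegra_release for the L4T release and /usr/local/cuda-12.6 for the toolkit), and the helper name `report` is made up for illustration.

```bash
#!/usr/bin/env bash
# Sketch: collect the version facts relevant to this report.
# The file and toolkit paths are the usual JetPack defaults (assumptions).
set -uo pipefail

report() {
    # Usage: report LABEL COMMAND [ARGS...]
    # Prints "LABEL: <last line of output>", or "LABEL: not found" on failure.
    local label=$1; shift
    local out
    if out=$("$@" 2>/dev/null); then
        printf '%s: %s\n' "$label" "$(printf '%s\n' "$out" | tail -n 1)"
    else
        printf '%s: not found\n' "$label"
    fi
}

report "L4T release" head -n 1 /etc/nv_tegra_release
report "nvcc"        /usr/local/cuda-12.6/bin/nvcc --version
report "julia"       julia --version
```

Any entry that prints "not found" points at a path that differs from the assumed default.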

Here is my install script:

#!/usr/bin/env bash
# CUDA.jl install script for the NVIDIA Jetson Orin
# The NVIDIA Jetson Orin is using CUDA 12.6 and seems to only work with CUDA.jl version 5.5
# /bin/bash cuda_installV2.sh
set -euo pipefail

# ----------- 1. set env vars (adjust if your CUDA dir is different) ----------
#export JULIA_CUDA_USE_BINARYBUILDER=false # this is deprecated apparently
export CUDA_PATH=/usr/local/cuda-12.6
#export LD_LIBRARY_PATH=${CUDA_PATH}/lib64:${LD_LIBRARY_PATH}
export PATH=${CUDA_PATH}/bin:${PATH}

# ----------- 2. cleans out the entire Julia package system ----------
rm -rf ~/.julia

# ----------- 3. create a temporary project ----------
# This script only works when version = "5.5"
julia -e '
using Pkg
# Pkg.activate(".")
Pkg.add(name="CUDA", version="5.8.3")
# write the env flag into startup.jl so it survives inside the REPL
mkpath(joinpath(DEPOT_PATH[1], "config"))
open(joinpath(DEPOT_PATH[1], "config", "startup.jl"), "w") do io
    println(io, "ENV[\"JULIA_CUDA_USE_BINARYBUILDER\"] = \"false\"")
    println(io, "ENV[\"CUDA_PATH\"] = \"/usr/local/cuda-12.6\"")
    println(io, "ENV[\"JULIA_DEBUG\"] = \"CUDA\"")
end
using CUDA
CUDA.set_runtime_version!(v"12.6", local_toolkit=true)
println("=== CUDA versioninfo ===")
CUDA.versioninfo()
'
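
In the output below, both the automatic precompile in Pkg.add and the later `using CUDA` fail, so the `CUDA.set_runtime_version!` call in the script is never reached. One thing that might be worth trying is to write the preference by hand before CUDA is loaded for the first time. The following is a minimal, untested sketch. It assumes that the setting is stored as `version` and `local` keys under `[CUDA_Runtime_jll]` in LocalPreferences.toml, and the project directory `cuda-test` is a hypothetical name.

```bash
#!/usr/bin/env bash
# Sketch: pre-write the runtime preference so it exists before CUDA is first loaded.
# ASSUMPTIONS: the [CUDA_Runtime_jll] keys below, and the project directory name.
set -euo pipefail

PROJECT_DIR="${PROJECT_DIR:-$PWD/cuda-test}"   # hypothetical project directory
CUDA_VERSION="${CUDA_VERSION:-12.6}"           # toolkit version used in the script above

mkdir -p "$PROJECT_DIR"

# LocalPreferences.toml sits next to Project.toml in the active project.
cat > "$PROJECT_DIR/LocalPreferences.toml" <<EOF
[CUDA_Runtime_jll]
version = "$CUDA_VERSION"
local = "true"
EOF

# Show what was written.
cat "$PROJECT_DIR/LocalPreferences.toml"
```

The Julia step would then have to run with `julia --project="$PROJECT_DIR"` so that this file belongs to the active environment. Preferences are apparently only read for packages listed in the project's [deps] or [extras], so CUDA_Runtime_jll may need to be added next to CUDA for the file to take effect.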

Here is the output from trying to install CUDA.jl with the script above:

Precompiling CUDA_Driver_jll...
    783.5 ms  ✓ Preferences
    593.5 ms  ✓ JLLWrappers
    924.0 ms  ✓ CUDA_Driver_jll
  3 dependencies successfully precompiled in 2 seconds. 23 already precompiled.
  Installing known registries into `~/.julia`
       Added `General` registry to ~/.julia/registries
    Updating registry at `~/.julia/registries/General.toml`
   Resolving package versions...
   Installed GPUArraysCore ─────────────── v0.2.0
   Installed demumble_jll ──────────────── v1.3.0+0
   Installed Crayons ───────────────────── v4.1.1
   Installed Adapt ─────────────────────── v4.3.0
   Installed ColorTypes ────────────────── v0.12.1
   Installed Preferences ───────────────── v1.5.0
   Installed Scratch ───────────────────── v1.3.0
   Installed PrettyTables ──────────────── v2.4.0
   Installed CUDA_Driver_jll ───────────── v13.0.0+0
   Installed TableTraits ───────────────── v1.0.1
   Installed LLVMLoopInfo ──────────────── v1.0.0
   Installed SentinelArrays ────────────── v1.4.8
   Installed GPUCompiler ───────────────── v1.6.1
   Installed DataAPI ───────────────────── v1.16.0
   Installed HashArrayMappedTries ──────── v0.2.0
   Installed Tables ────────────────────── v1.12.1
   Installed PooledArrays ──────────────── v1.4.3
   Installed FixedPointNumbers ─────────── v0.8.5
   Installed InlineStrings ─────────────── v1.4.5
   Installed JLLWrappers ───────────────── v1.7.1
   Installed AbstractFFTs ──────────────── v1.5.0
   Installed Tracy ─────────────────────── v0.1.5
   Installed PrecompileTools ───────────── v1.2.1
   Installed StaticArrays ──────────────── v1.9.15
   Installed DataValueInterfaces ───────── v1.0.0
   Installed CUDA_Compiler_jll ─────────── v0.2.0+2
   Installed IteratorInterfaceExtensions ─ v1.0.0
   Installed StaticArraysCore ──────────── v1.4.3
   Installed NVTX ──────────────────────── v1.0.1
   Installed CUDA_Runtime_Discovery ────── v1.0.0
   Installed OrderedCollections ────────── v1.8.1
   Installed LLVMExtra_jll ─────────────── v0.0.37+2
   Installed NVTX_jll ──────────────────── v3.2.2+0
   Installed GPUToolbox ────────────────── v0.3.0
   Installed LaTeXStrings ──────────────── v1.4.0
   Installed CEnum ─────────────────────── v0.5.0
   Installed ScopedValues ──────────────── v1.5.0
   Installed InvertedIndices ───────────── v1.3.1
   Installed Random123 ─────────────────── v1.7.1
   Installed GPUArrays ─────────────────── v11.2.3
   Installed BFloat16s ─────────────────── v0.5.1
   Installed JuliaNVTXCallbacks_jll ────── v0.2.1+0
   Installed Reexport ──────────────────── v1.2.2
   Installed LibTracyClient_jll ────────── v0.9.1+6
   Installed CUDA_Runtime_jll ──────────── v0.19.0+0
   Installed RandomNumbers ─────────────── v1.6.0
   Installed Statistics ────────────────── v1.11.1
   Installed DataFrames ────────────────── v1.7.1
   Installed Requires ──────────────────── v1.3.1
   Installed DataStructures ────────────── v0.19.1
   Installed Colors ────────────────────── v0.13.1
   Installed Compat ────────────────────── v4.18.0
   Installed ExprTools ─────────────────── v0.1.10
   Installed MacroTools ────────────────── v0.5.16
   Installed KernelAbstractions ────────── v0.9.38
   Installed UnsafeAtomics ─────────────── v0.3.0
   Installed Missings ──────────────────── v1.2.0
   Installed StringManipulation ────────── v0.4.1
   Installed Atomix ────────────────────── v1.1.2
   Installed SortingAlgorithms ─────────── v1.2.2
   Installed CUDA ──────────────────────── v5.8.3
   Installed LLVM ──────────────────────── v9.4.2
    Updating `~/.julia/environments/v1.11/Project.toml`
  [052768ef] + CUDA v5.8.3
    Updating `~/.julia/environments/v1.11/Manifest.toml`
  [621f4979] + AbstractFFTs v1.5.0
  [79e6a3ab] + Adapt v4.3.0
  [a9b6321e] + Atomix v1.1.2
  [ab4f0b2a] + BFloat16s v0.5.1
  [fa961155] + CEnum v0.5.0
  [052768ef] + CUDA v5.8.3
  [1af6417a] + CUDA_Runtime_Discovery v1.0.0
  [3da002f7] + ColorTypes v0.12.1
  [5ae59095] + Colors v0.13.1
  [34da2185] + Compat v4.18.0
  [a8cc5b0e] + Crayons v4.1.1
  [9a962f9c] + DataAPI v1.16.0
  [a93c6f00] + DataFrames v1.7.1
  [864edb3b] + DataStructures v0.19.1
  [e2d170a0] + DataValueInterfaces v1.0.0
  [e2ba6199] + ExprTools v0.1.10
  [53c48c17] + FixedPointNumbers v0.8.5
  [0c68f7d7] + GPUArrays v11.2.3
  [46192b85] + GPUArraysCore v0.2.0
  [61eb1bfa] + GPUCompiler v1.6.1
  [096a3bc2] + GPUToolbox v0.3.0
  [076d061b] + HashArrayMappedTries v0.2.0
  [842dd82b] + InlineStrings v1.4.5
  [41ab1584] + InvertedIndices v1.3.1
  [82899510] + IteratorInterfaceExtensions v1.0.0
  [692b3bcd] + JLLWrappers v1.7.1
  [63c18a36] + KernelAbstractions v0.9.38
  [929cbde3] + LLVM v9.4.2
  [8b046642] + LLVMLoopInfo v1.0.0
  [b964fa9f] + LaTeXStrings v1.4.0
  [1914dd2f] + MacroTools v0.5.16
  [e1d29d7a] + Missings v1.2.0
  [5da4648a] + NVTX v1.0.1
  [bac558e1] + OrderedCollections v1.8.1
  [2dfb63ee] + PooledArrays v1.4.3
⌅ [aea7be01] + PrecompileTools v1.2.1
  [21216c6a] + Preferences v1.5.0
⌅ [08abe8d2] + PrettyTables v2.4.0
  [74087812] + Random123 v1.7.1
  [e6cf234a] + RandomNumbers v1.6.0
  [189a3867] + Reexport v1.2.2
  [ae029012] + Requires v1.3.1
  [7e506255] + ScopedValues v1.5.0
  [6c6a2e73] + Scratch v1.3.0
  [91c51154] + SentinelArrays v1.4.8
  [a2af1166] + SortingAlgorithms v1.2.2
  [90137ffa] + StaticArrays v1.9.15
  [1e83bf80] + StaticArraysCore v1.4.3
  [10745b16] + Statistics v1.11.1
  [892a3eda] + StringManipulation v0.4.1
  [3783bdb8] + TableTraits v1.0.1
  [bd369af6] + Tables v1.12.1
  [e689c965] + Tracy v0.1.5
  [013be700] + UnsafeAtomics v0.3.0
→ [d1e2174e] + CUDA_Compiler_jll v0.2.0+2
  [4ee394cb] + CUDA_Driver_jll v13.0.0+0
→ [76a88914] + CUDA_Runtime_jll v0.19.0+0
  [9c1d0b0a] + JuliaNVTXCallbacks_jll v0.2.1+0
  [dad2f222] + LLVMExtra_jll v0.0.37+2
  [ad6e5548] + LibTracyClient_jll v0.9.1+6
  [e98f9f5b] + NVTX_jll v3.2.2+0
  [1e29f10c] + demumble_jll v1.3.0+0
  [0dad84c5] + ArgTools v1.1.2
  [56f22d72] + Artifacts v1.11.0
  [2a0f44e3] + Base64 v1.11.0
  [ade2ca70] + Dates v1.11.0
  [f43a241f] + Downloads v1.6.0
  [7b1f6079] + FileWatching v1.11.0
  [9fa8497b] + Future v1.11.0
  [b77e0a4c] + InteractiveUtils v1.11.0
  [4af54fe1] + LazyArtifacts v1.11.0
  [b27032c2] + LibCURL v0.6.4
  [76f85450] + LibGit2 v1.11.0
  [8f399da3] + Libdl v1.11.0
  [37e2e46d] + LinearAlgebra v1.11.0
  [56ddb016] + Logging v1.11.0
  [d6f4376e] + Markdown v1.11.0
  [ca575930] + NetworkOptions v1.2.0
  [44cfe95a] + Pkg v1.11.0
  [de0858da] + Printf v1.11.0
  [9a3f8284] + Random v1.11.0
  [ea8e919c] + SHA v0.7.0
  [9e88b42a] + Serialization v1.11.0
  [2f01184e] + SparseArrays v1.11.0
  [fa267f1f] + TOML v1.0.3
  [a4e569a6] + Tar v1.10.0
  [cf7118a7] + UUIDs v1.11.0
  [4ec0a83e] + Unicode v1.11.0
  [e66e0078] + CompilerSupportLibraries_jll v1.1.1+0
  [deac9b47] + LibCURL_jll v8.6.0+0
  [e37daf67] + LibGit2_jll v1.7.2+0
  [29816b5a] + LibSSH2_jll v1.11.0+1
  [c8ffd9c3] + MbedTLS_jll v2.28.6+0
  [14a3606d] + MozillaCACerts_jll v2023.12.12
  [4536629a] + OpenBLAS_jll v0.3.27+1
  [bea87d4a] + SuiteSparse_jll v7.7.0+0
  [83775a58] + Zlib_jll v1.2.13+1
  [8e850b90] + libblastrampoline_jll v5.11.0+0
  [8e850ede] + nghttp2_jll v1.59.0+0
  [3f19e933] + p7zip_jll v17.4.0+2
        Info Packages marked with → are not downloaded, use `instantiate` to download
        Info Packages marked with ⌅ have new versions available but compatibility constraints restrict them from upgrading. To see why use `status --outdated -m`
Precompiling project...
    763.3 ms  ✓ LLVMLoopInfo
    694.2 ms  ✓ Reexport
    713.2 ms  ✓ DataValueInterfaces
    726.4 ms  ✓ LaTeXStrings
    723.8 ms  ✓ IteratorInterfaceExtensions
    845.9 ms  ✓ ExprTools
    860.3 ms  ✓ InvertedIndices
    842.9 ms  ✓ DataAPI
   1003.9 ms  ✓ Statistics
   1085.8 ms  ✓ AbstractFFTs
   1118.5 ms  ✓ OrderedCollections
   1235.3 ms  ✓ InlineStrings
    767.7 ms  ✓ CEnum
    704.0 ms  ✓ StaticArraysCore
   1053.1 ms  ✓ HashArrayMappedTries
   1287.6 ms  ✓ BFloat16s
    883.5 ms  ✓ Scratch
   1065.3 ms  ✓ Requires
   1751.7 ms  ✓ RandomNumbers
   1298.7 ms  ✓ Compat
    983.8 ms  ✓ TableTraits
    948.1 ms  ✓ PooledArrays
    976.3 ms  ✓ Missings
   3033.2 ms  ✓ UnsafeAtomics
   2476.5 ms  ✓ Crayons
   2385.4 ms  ✓ CUDA_Runtime_Discovery
   1777.1 ms  ✓ Preferences
   1411.2 ms  ✓ Statistics → SparseArraysExt
    969.7 ms  ✓ ScopedValues
   1169.0 ms  ✓ Adapt
    829.4 ms  ✓ Compat → CompatLinearAlgebraExt
   2915.5 ms  ✓ SentinelArrays
    792.8 ms  ✓ PrecompileTools
    995.2 ms  ✓ Atomix
    913.7 ms  ✓ JLLWrappers
   1373.4 ms  ✓ Tables
   1738.9 ms  ✓ Random123
   1038.7 ms  ✓ Adapt → AdaptSparseArraysExt
   1085.5 ms  ✓ GPUArraysCore
   1070.9 ms  ✓ LibTracyClient_jll
   1123.7 ms  ✓ NVTX_jll
   1141.2 ms  ✓ JuliaNVTXCallbacks_jll
   4683.4 ms  ✓ MacroTools
   3062.0 ms  ✓ DataStructures
    992.7 ms  ✓ demumble_jll
    785.5 ms  ✓ SortingAlgorithms
   2387.7 ms  ✓ CUDA_Driver_jll
   1340.3 ms  ✓ Tracy
  10642.5 ms  ✓ FixedPointNumbers
   8625.8 ms  ✓ StringManipulation
   8357.7 ms  ✓ LLVMExtra_jll
   8518.4 ms  ✓ ColorTypes
  15014.5 ms  ✓ CUDA_Runtime_jll
  24077.9 ms  ✓ StaticArrays
           ✗ CUDA_Compiler_jll
   7361.7 ms  ✓ StaticArrays → StaticArraysStatisticsExt
   7362.5 ms  ✓ Adapt → AdaptStaticArraysExt
  14423.2 ms  ✓ Colors
  22810.5 ms  ✓ LLVM
   1923.0 ms  ✓ NVTX
   1925.7 ms  ✓ LLVM → BFloat16sExt
   1997.7 ms  ✓ GPUToolbox
   2901.0 ms  ✓ UnsafeAtomics → UnsafeAtomicsLLVM
   7594.3 ms  ✓ KernelAbstractions
   6768.8 ms  ✓ KernelAbstractions → LinearAlgebraExt
   6789.0 ms  ✓ KernelAbstractions → SparseArraysExt
  37331.3 ms  ✓ PrettyTables
   7767.8 ms  ✓ GPUArrays
  39988.7 ms  ✓ GPUCompiler
  56262.7 ms  ✓ DataFrames
           ✗ CUDA
           ✗ Atomix → AtomixCUDAExt
  69 dependencies successfully precompiled in 126 seconds. 30 already precompiled.

The following 1 direct dependency failed to precompile:

CUDA 

Failed to precompile CUDA [052768ef-5323-5732-b1bb-66c8b64840ba] to "/home/orinstfi/.julia/compiled/v1.11/CUDA/jl_hiBzIn".
ERROR: LoadError: InitError: could not load library "/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so"
/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so: undefined symbol: __nvJitLinkGetErrorLogSize_12_9, version libnvJitLink.so.12
Stacktrace:
  [1] dlopen(s::String, flags::UInt32; throw_error::Bool)
    @ Base.Libc.Libdl ./libdl.jl:120
  [2] dlopen(s::String, flags::UInt32)
    @ Base.Libc.Libdl ./libdl.jl:119
  [3] macro expansion
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/products/library_generators.jl:63 [inlined]
  [4] __init__()
    @ CUDA_Runtime_jll ~/.julia/packages/CUDA_Runtime_jll/xAuK8/src/wrappers/aarch64-linux-gnu-cuda_platform+jetson-cuda+12.9.jl:70
  [5] run_module_init(mod::Module, i::Int64)
    @ Base ./loading.jl:1378
  [6] register_restored_modules(sv::Core.SimpleVector, pkg::Base.PkgId, path::String)
    @ Base ./loading.jl:1366
  [7] _include_from_serialized(pkg::Base.PkgId, path::String, ocachepath::String, depmods::Vector{Any}, ignore_native::Nothing; register::Bool)
    @ Base ./loading.jl:1254
  [8] _include_from_serialized (repeats 2 times)
    @ ./loading.jl:1210 [inlined]
  [9] _require_search_from_serialized(pkg::Base.PkgId, sourcepath::String, build_id::UInt128, stalecheck::Bool; reasons::Dict{String, Int64}, DEPOT_PATH::Vector{String})
    @ Base ./loading.jl:2057
 [10] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2527
 [11] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [12] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [13] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [14] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [15] macro expansion
    @ ./loading.jl:2314 [inlined]
 [16] macro expansion
    @ ./lock.jl:273 [inlined]
 [17] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [18] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [19] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [20] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [21] include(mod::Module, _path::String)
    @ Base ./Base.jl:562
 [22] top-level scope
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/toplevel_generators.jl:199
 [23] include
    @ ./Base.jl:562 [inlined]
 [24] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::String)
    @ Base ./loading.jl:2881
 [25] top-level scope
    @ stdin:6
during initialization of module CUDA_Runtime_jll
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/wrappers/aarch64-linux-gnu-cuda_platform+sbsa-cuda+12.jl:5
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/CUDA_Compiler_jll.jl:2
in expression starting at stdin:6
ERROR: LoadError: Failed to precompile CUDA_Compiler_jll [d1e2174e-dfdc-576e-b43e-73b79eb1aca8] to "/home/orinstfi/.julia/compiled/v1.11/CUDA_Compiler_jll/jl_BQ1UdQ".
Stacktrace:
  [1] error(s::String)
    @ Base ./error.jl:35
  [2] compilecache(pkg::Base.PkgId, path::String, internal_stderr::IO, internal_stdout::IO, keep_loaded_modules::Bool; flags::Cmd, cacheflags::Base.CacheFlags, reasons::Dict{String, Int64}, loadable_exts::Nothing)
    @ Base ./loading.jl:3174
  [3] (::Base.var"#1110#1111"{Base.PkgId})()
    @ Base ./loading.jl:2579
  [4] mkpidlock(f::Base.var"#1110#1111"{Base.PkgId}, at::String, pid::Int32; kwopts::@Kwargs{stale_age::Int64, wait::Bool})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:95
  [5] #mkpidlock#6
    @ /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:90 [inlined]
  [6] trymkpidlock(::Function, ::Vararg{Any}; kwargs::@Kwargs{stale_age::Int64})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:116
  [7] #invokelatest#2
    @ ./essentials.jl:1057 [inlined]
  [8] invokelatest
    @ ./essentials.jl:1052 [inlined]
  [9] maybe_cachefile_lock(f::Base.var"#1110#1111"{Base.PkgId}, pkg::Base.PkgId, srcpath::String; stale_age::Int64)
    @ Base ./loading.jl:3698
 [10] maybe_cachefile_lock
    @ ./loading.jl:3695 [inlined]
 [11] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2565
 [12] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [13] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [14] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [15] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [16] macro expansion
    @ ./loading.jl:2314 [inlined]
 [17] macro expansion
    @ ./lock.jl:273 [inlined]
 [18] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [19] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [20] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [21] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [22] include
    @ ./Base.jl:562 [inlined]
 [23] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
    @ Base ./loading.jl:2881
 [24] top-level scope
    @ stdin:6
in expression starting at /home/orinstfi/.julia/packages/CUDA/Wfi8S/src/CUDA.jl:1
in expression starting at stdin:
Precompiling CUDA...
           ✗ CUDA_Compiler_jll
Info Given CUDA was explicitly requested, output will be shown live
ERROR: LoadError: InitError: could not load library "/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so"
/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so: undefined symbol: __nvJitLinkGetErrorLogSize_12_9, version libnvJitLink.so.12
Stacktrace:
  [1] dlopen(s::String, flags::UInt32; throw_error::Bool)
    @ Base.Libc.Libdl ./libdl.jl:120
  [2] dlopen(s::String, flags::UInt32)
    @ Base.Libc.Libdl ./libdl.jl:119
  [3] macro expansion
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/products/library_generators.jl:63 [inlined]
  [4] __init__()
    @ CUDA_Runtime_jll ~/.julia/packages/CUDA_Runtime_jll/xAuK8/src/wrappers/aarch64-linux-gnu-cuda_platform+jetson-cuda+12.9.jl:70
  [5] run_module_init(mod::Module, i::Int64)
    @ Base ./loading.jl:1378
  [6] register_restored_modules(sv::Core.SimpleVector, pkg::Base.PkgId, path::String)
    @ Base ./loading.jl:1366
  [7] _include_from_serialized(pkg::Base.PkgId, path::String, ocachepath::String, depmods::Vector{Any}, ignore_native::Nothing; register::Bool)
    @ Base ./loading.jl:1254
  [8] _include_from_serialized (repeats 2 times)
    @ ./loading.jl:1210 [inlined]
  [9] _require_search_from_serialized(pkg::Base.PkgId, sourcepath::String, build_id::UInt128, stalecheck::Bool; reasons::Dict{String, Int64}, DEPOT_PATH::Vector{String})
    @ Base ./loading.jl:2057
 [10] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2527
 [11] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [12] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [13] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [14] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [15] macro expansion
    @ ./loading.jl:2314 [inlined]
 [16] macro expansion
    @ ./lock.jl:273 [inlined]
 [17] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [18] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [19] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [20] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [21] include(mod::Module, _path::String)
    @ Base ./Base.jl:562
 [22] top-level scope
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/toplevel_generators.jl:199
 [23] include
    @ ./Base.jl:562 [inlined]
 [24] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::String)
    @ Base ./loading.jl:2881
 [25] top-level scope
    @ stdin:6
during initialization of module CUDA_Runtime_jll
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/wrappers/aarch64-linux-gnu-cuda_platform+sbsa-cuda+12.jl:5
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/CUDA_Compiler_jll.jl:2
in expression starting at stdin:6
ERROR: LoadError: Failed to precompile CUDA_Compiler_jll [d1e2174e-dfdc-576e-b43e-73b79eb1aca8] to "/home/orinstfi/.julia/compiled/v1.11/CUDA_Compiler_jll/jl_05Y5uf".
Stacktrace:
  [1] error(s::String)
    @ Base ./error.jl:35
  [2] compilecache(pkg::Base.PkgId, path::String, internal_stderr::IO, internal_stdout::IO, keep_loaded_modules::Bool; flags::Cmd, cacheflags::Base.CacheFlags, reasons::Dict{String, Int64}, loadable_exts::Nothing)
    @ Base ./loading.jl:3174
  [3] (::Base.var"#1110#1111"{Base.PkgId})()
    @ Base ./loading.jl:2579
  [4] mkpidlock(f::Base.var"#1110#1111"{Base.PkgId}, at::String, pid::Int32; kwopts::@Kwargs{stale_age::Int64, wait::Bool})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:95
  [5] #mkpidlock#6
    @ /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:90 [inlined]
  [6] trymkpidlock(::Function, ::Vararg{Any}; kwargs::@Kwargs{stale_age::Int64})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:116
  [7] #invokelatest#2
    @ ./essentials.jl:1057 [inlined]
  [8] invokelatest
    @ ./essentials.jl:1052 [inlined]
  [9] maybe_cachefile_lock(f::Base.var"#1110#1111"{Base.PkgId}, pkg::Base.PkgId, srcpath::String; stale_age::Int64)
    @ Base ./loading.jl:3698
 [10] maybe_cachefile_lock
    @ ./loading.jl:3695 [inlined]
 [11] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2565
 [12] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [13] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [14] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [15] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [16] macro expansion
    @ ./loading.jl:2314 [inlined]
 [17] macro expansion
    @ ./lock.jl:273 [inlined]
 [18] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [19] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [20] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [21] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [22] include
    @ ./Base.jl:562 [inlined]
 [23] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
    @ Base ./loading.jl:2881
 [24] top-level scope
    @ stdin:6
in expression starting at /home/orinstfi/.julia/packages/CUDA/Wfi8S/src/CUDA.jl:1
in expression starting at stdin:6
           ✗ CUDA
           ✗ Atomix → AtomixCUDAExt
  0 dependencies successfully precompiled in 24 seconds. 98 already precompiled.

ERROR: The following 1 direct dependency failed to precompile:

CUDA 

Failed to precompile CUDA [052768ef-5323-5732-b1bb-66c8b64840ba] to "/home/orinstfi/.julia/compiled/v1.11/CUDA/jl_7Gka21".
ERROR: LoadError: InitError: could not load library "/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so"
/home/orinstfi/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libcusparse.so: undefined symbol: __nvJitLinkGetErrorLogSize_12_9, version libnvJitLink.so.12
Stacktrace:
  [1] dlopen(s::String, flags::UInt32; throw_error::Bool)
    @ Base.Libc.Libdl ./libdl.jl:120
  [2] dlopen(s::String, flags::UInt32)
    @ Base.Libc.Libdl ./libdl.jl:119
  [3] macro expansion
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/products/library_generators.jl:63 [inlined]
  [4] __init__()
    @ CUDA_Runtime_jll ~/.julia/packages/CUDA_Runtime_jll/xAuK8/src/wrappers/aarch64-linux-gnu-cuda_platform+jetson-cuda+12.9.jl:70
  [5] run_module_init(mod::Module, i::Int64)
    @ Base ./loading.jl:1378
  [6] register_restored_modules(sv::Core.SimpleVector, pkg::Base.PkgId, path::String)
    @ Base ./loading.jl:1366
  [7] _include_from_serialized(pkg::Base.PkgId, path::String, ocachepath::String, depmods::Vector{Any}, ignore_native::Nothing; register::Bool)
    @ Base ./loading.jl:1254
  [8] _include_from_serialized (repeats 2 times)
    @ ./loading.jl:1210 [inlined]
  [9] _require_search_from_serialized(pkg::Base.PkgId, sourcepath::String, build_id::UInt128, stalecheck::Bool; reasons::Dict{String, Int64}, DEPOT_PATH::Vector{String})
    @ Base ./loading.jl:2057
 [10] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2527
 [11] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [12] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [13] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [14] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [15] macro expansion
    @ ./loading.jl:2314 [inlined]
 [16] macro expansion
    @ ./lock.jl:273 [inlined]
 [17] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [18] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [19] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [20] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [21] include(mod::Module, _path::String)
    @ Base ./Base.jl:562
 [22] top-level scope
    @ ~/.julia/packages/JLLWrappers/m2Pjh/src/toplevel_generators.jl:199
 [23] include
    @ ./Base.jl:562 [inlined]
 [24] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::String)
    @ Base ./loading.jl:2881
 [25] top-level scope
    @ stdin:6
during initialization of module CUDA_Runtime_jll
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/wrappers/aarch64-linux-gnu-cuda_platform+sbsa-cuda+12.jl:5
in expression starting at /home/orinstfi/.julia/packages/CUDA_Compiler_jll/BYn2f/src/CUDA_Compiler_jll.jl:2
in expression starting at stdin:6
ERROR: LoadError: Failed to precompile CUDA_Compiler_jll [d1e2174e-dfdc-576e-b43e-73b79eb1aca8] to "/home/orinstfi/.julia/compiled/v1.11/CUDA_Compiler_jll/jl_05Y5uf".
Stacktrace:
  [1] error(s::String)
    @ Base ./error.jl:35
  [2] compilecache(pkg::Base.PkgId, path::String, internal_stderr::IO, internal_stdout::IO, keep_loaded_modules::Bool; flags::Cmd, cacheflags::Base.CacheFlags, reasons::Dict{String, Int64}, loadable_exts::Nothing)
    @ Base ./loading.jl:3174
  [3] (::Base.var"#1110#1111"{Base.PkgId})()
    @ Base ./loading.jl:2579
  [4] mkpidlock(f::Base.var"#1110#1111"{Base.PkgId}, at::String, pid::Int32; kwopts::@Kwargs{stale_age::Int64, wait::Bool})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:95
  [5] #mkpidlock#6
    @ /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:90 [inlined]
  [6] trymkpidlock(::Function, ::Vararg{Any}; kwargs::@Kwargs{stale_age::Int64})
    @ FileWatching.Pidfile /opt/julia-1.11.6/share/julia/stdlib/v1.11/FileWatching/src/pidfile.jl:116
  [7] #invokelatest#2
    @ ./essentials.jl:1057 [inlined]
  [8] invokelatest
    @ ./essentials.jl:1052 [inlined]
  [9] maybe_cachefile_lock(f::Base.var"#1110#1111"{Base.PkgId}, pkg::Base.PkgId, srcpath::String; stale_age::Int64)
    @ Base ./loading.jl:3698
 [10] maybe_cachefile_lock
    @ ./loading.jl:3695 [inlined]
 [11] _require(pkg::Base.PkgId, env::String)
    @ Base ./loading.jl:2565
 [12] __require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2388
 [13] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [14] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [15] _require_prelocked(uuidkey::Base.PkgId, env::String)
    @ Base ./loading.jl:2375
 [16] macro expansion
    @ ./loading.jl:2314 [inlined]
 [17] macro expansion
    @ ./lock.jl:273 [inlined]
 [18] __require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2271
 [19] #invoke_in_world#3
    @ ./essentials.jl:1089 [inlined]
 [20] invoke_in_world
    @ ./essentials.jl:1086 [inlined]
 [21] require(into::Module, mod::Symbol)
    @ Base ./loading.jl:2260
 [22] include
    @ ./Base.jl:562 [inlined]
 [23] include_package_for_output(pkg::Base.PkgId, input::String, depot_path::Vector{String}, dl_load_path::Vector{String}, load_path::Vector{String}, concrete_deps::Vector{Pair{Base.PkgId, UInt128}}, source::Nothing)
    @ Base ./loading.jl:2881
 [24] top-level scope
    @ stdin:6
in expression starting at /home/orinstfi/.julia/packages/CUDA/Wfi8S/src/CUDA.jl:1
in expression starting at stdin:
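
One possible reading of the "undefined symbol" error is that libcusparse.so from the 12.9 artifact gets resolved against a libnvJitLink.so.12 that does not export the 12.9 symbol. That is a guess, not something the log confirms. The sketch below checks which copies of libnvJitLink export the symbol. It needs `nm` from binutils, and both library paths are assumptions: the first guesses that the artifact directory from the error also holds libnvJitLink.so.12, and the second is the usual system toolkit location. The helper name `exports_symbol` is made up for illustration.

```bash
#!/usr/bin/env bash
# Sketch: report whether a shared library exports a given dynamic symbol.
# Requires `nm` from binutils. The library paths below are assumptions.
set -u

SYMBOL="__nvJitLinkGetErrorLogSize_12_9"   # symbol named in the error above

exports_symbol() {
    # Usage: exports_symbol LIBRARY SYMBOL
    # Returns 0 if LIBRARY defines SYMBOL in its dynamic symbol table,
    # 2 if LIBRARY does not exist, and non-zero otherwise.
    local lib=$1 sym=$2
    [ -f "$lib" ] || return 2
    nm -D --defined-only "$lib" 2>/dev/null | grep -F -- "$sym" >/dev/null
}

for lib in \
    "$HOME/.julia/artifacts/af88240fc8904d48bf677fd73760e1e961eaba8e/lib/libnvJitLink.so.12" \
    "/usr/local/cuda-12.6/lib64/libnvJitLink.so.12"
do
    if exports_symbol "$lib" "$SYMBOL"; then
        echo "exports symbol:  $lib"
    elif [ $? -eq 2 ]; then
        echo "file not found:  $lib"
    else
        echo "missing symbol:  $lib"
    fi
done
```

If only the artifact copy exports the symbol, the system library is probably being picked up at load time. Running `ldd` on the libcusparse.so from the error message should show which file is actually resolved.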