CUDA test failure

Hi,
I’m new to Julia and GPU computing. I installed CUDA.jl and ran ]test CUDA to check that it is working. When I do this, I get the following errors:

[ Info: Testing using device 0 (NVIDIA GH200 120GB). To change this, specify the `--gpu` argument to the tests, or set the `CUDA_VISIBLE_DEVICES` environment variable.
[ Info: Running 47 tests in parallel. If this is too many, specify the `--jobs` argument to the tests, or set the `JULIA_CPU_THREADS` environment variable.
┌ Warning: Running tests on a GPU in exclusive mode; reducing parallelism to 1.
└ @ Main /cluster/projects/nn9874k/aklocker/juliaup/depot/packages/CUDA/x8d2s/test/runtests.jl:181
                                                  |          | ---------------- GPU ---------------- | ---------------- CPU ---------------- |
Test                                     (Worker) | Time (s) | GC (s) | GC % | Alloc (MB) | RSS (MB) | GC (s) | GC % | Alloc (MB) | RSS (MB) |
core/initialization                           (2) |     3.57 |   0.00 |  0.0 |       0.00 |   558.00 |   0.01 |  0.2 |      61.43 |  1471.06 |
gpuarrays/reductions/sum prod                 (3) |   107.85 |   0.03 |  0.0 |       3.24 |   630.00 |   3.34 |  3.1 |   11213.62 |  3911.06 |
gpuarrays/reductions/reduce                   (3) |    63.57 |   0.02 |  0.0 |       1.53 |   634.00 |   1.69 |  2.7 |    9181.83 |  4991.06 |
gpuarrays/reductions/mapreducedim!            (3) |    41.90 |   0.01 |  0.0 |       1.54 |   636.00 |   0.79 |  1.9 |    4307.07 |  5675.06 |
gpuarrays/broadcasting                        (3) |   102.40 |   0.02 |  0.0 |       2.00 |   642.00 |   1.69 |  1.7 |   10034.98 |  8051.06 |
gpuarrays/reductions/== isequal               (3) |    36.79 |   0.01 |  0.0 |       1.07 |   646.00 |   0.94 |  2.5 |    5580.35 |  8663.06 |
gpuarrays/base                                (3) |    16.83 |   0.00 |  0.0 |       8.90 |   646.00 |   0.60 |  3.6 |    2604.37 |  9059.06 |
gpuarrays/random                              (3) |     9.21 |   0.02 |  0.2 |     392.05 |   762.00 |   0.15 |  1.6 |    1508.81 |  9491.06 |
gpuarrays/vectors                             (3) |     0.20 |   0.00 |  0.2 |       0.00 |   648.00 |   0.00 |  0.0 |      18.07 |  9491.06 |
gpuarrays/ext/jld2                            (3) |     5.40 |   0.00 |  0.0 |       0.00 |   648.00 |   0.04 |  0.7 |     325.16 |  9599.06 |
gpuarrays/constructors                        (3) |    14.41 |   0.01 |  0.0 |       0.65 |   648.00 |   0.19 |  1.3 |    1166.54 |  9707.06 |
gpuarrays/reductions/mapreduce                (3) |    19.07 |   0.01 |  0.1 |       1.83 |   652.00 |   0.32 |  1.7 |    2205.61 |  9923.06 |
gpuarrays/statistics                          (3) |    37.26 |   0.01 |  0.0 |       1.51 |   718.00 |   0.64 |  1.7 |    3696.56 | 11039.06 |
gpuarrays/linalg/norm                         (3) |    82.14 |   0.02 |  0.0 |       0.02 |   722.00 |   1.14 |  1.4 |    7597.94 | 14243.06 |
gpuarrays/linalg/NaN_false                    (3) |     9.76 |   0.00 |  0.0 |       0.00 |   724.00 |   0.09 |  0.9 |     800.22 | 14711.06 |
gpuarrays/math/intrinsics                     (3) |     1.12 |   0.00 |  0.0 |       0.00 |   724.00 |   0.00 |  0.0 |      91.09 | 14711.06 |
gpuarrays/linalg/mul!/matrix-matrix           (3) |    55.27 |   0.02 |  0.0 |       0.13 |   726.00 |   0.94 |  1.7 |    5627.97 | 15431.06 |
gpuarrays/sparse                              (3) |     0.00 |   0.00 |  0.0 |       0.00 |   726.00 |   0.00 |  0.0 |       0.15 | 15431.06 |
gpuarrays/reductions/mapreducedim!_large      (3) |     5.94 |   0.02 |  0.3 |     818.38 |   766.00 |   0.10 |  1.7 |    1985.02 | 16264.88 |
      From worker 3:	JIT session error: Cannot allocate memory
      From worker 3:	JIT session error: Cannot allocate memory
      From worker 3:	
      From worker 3:	[2030669] signal (11.1): Segmentation fault
      From worker 3:	in expression starting at none:1
gpuarrays/uniformscaling                      (3) |         failed at 2025-12-05T10:18:27.336
Worker 3 terminated.
Unhandled Task ERROR: EOFError: read end of file
Stacktrace:
 [1] (::Base.var"#wait_locked#741")(s::Sockets.TCPSocket, buf::IOBuffer, nb::Int64)
   @ Base ./stream.jl:947
 [2] unsafe_read(s::Sockets.TCPSocket, p::Ptr{UInt8}, nb::UInt64)
   @ Base ./stream.jl:955
 [3] unsafe_read
   @ ./io.jl:773 [inlined]
 [4] unsafe_read(s::Sockets.TCPSocket, p::Base.RefValue{NTuple{4, Int64}}, n::Int64)
   @ Base ./io.jl:772
 [5] read!
   @ ./io.jl:774 [inlined]
 [6] deserialize_hdr_raw
   @ /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/messages.jl:167 [inlined]
 [7] message_handler_loop(r_stream::Sockets.TCPSocket, w_stream::Sockets.TCPSocket, incoming::Bool)
   @ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:172
 [8] process_tcp_streams(r_stream::Sockets.TCPSocket, w_stream::Sockets.TCPSocket, incoming::Bool)
   @ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:133
 [9] (::Distributed.var"#103#104"{Sockets.TCPSocket, Sockets.TCPSocket, Bool})()
   @ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:121

Here’s the system info:

┌ Info: System information:
│ CUDA toolchain: 
│ - runtime 12.6, local installation
│ - driver 565.57.1 for 13.0
│ - compiler 12.9
│ 
│ CUDA libraries: 
│ - CUBLAS: 12.6.3
│ - CURAND: 10.3.7
│ - CUFFT: 11.3.0
│ - CUSOLVER: 11.7.1
│ - CUSPARSE: 12.5.4
│ - CUPTI: 2024.3.2 (API 12.6.0)
│ - NVML: 12.0.0+565.57.1
│ 
│ Julia packages: 
│ - CUDA: 5.9.5
│ - CUDA_Driver_jll: 13.0.2+0
│ - CUDA_Compiler_jll: 0.3.0+0
│ - CUDA_Runtime_jll: 0.19.2+0
│ - CUDA_Runtime_Discovery: 1.0.0
│ 
│ Toolchain:
│ - Julia: 1.10.10
│ - LLVM: 15.0.7
│ 
│ Environment:
│ - JULIA_CUDA_USE_BINARY_BUILDER: false
│ - JULIA_CUDA_MEMORY_POOL: none
│ 
│ Preferences:
│ - CUDA_Runtime_jll.version: 12.6
│ - CUDA_Runtime_jll.local: true

I tried installing CUDA both with and without a local toolkit, but both give the same error. Can anyone point me in the right direction as to what is going wrong here and how best to address it? Thanks in advance!

What does versioninfo(verbose=true) print in your Julia session?

julia> versioninfo(verbose=true)
Julia Version 1.10.10
Commit 95f30e51f41 (2025-06-27 09:51 UTC)
Build Info:
  Official https://julialang.org/ release
Platform Info:
  OS: Linux (aarch64-linux-gnu)
      "SUSE Linux Enterprise Server 15 SP6"
  uname: Linux 6.4.0-150600.23.25_15.0.9-cray_shasta_c_64k #1 SMP Mon Jan 13 18:26:04 UTC 2025 (7f98b6b) aarch64 aarch64
  CPU: unknown: 
                  speed         user         nice          sys         idle          irq
       #1-288  3960 MHz    7409551 s        424 s    2031153 s  1918107590 s          0 s
  Memory: 858.0562744140625 GB (784242.6875 MB free)
  Uptime: 669691.03 sec
  Load Avg:  1.04  1.04  1.83
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-15.0.7 (ORCJIT, generic)
Threads: 1 default, 0 interactive, 1 GC (on 288 virtual cores)
Environment:
  LD_LIBRARY_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/lib:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/lib:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/lib64:/opt/cray/pe/papi/7.2.0.1/lib64:/opt/cray/libfabric/1.22.0/lib64
  JULIA_CUDA_MEMORY_POOL = none
  JULIA_DEPOT_PATH = /cluster/projects/nn9874k/aklocker/juliaup/depot
  JULIA_LOAD_PATH = :/cluster/projects/nn9874k/aklocker/juhpc_setup/julia_preferences
  JULIA_CUDA_USE_BINARY_BUILDER = false
  __LMOD_REF_COUNT_INCLUDE_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/include:1;/opt/cray/pe/cce/19.0.0/cce/aarch64/include/craylibs:1
  __LMOD_REF_COUNT_PE_CRAYCLANG_FIXED_PKGCONFIG_PATH = /opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib/pkgconfig:1
  __LMOD_REF_COUNT_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/bin:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/libnvvp:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Compute:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Systems/bin:1;/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/bin:1;/opt/cray/pe/mpich/8.1.32/bin:1;/opt/cray/pe/craype/2.7.34/bin:1;/opt/cray/pe/cce/19.0.0/binutils/aarch64/aarch64-unknown-linux-gnu/bin:1;/opt/cray/pe/cce/19.0.0/utils/aarch64/bin:1;/opt/cray/pe/cce/19.0.0/bin:1;/opt/cray/pe/perftools/25.03.0/bin:1;/opt/cray/pe/papi/7.2.0.1/bin:1;/opt/cray/libfabric/1.22.0/bin:1;/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:1;/cluster/projects/nn9874k/aklocker/juliaup/bin:1;/cluster/home/aklocker/.juliaup/bin:1;/opt/clmgr/sbin:1;/opt/clmgr/bin:1;/opt/sgi/sbin:1;/opt/sgi/bin:1;/usr/local/bin:1;/usr/bin:1;/bin:1;/opt/c3/bin:1;/usr/lib/mit/bin:1;/cluster/bin:1;/opt/cray/pe/bin:1
  CRAY_LD_LIBRARY_PATH = /opt/cray/pe/libsci/25.03.0/CRAYCLANG/17.0/aarch64/lib:/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib:/opt/cray/pe/mpich/8.1.32/gtl/lib:/opt/cray/pe/dsmml/0.3.1/dsmml/lib:/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib:/opt/cray/pe/cce/19.0.0/cce/aarch64/lib:/opt/cray/pe/perftools/25.03.0/lib64
  CRAYPAT_LD_LIBRARY_PATH = /opt/cray/pe/perftools/25.03.0/lib64
  FPATH = /opt/cray/pe/lmod/lmod/init/ksh_funcs
  __LMOD_REF_COUNT_NLSPATH = /opt/cray/pe/cce/19.0.0/cce/aarch64/share/nls/En/%N.cat:1
  JAVA_HOME = /usr/lib64/jvm/java-11-openjdk-11
  __LMOD_REF_COUNT_LD_LIBRARY_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/lib:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/lib:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/lib64:1;/opt/cray/pe/papi/7.2.0.1/lib64:1;/opt/cray/libfabric/1.22.0/lib64:1
  __LMOD_REF_COUNT_PKG_CONFIG_PATH = /usr/lib64/pkgconfig:1;/opt/cray/pe/dsmml/0.3.1/dsmml/lib/pkgconfig:1;/opt/cray/pe/craype/2.7.34/pkg-config:1;/opt/cray/libfabric/1.22.0/lib64/pkgconfig:1
  JUHPC_HDF5_HOME = 
  __LMOD_REF_COUNT_MODULEPATH = /opt/cray/pe/lmod/modulefiles/mpi/crayclang/17.0/ofi/1.0/cray-mpich/8.0:1;/opt/cray/pe/lmod/modulefiles/comnet/crayclang/17.0/ofi/1.0:1;/opt/cray/pe/lmod/modulefiles/compiler/crayclang/17.0:1;/opt/cray/pe/lmod/modulefiles/mix_compilers:1;/opt/cray/pe/lmod/modulefiles/perftools/25.03.0:1;/opt/cray/pe/lmod/modulefiles/net/ofi/1.0:1;/opt/cray/pe/lmod/modulefiles/cpu/arm-grace/1.0:1;/opt/cray/pe/modulefiles/Linux:1;/opt/cray/pe/lmod/modulefiles/craype-targets/default:1;/opt/cray/pe/lmod/modulefiles/core:1;/opt/cray/pe/lmod/lmod/modulefiles/Core:1;/opt/cray/pe/modulefiles/Core:1;/opt/cray/modulefiles:1;/cluster/software/modules/Core:1
  __LMOD_REF_COUNT_CRAY_LD_LIBRARY_PATH = /opt/cray/pe/libsci/25.03.0/CRAYCLANG/17.0/aarch64/lib:1;/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib:1;/opt/cray/pe/mpich/8.1.32/gtl/lib:1;/opt/cray/pe/dsmml/0.3.1/dsmml/lib:1;/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib:1;/opt/cray/pe/cce/19.0.0/cce/aarch64/lib:1;/opt/cray/pe/perftools/25.03.0/lib64:1
  LLVM_SYMBOLIZER_PATH = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/bin/llvm-symbolizer
  HOME = /cluster/home/aklocker
  CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
  XNLSPATH = /usr/X11R6/lib/X11/nls
  CPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/include
  COMPILERRT_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/lib/linux
  SDK_HOME = /usr/lib64/jvm/java-11-openjdk-11
  NVHPC_CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
  NLSPATH = /opt/cray/pe/cce/19.0.0/cce/aarch64/share/nls/En/%N.cat
  PE_LIBSCI_VOLATILE_PKGCONFIG_PATH = /opt/cray/pe/libsci/25.03.0/@PRGENV@/@PE_LIBSCI_GENCOMPS@/@PE_LIBSCI_TARGET@/lib/pkgconfig
  JDK_HOME = /usr/lib64/jvm/java-11-openjdk-11
  INCLUDE_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/include:/opt/cray/pe/cce/19.0.0/cce/aarch64/include/craylibs
  __LMOD_REF_COUNT_MANPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/doc/man:1;/opt/cray/pe/libsci/25.03.0/share/man:1;/opt/cray/pe/mpich/8.1.32/ofi/man:1;/opt/cray/pe/mpich/8.1.32/man/mpich:1;/opt/cray/pe/dsmml/0.3.1/dsmml/man:1;/opt/cray/pe/craype/2.7.34/man:1;/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/share/man:1;/opt/cray/pe/cce/19.0.0/man:1;/opt/cray/pe/perftools/25.03.0/man:1;/opt/cray/pe/papi/7.2.0.1/share/pdoc/man:1;/opt/cray/libfabric/1.22.0/share/man:1;/opt/cray/pe/lmod/lmod/share/man:1;/usr/local/man:1;/usr/share/man:1;/usr/man:1;/opt/c3/man:1;/opt/clmgr/man:1;/opt/sgi/share/man:1;/opt/clmgr/share/man:1;/opt/clmgr/lib/cm-cli/man:1
  PE_CRAYCLANG_FIXED_PKGCONFIG_PATH = /opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib/pkgconfig
  JULIAUP_DEPOT_PATH = /cluster/projects/nn9874k/aklocker/juliaup/depot
  TERM = xterm-256color
  __LMOD_REF_COUNT_CPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/include:1
  CUDATOOLKIT_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
  JUHPC_CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
  MANPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/doc/man:/opt/cray/pe/libsci/25.03.0/share/man:/opt/cray/pe/mpich/8.1.32/ofi/man:/opt/cray/pe/mpich/8.1.32/man/mpich:/opt/cray/pe/dsmml/0.3.1/dsmml/man:/opt/cray/pe/craype/2.7.34/man:/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/share/man:/opt/cray/pe/cce/19.0.0/man:/opt/cray/pe/perftools/25.03.0/man:/opt/cray/pe/papi/7.2.0.1/share/pdoc/man:/opt/cray/libfabric/1.22.0/share/man:/opt/cray/pe/lmod/lmod/share/man:/usr/local/man:/usr/share/man:/usr/man:/opt/c3/man:/opt/clmgr/man:/opt/sgi/share/man:/opt/clmgr/share/man:/opt/clmgr/lib/cm-cli/man
  OSCAR_HOME = /opt/oscar
  MODULEPATH = /opt/cray/pe/lmod/modulefiles/mpi/crayclang/17.0/ofi/1.0/cray-mpich/8.0:/opt/cray/pe/lmod/modulefiles/comnet/crayclang/17.0/ofi/1.0:/opt/cray/pe/lmod/modulefiles/compiler/crayclang/17.0:/opt/cray/pe/lmod/modulefiles/mix_compilers:/opt/cray/pe/lmod/modulefiles/perftools/25.03.0:/opt/cray/pe/lmod/modulefiles/net/ofi/1.0:/opt/cray/pe/lmod/modulefiles/cpu/arm-grace/1.0:/opt/cray/pe/modulefiles/Linux:/opt/cray/pe/lmod/modulefiles/craype-targets/default:/opt/cray/pe/lmod/modulefiles/core:/opt/cray/pe/lmod/lmod/modulefiles/Core:/opt/cray/pe/modulefiles/Core:/opt/cray/modulefiles:/cluster/software/modules/Core
  MODULEPATH_ROOT = /opt/cray/pe/modulefiles
  LMOD_PACKAGE_PATH = /cluster/software/config/lmod/SitePackage.lua
  JRE_HOME = /usr/lib64/jvm/java-11-openjdk-11
  PATH = /cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/bin:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/libnvvp:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Compute:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Systems/bin:/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/bin:/opt/cray/pe/mpich/8.1.32/bin:/opt/cray/pe/craype/2.7.34/bin:/opt/cray/pe/cce/19.0.0/binutils/aarch64/aarch64-unknown-linux-gnu/bin:/opt/cray/pe/cce/19.0.0/utils/aarch64/bin:/opt/cray/pe/cce/19.0.0/bin:/opt/cray/pe/perftools/25.03.0/bin:/opt/cray/pe/papi/7.2.0.1/bin:/opt/cray/libfabric/1.22.0/bin:/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/cluster/home/aklocker/.juliaup/bin:/opt/clmgr/sbin:/opt/clmgr/bin:/opt/sgi/sbin:/opt/sgi/bin:/usr/local/bin:/usr/bin:/bin:/opt/c3/bin:/usr/lib/mit/bin:/cluster/bin:/opt/cray/pe/bin
  MODULESHOME = /opt/cray/pe/lmod/lmod
  PKG_CONFIG_PATH = /usr/lib64/pkgconfig:/opt/cray/pe/dsmml/0.3.1/dsmml/lib/pkgconfig:/opt/cray/pe/craype/2.7.34/pkg-config:/opt/cray/libfabric/1.22.0/lib64/pkgconfig

Thanks! That looks like the known issue described on the "ARM (Linux) · The Julia Language" documentation page.

So not an issue with CUDA.jl, but rather with Julia on that platform :confused:
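
For anyone who wants to check how close a process is to the limit before asking an administrator to raise it, here is a minimal sketch. It assumes the limit in question is the Linux `vm.max_map_count` sysctl (readable at `/proc/sys/vm/max_map_count`) and that each line of `/proc/<pid>/maps` corresponds to one mapping. The function name `mapping_headroom` is made up for illustration.

```shell
# Report how many memory mappings a process holds versus the kernel limit.
# Usage: mapping_headroom <maps_file> [limit_file]
#   maps_file   e.g. /proc/<pid>/maps of the running Julia process
#   limit_file  defaults to /proc/sys/vm/max_map_count (assumed to be the
#               limit that matters for this error)
mapping_headroom() {
    local maps_file="$1"
    local limit_file="${2:-/proc/sys/vm/max_map_count}"
    local used limit
    used=$(wc -l < "$maps_file") || return 1   # one line per mapping
    limit=$(cat "$limit_file") || return 1
    used=$((used + 0))                         # strip any padding from wc
    echo "mappings=$used limit=$limit headroom=$((limit - used))"
}

# Example (run while the tests are in progress, as the same user):
#   mapping_headroom "/proc/$(pgrep -n julia)/maps"
#
# Raising the limit needs root; the value here is only an example:
#   sysctl -w vm.max_map_count=262144
```

If the headroom shrinks toward zero as the test run progresses, that would be consistent with the "Cannot allocate memory" JIT error above; if it stays large, the cause may lie elsewhere.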

We could restart the test worker when we detect this issue.

That does indeed look like my problem! I guess I can ask our HPC people to increase the limit on memory mappings.
Since I’m new to Julia, I’m not sure what restarting a test worker would do.
My ultimate goal is to get CUDA-aware MPI working, so I thought I needed to fix this first…
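
On what restarting a test worker would mean: going by the log above, the tests run inside a separate worker process (worker 3 here), and that process terminated after the segfault. Restarting would presumably mean launching a fresh process and continuing. That would not raise the mapping limit, but a fresh process would, under that assumption, start with far fewer mappings. The sketch below shows the general idea in plain shell. It is not how CUDA.jl's test runner works, and `run_with_restart` is a hypothetical name.

```shell
# Re-run a command when it dies from a signal, up to a maximum number of tries.
# Usage: run_with_restart <max_attempts> <command> [args...]
run_with_restart() {
    local max_attempts="$1"
    shift
    local attempt=1 status=0
    while [ "$attempt" -le "$max_attempts" ]; do
        status=0
        "$@" || status=$?
        # Statuses above 128 conventionally mean "killed by a signal"
        # (139 = 128 + 11, i.e. SIGSEGV). Anything else, success or an
        # ordinary failure, is returned to the caller unchanged.
        if [ "$status" -le 128 ]; then
            return "$status"
        fi
        echo "attempt $attempt exited with status $status; restarting" >&2
        attempt=$((attempt + 1))
    done
    return "$status"
}
```

A real test runner would restart only the crashed worker and resume with the remaining tests, whereas this sketch re-runs the whole command from the start.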