Hi,
I’m new to Julia and GPU computing. I installed CUDA and ran ]test CUDA to check it is working. When I do this I get the following errors:
Info: Testing using device 0 (NVIDIA GH200 120GB). To change this, specify the `--gpu` argument to the tests, or set the `CUDA_VISIBLE_DEVICES` environment variable.
[ Info: Running 47 tests in parallel. If this is too many, specify the `--jobs` argument to the tests, or set the `JULIA_CPU_THREADS` environment variable.
┌ Warning: Running tests on a GPU in exclusive mode; reducing parallelism to 1.
└ @ Main /cluster/projects/nn9874k/aklocker/juliaup/depot/packages/CUDA/x8d2s/test/runtests.jl:181
| | ---------------- GPU ---------------- | ---------------- CPU ---------------- |
Test (Worker) | Time (s) | GC (s) | GC % | Alloc (MB) | RSS (MB) | GC (s) | GC % | Alloc (MB) | RSS (MB) |
core/initialization (2) | 3.57 | 0.00 | 0.0 | 0.00 | 558.00 | 0.01 | 0.2 | 61.43 | 1471.06 |
gpuarrays/reductions/sum prod (3) | 107.85 | 0.03 | 0.0 | 3.24 | 630.00 | 3.34 | 3.1 | 11213.62 | 3911.06 |
gpuarrays/reductions/reduce (3) | 63.57 | 0.02 | 0.0 | 1.53 | 634.00 | 1.69 | 2.7 | 9181.83 | 4991.06 |
gpuarrays/reductions/mapreducedim! (3) | 41.90 | 0.01 | 0.0 | 1.54 | 636.00 | 0.79 | 1.9 | 4307.07 | 5675.06 |
gpuarrays/broadcasting (3) | 102.40 | 0.02 | 0.0 | 2.00 | 642.00 | 1.69 | 1.7 | 10034.98 | 8051.06 |
gpuarrays/reductions/== isequal (3) | 36.79 | 0.01 | 0.0 | 1.07 | 646.00 | 0.94 | 2.5 | 5580.35 | 8663.06 |
gpuarrays/base (3) | 16.83 | 0.00 | 0.0 | 8.90 | 646.00 | 0.60 | 3.6 | 2604.37 | 9059.06 |
gpuarrays/random (3) | 9.21 | 0.02 | 0.2 | 392.05 | 762.00 | 0.15 | 1.6 | 1508.81 | 9491.06 |
gpuarrays/vectors (3) | 0.20 | 0.00 | 0.2 | 0.00 | 648.00 | 0.00 | 0.0 | 18.07 | 9491.06 |
gpuarrays/ext/jld2 (3) | 5.40 | 0.00 | 0.0 | 0.00 | 648.00 | 0.04 | 0.7 | 325.16 | 9599.06 |
gpuarrays/constructors (3) | 14.41 | 0.01 | 0.0 | 0.65 | 648.00 | 0.19 | 1.3 | 1166.54 | 9707.06 |
gpuarrays/reductions/mapreduce (3) | 19.07 | 0.01 | 0.1 | 1.83 | 652.00 | 0.32 | 1.7 | 2205.61 | 9923.06 |
gpuarrays/statistics (3) | 37.26 | 0.01 | 0.0 | 1.51 | 718.00 | 0.64 | 1.7 | 3696.56 | 11039.06 |
gpuarrays/linalg/norm (3) | 82.14 | 0.02 | 0.0 | 0.02 | 722.00 | 1.14 | 1.4 | 7597.94 | 14243.06 |
gpuarrays/linalg/NaN_false (3) | 9.76 | 0.00 | 0.0 | 0.00 | 724.00 | 0.09 | 0.9 | 800.22 | 14711.06 |
gpuarrays/math/intrinsics (3) | 1.12 | 0.00 | 0.0 | 0.00 | 724.00 | 0.00 | 0.0 | 91.09 | 14711.06 |
gpuarrays/linalg/mul!/matrix-matrix (3) | 55.27 | 0.02 | 0.0 | 0.13 | 726.00 | 0.94 | 1.7 | 5627.97 | 15431.06 |
gpuarrays/sparse (3) | 0.00 | 0.00 | 0.0 | 0.00 | 726.00 | 0.00 | 0.0 | 0.15 | 15431.06 |
gpuarrays/reductions/mapreducedim!_large (3) | 5.94 | 0.02 | 0.3 | 818.38 | 766.00 | 0.10 | 1.7 | 1985.02 | 16264.88 |
From worker 3: JIT session error: Cannot allocate memory
From worker 3: JIT session error: Cannot allocate memory
From worker 3:
From worker 3: [2030669] signal (11.1): Segmentation fault
From worker 3: in expression starting at none:1
gpuarrays/uniformscaling (3) | failed at 2025-12-05T10:18:27.336
Worker 3 terminated.
Unhandled Task ERROR: EOFError: read end of file
Stacktrace:
[1] (::Base.var"#wait_locked#741")(s::Sockets.TCPSocket, buf::IOBuffer, nb::Int64)
@ Base ./stream.jl:947
[2] unsafe_read(s::Sockets.TCPSocket, p::Ptr{UInt8}, nb::UInt64)
@ Base ./stream.jl:955
[3] unsafe_read
@ ./io.jl:773 [inlined]
[4] unsafe_read(s::Sockets.TCPSocket, p::Base.RefValue{NTuple{4, Int64}}, n::Int64)
@ Base ./io.jl:772
[5] read!
@ ./io.jl:774 [inlined]
[6] deserialize_hdr_raw
@ /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/messages.jl:167 [inlined]
[7] message_handler_loop(r_stream::Sockets.TCPSocket, w_stream::Sockets.TCPSocket, incoming::Bool)
@ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:172
[8] process_tcp_streams(r_stream::Sockets.TCPSocket, w_stream::Sockets.TCPSocket, incoming::Bool)
@ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:133
[9] (::Distributed.var"#103#104"{Sockets.TCPSocket, Sockets.TCPSocket, Bool})()
@ Distributed /cluster/projects/nn9874k/aklocker/juliaup/depot/juliaup/julia-1.10.10+0.aarch64.linux.gnu/share/julia/stdlib/v1.10/Distributed/src/process_messages.jl:121
Here’s the system info:
Info: System information:
│ CUDA toolchain:
│ - runtime 12.6, local installation
│ - driver 565.57.1 for 13.0
│ - compiler 12.9
│
│ CUDA libraries:
│ - CUBLAS: 12.6.3
│ - CURAND: 10.3.7
│ - CUFFT: 11.3.0
│ - CUSOLVER: 11.7.1
│ - CUSPARSE: 12.5.4
│ - CUPTI: 2024.3.2 (API 12.6.0)
│ - NVML: 12.0.0+565.57.1
│
│ Julia packages:
│ - CUDA: 5.9.5
│ - CUDA_Driver_jll: 13.0.2+0
│ - CUDA_Compiler_jll: 0.3.0+0
│ - CUDA_Runtime_jll: 0.19.2+0
│ - CUDA_Runtime_Discovery: 1.0.0
│
│ Toolchain:
│ - Julia: 1.10.10
│ - LLVM: 15.0.7
│
│ Environment:
│ - JULIA_CUDA_USE_BINARY_BUILDER: false
│ - JULIA_CUDA_MEMORY_POOL: none
│
│ Preferences:
│ - CUDA_Runtime_jll.version: 12.6
│ - CUDA_Runtime_jll.local: true
I tried to install CUDA with a local toolkit and without, but all gives the same error. Can anyone point me in the right direction of what goes wrong here, and how I best address this? Thanks in advance!
What is your Julia versioninfo(verbose=true)?
julia> versioninfo(verbose=true)
Julia Version 1.10.10
Commit 95f30e51f41 (2025-06-27 09:51 UTC)
Build Info:
Official https://julialang.org/ release
Platform Info:
OS: Linux (aarch64-linux-gnu)
"SUSE Linux Enterprise Server 15 SP6"
uname: Linux 6.4.0-150600.23.25_15.0.9-cray_shasta_c_64k #1 SMP Mon Jan 13 18:26:04 UTC 2025 (7f98b6b) aarch64 aarch64
CPU: unknown:
speed user nice sys idle irq
#1-288 3960 MHz 7409551 s 424 s 2031153 s 1918107590 s 0 s
Memory: 858.0562744140625 GB (784242.6875 MB free)
Uptime: 669691.03 sec
Load Avg: 1.04 1.04 1.83
WORD_SIZE: 64
LIBM: libopenlibm
LLVM: libLLVM-15.0.7 (ORCJIT, generic)
Threads: 1 default, 0 interactive, 1 GC (on 288 virtual cores)
Environment:
LD_LIBRARY_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/lib:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/lib:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/lib64:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/lib64:/opt/cray/pe/papi/7.2.0.1/lib64:/opt/cray/libfabric/1.22.0/lib64
JULIA_CUDA_MEMORY_POOL = none
JULIA_DEPOT_PATH = /cluster/projects/nn9874k/aklocker/juliaup/depot
JULIA_LOAD_PATH = :/cluster/projects/nn9874k/aklocker/juhpc_setup/julia_preferences
JULIA_CUDA_USE_BINARY_BUILDER = false
__LMOD_REF_COUNT_INCLUDE_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/include:1;/opt/cray/pe/cce/19.0.0/cce/aarch64/include/craylibs:1
__LMOD_REF_COUNT_PE_CRAYCLANG_FIXED_PKGCONFIG_PATH = /opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib/pkgconfig:1
__LMOD_REF_COUNT_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/bin:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/libnvvp:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Compute:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Systems/bin:1;/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/bin:1;/opt/cray/pe/mpich/8.1.32/bin:1;/opt/cray/pe/craype/2.7.34/bin:1;/opt/cray/pe/cce/19.0.0/binutils/aarch64/aarch64-unknown-linux-gnu/bin:1;/opt/cray/pe/cce/19.0.0/utils/aarch64/bin:1;/opt/cray/pe/cce/19.0.0/bin:1;/opt/cray/pe/perftools/25.03.0/bin:1;/opt/cray/pe/papi/7.2.0.1/bin:1;/opt/cray/libfabric/1.22.0/bin:1;/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:1;/cluster/projects/nn9874k/aklocker/juliaup/bin:1;/cluster/home/aklocker/.juliaup/bin:1;/opt/clmgr/sbin:1;/opt/clmgr/bin:1;/opt/sgi/sbin:1;/opt/sgi/bin:1;/usr/local/bin:1;/usr/bin:1;/bin:1;/opt/c3/bin:1;/usr/lib/mit/bin:1;/cluster/bin:1;/opt/cray/pe/bin:1
CRAY_LD_LIBRARY_PATH = /opt/cray/pe/libsci/25.03.0/CRAYCLANG/17.0/aarch64/lib:/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib:/opt/cray/pe/mpich/8.1.32/gtl/lib:/opt/cray/pe/dsmml/0.3.1/dsmml/lib:/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib:/opt/cray/pe/cce/19.0.0/cce/aarch64/lib:/opt/cray/pe/perftools/25.03.0/lib64
CRAYPAT_LD_LIBRARY_PATH = /opt/cray/pe/perftools/25.03.0/lib64
FPATH = /opt/cray/pe/lmod/lmod/init/ksh_funcs
__LMOD_REF_COUNT_NLSPATH = /opt/cray/pe/cce/19.0.0/cce/aarch64/share/nls/En/%N.cat:1
JAVA_HOME = /usr/lib64/jvm/java-11-openjdk-11
__LMOD_REF_COUNT_LD_LIBRARY_PATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/lib:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/lib:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/lib64:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/lib64:1;/opt/cray/pe/papi/7.2.0.1/lib64:1;/opt/cray/libfabric/1.22.0/lib64:1
__LMOD_REF_COUNT_PKG_CONFIG_PATH = /usr/lib64/pkgconfig:1;/opt/cray/pe/dsmml/0.3.1/dsmml/lib/pkgconfig:1;/opt/cray/pe/craype/2.7.34/pkg-config:1;/opt/cray/libfabric/1.22.0/lib64/pkgconfig:1
JUHPC_HDF5_HOME =
__LMOD_REF_COUNT_MODULEPATH = /opt/cray/pe/lmod/modulefiles/mpi/crayclang/17.0/ofi/1.0/cray-mpich/8.0:1;/opt/cray/pe/lmod/modulefiles/comnet/crayclang/17.0/ofi/1.0:1;/opt/cray/pe/lmod/modulefiles/compiler/crayclang/17.0:1;/opt/cray/pe/lmod/modulefiles/mix_compilers:1;/opt/cray/pe/lmod/modulefiles/perftools/25.03.0:1;/opt/cray/pe/lmod/modulefiles/net/ofi/1.0:1;/opt/cray/pe/lmod/modulefiles/cpu/arm-grace/1.0:1;/opt/cray/pe/modulefiles/Linux:1;/opt/cray/pe/lmod/modulefiles/craype-targets/default:1;/opt/cray/pe/lmod/modulefiles/core:1;/opt/cray/pe/lmod/lmod/modulefiles/Core:1;/opt/cray/pe/modulefiles/Core:1;/opt/cray/modulefiles:1;/cluster/software/modules/Core:1
__LMOD_REF_COUNT_CRAY_LD_LIBRARY_PATH = /opt/cray/pe/libsci/25.03.0/CRAYCLANG/17.0/aarch64/lib:1;/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib:1;/opt/cray/pe/mpich/8.1.32/gtl/lib:1;/opt/cray/pe/dsmml/0.3.1/dsmml/lib:1;/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib:1;/opt/cray/pe/cce/19.0.0/cce/aarch64/lib:1;/opt/cray/pe/perftools/25.03.0/lib64:1
LLVM_SYMBOLIZER_PATH = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/bin/llvm-symbolizer
HOME = /cluster/home/aklocker
CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
XNLSPATH = /usr/X11R6/lib/X11/nls
CPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/include:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/include
COMPILERRT_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/lib/linux
SDK_HOME = /usr/lib64/jvm/java-11-openjdk-11
NVHPC_CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
NLSPATH = /opt/cray/pe/cce/19.0.0/cce/aarch64/share/nls/En/%N.cat
PE_LIBSCI_VOLATILE_PKGCONFIG_PATH = /opt/cray/pe/libsci/25.03.0/@PRGENV@/@PE_LIBSCI_GENCOMPS@/@PE_LIBSCI_TARGET@/lib/pkgconfig
JDK_HOME = /usr/lib64/jvm/java-11-openjdk-11
INCLUDE_PATH_AARCH64 = /opt/cray/pe/cce/19.0.0/cce-clang/aarch64/lib/clang/19/include:/opt/cray/pe/cce/19.0.0/cce/aarch64/include/craylibs
__LMOD_REF_COUNT_MANPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/doc/man:1;/opt/cray/pe/libsci/25.03.0/share/man:1;/opt/cray/pe/mpich/8.1.32/ofi/man:1;/opt/cray/pe/mpich/8.1.32/man/mpich:1;/opt/cray/pe/dsmml/0.3.1/dsmml/man:1;/opt/cray/pe/craype/2.7.34/man:1;/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/share/man:1;/opt/cray/pe/cce/19.0.0/man:1;/opt/cray/pe/perftools/25.03.0/man:1;/opt/cray/pe/papi/7.2.0.1/share/pdoc/man:1;/opt/cray/libfabric/1.22.0/share/man:1;/opt/cray/pe/lmod/lmod/share/man:1;/usr/local/man:1;/usr/share/man:1;/usr/man:1;/opt/c3/man:1;/opt/clmgr/man:1;/opt/sgi/share/man:1;/opt/clmgr/share/man:1;/opt/clmgr/lib/cm-cli/man:1
PE_CRAYCLANG_FIXED_PKGCONFIG_PATH = /opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/lib/pkgconfig
JULIAUP_DEPOT_PATH = /cluster/projects/nn9874k/aklocker/juliaup/depot
TERM = xterm-256color
__LMOD_REF_COUNT_CPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nvshmem/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/comm_libs/12.6/nccl/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/nvvm/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/Debugger/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/extras/CUPTI/include:1;/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/math_libs/12.6/include:1
CUDATOOLKIT_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
JUHPC_CUDA_HOME = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6
MANPATH = /opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/doc/man:/opt/cray/pe/libsci/25.03.0/share/man:/opt/cray/pe/mpich/8.1.32/ofi/man:/opt/cray/pe/mpich/8.1.32/man/mpich:/opt/cray/pe/dsmml/0.3.1/dsmml/man:/opt/cray/pe/craype/2.7.34/man:/opt/cray/pe/cce/19.0.0/cce-clang/aarch64/share/man:/opt/cray/pe/cce/19.0.0/man:/opt/cray/pe/perftools/25.03.0/man:/opt/cray/pe/papi/7.2.0.1/share/pdoc/man:/opt/cray/libfabric/1.22.0/share/man:/opt/cray/pe/lmod/lmod/share/man:/usr/local/man:/usr/share/man:/usr/man:/opt/c3/man:/opt/clmgr/man:/opt/sgi/share/man:/opt/clmgr/share/man:/opt/clmgr/lib/cm-cli/man
OSCAR_HOME = /opt/oscar
MODULEPATH = /opt/cray/pe/lmod/modulefiles/mpi/crayclang/17.0/ofi/1.0/cray-mpich/8.0:/opt/cray/pe/lmod/modulefiles/comnet/crayclang/17.0/ofi/1.0:/opt/cray/pe/lmod/modulefiles/compiler/crayclang/17.0:/opt/cray/pe/lmod/modulefiles/mix_compilers:/opt/cray/pe/lmod/modulefiles/perftools/25.03.0:/opt/cray/pe/lmod/modulefiles/net/ofi/1.0:/opt/cray/pe/lmod/modulefiles/cpu/arm-grace/1.0:/opt/cray/pe/modulefiles/Linux:/opt/cray/pe/lmod/modulefiles/craype-targets/default:/opt/cray/pe/lmod/modulefiles/core:/opt/cray/pe/lmod/lmod/modulefiles/Core:/opt/cray/pe/modulefiles/Core:/opt/cray/modulefiles:/cluster/software/modules/Core
MODULEPATH_ROOT = /opt/cray/pe/modulefiles
LMOD_PACKAGE_PATH = /cluster/software/config/lmod/SitePackage.lua
JRE_HOME = /usr/lib64/jvm/java-11-openjdk-11
PATH = /cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/bin:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/cuda/12.6/libnvvp:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Compute:/opt/nvidia/hpc_sdk/Linux_aarch64/24.11/profilers/Nsight_Systems/bin:/opt/cray/pe/mpich/8.1.32/ofi/crayclang/17.0/bin:/opt/cray/pe/mpich/8.1.32/bin:/opt/cray/pe/craype/2.7.34/bin:/opt/cray/pe/cce/19.0.0/binutils/aarch64/aarch64-unknown-linux-gnu/bin:/opt/cray/pe/cce/19.0.0/utils/aarch64/bin:/opt/cray/pe/cce/19.0.0/bin:/opt/cray/pe/perftools/25.03.0/bin:/opt/cray/pe/papi/7.2.0.1/bin:/opt/cray/libfabric/1.22.0/bin:/cluster/projects/nn9874k/aklocker/juhpc_setup/juliaup_wrapper:/cluster/projects/nn9874k/aklocker/juliaup/bin:/cluster/home/aklocker/.juliaup/bin:/opt/clmgr/sbin:/opt/clmgr/bin:/opt/sgi/sbin:/opt/sgi/bin:/usr/local/bin:/usr/bin:/bin:/opt/c3/bin:/usr/lib/mit/bin:/cluster/bin:/opt/cray/pe/bin
MODULESHOME = /opt/cray/pe/lmod/lmod
PKG_CONFIG_PATH = /usr/lib64/pkgconfig:/opt/cray/pe/dsmml/0.3.1/dsmml/lib/pkgconfig:/opt/cray/pe/craype/2.7.34/pkg-config:/opt/cray/libfabric/1.22.0/lib64/pkgconfig
Thanks! That looks like ARM (Linux) · The Julia Language
So not an issue with CUDA.jl, but rather with Julia on that platform 
We could restart the test worker when we detect this issue.
1 Like
That does indeed look like my problem! I guess I can ask our HPC people to increase this limit for memory mapping.
And since I’m new to Julia I’m not sure what restarting a test worker would do..
My ultimate goal is to get cuda-aware MPI working so I thought I somehow need to fix this first…