I am trying to run some CUDA calculations on a cluster and I have a problem with CUDA artifacts. Here is a simple test code, which I run on cluster node.
ENV["JULIA_CUDA_USE_BINARYBUILDER"] = false
using CUDA
CUDA.versioninfo()
println("")
a = CuArray([1,2])
print(a)
The code give me the output, which seems to be correct:
CUDA toolkit 11.6, local installation
NVIDIA driver 460.32.3, for CUDA 11.2
CUDA driver 11.2
Libraries:
- CUBLAS: 11.9.2
- CURAND: 10.2.9
- CUFFT: 10.7.2
- CUSOLVER: 11.3.4
- CUSPARSE: 11.7.2
- CUPTI: 16.0.0
- NVML: 11.0.0+460.32.3
- CUDNN: 8.10.1 (for CUDA 11.2.0)
- CUTENSOR: missing
Toolchain:
- Julia: 1.6.7
- LLVM: 11.0.1
- PTX ISA support: 3.2, 4.0, 4.1, 4.2, 4.3, 5.0, 6.0, 6.1, 6.3, 6.4, 6.5, 7.0
- Device capability support: sm_35, sm_37, sm_50, sm_52, sm_53, sm_60, sm_61, sm_62, sm_70, sm_72, sm_75, sm_80
Environment:
- JULIA_CUDA_USE_BINARYBUILDER: false
2 devices:
0: Tesla K40m (sm_35, 11.918 GiB / 11.921 GiB available)
1: Tesla K40m (sm_35, 11.918 GiB / 11.921 GiB available)
[1, 2]
But in addition I have some error messages in the debug output. Is this OK? What is wrong
with CUDA_compat? Can I get rid of this βDownloading artifact: CUDA_compatβ messages?
Downloading artifact: CUDA_compat
Downloading artifact: CUDA_compat
β Debug: Could not download forward compatibility package
β exception =
β Unable to automatically install 'CUDA_compat' from '/home/.julia/packages/CUDA/DfvRa/Artifacts.toml'
β Stacktrace:
β [1] error(s::String)
β @ Base ./error.jl:33
β [2] ensure_artifact_installed(name::String, meta::Dict{String, Any}, artifacts_toml::String; platform::Base.BinaryPlatforms.Platform, verbose::Bool, quiet_download::Bool, io::Base.PipeEndpoint)
β @ Pkg.Artifacts /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/Pkg/src/Artifacts.jl:446
β [3] ensure_artifact_installed(name::String, artifacts_toml::String; platform::Base.BinaryPlatforms.Platform, pkg_uuid::Nothing, verbose::Bool, quiet_download::Bool, io::Base.PipeEndpoint)
β @ Pkg.Artifacts /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/Pkg/src/Artifacts.jl:409
β [4] _artifact_str(__module__::Module, artifacts_toml::String, name::SubString{String}, path_tail::String, artifact_dict::Dict{String, Any}, hash::Base.SHA1, platform::Base.BinaryPlatforms.Platform, lazyartifacts::Any)
β @ Artifacts /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/Artifacts/src/Artifacts.jl:548
β [5] #invokelatest#2
β @ ./essentials.jl:708 [inlined]
β [6] invokelatest
β @ ./essentials.jl:706 [inlined]
β [7] macro expansion
β @ /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.6/Artifacts/src/Artifacts.jl:690 [inlined]
β [8] (::CUDA.var"#1#4")(id::String)
β @ CUDA ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/CUDAdrv.jl:97
β [9] find_libcuda()
β @ CUDA ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/CUDAdrv.jl:98
β [10] libcuda()
β @ CUDA ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/CUDAdrv.jl:141
β [11] macro expansion
β @ ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:67 [inlined]
β [12] macro expansion
β @ lock.jl:209 [inlined]
β [13] toolkit()
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:21
β [14] macro expansion
β @ ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:342 [inlined]
β [15] macro expansion
β @ lock.jl:209 [inlined]
β [16] libcudart()
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:21
β [17] macro expansion
β @ ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/error.jl:95 [inlined]
β [18] runtime_version()
β @ CUDA ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/version.jl:44
β [19] versioninfo(io::Base.PipeEndpoint) (repeats 2 times)
β @ CUDA ~/.julia/packages/CUDA/DfvRa/src/utilities.jl:32
β [20] top-level scope
β @ ~/julia/Test_CUDA/test_CUDA.jl:5
β [21] include(mod::Module, _path::String)
β @ Base ./Base.jl:384
β [22] exec_options(opts::Base.JLOptions)
β @ Base ./client.jl:285
β [23] _start()
β @ Base ./client.jl:485
β @ CUDA ~/.julia/packages/CUDA/DfvRa/lib/cudadrv/CUDAdrv.jl:100
β Debug: Trying to use local installation...
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:210
β Debug: Looking for CUDA toolkit via environment variables CUDA_PATH, CUDA_HOME
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:271
β Debug: Looking for binary nvdisasm in /opt/software/nvidia/cuda-11.6u2
β all_locations =
β 2-element Vector{String}:
β "/opt/software/nvidia/cuda-11.6u2"
β "/opt/software/nvidia/cuda-11.6u2/bin"
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:147
β Debug: Found nvdisasm at /opt/software/nvidia/cuda-11.6u2/bin/nvdisasm
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:153
β Debug: Looking for library cudart, no specific version, in /opt/software/nvidia/cuda-11.6u2
β all_names =
β 1-element Vector{String}:
β "libcudart.so"
β all_locations =
β 4-element Vector{String}:
β "/opt/software/nvidia/cuda-11.6u2"
β "/opt/software/nvidia/cuda-11.6u2/lib"
β "/opt/software/nvidia/cuda-11.6u2/lib64"
β "/opt/software/nvidia/cuda-11.6u2/libx64"
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:119
β Debug: Found libcudart.so at /opt/software/nvidia/cuda-11.6u2/lib64
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:128
β Debug: Found local CUDA 11.6.0 at /opt/software/nvidia/cuda-11.6u2
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/bindeps.jl:232
β Debug: Looking for library cudart, no specific version, in /opt/software/nvidia/cuda-11.6u2
β all_names =
β 1-element Vector{String}:
β "libcudart.so"
β all_locations =
β 4-element Vector{String}:
β "/opt/software/nvidia/cuda-11.6u2"
β "/opt/software/nvidia/cuda-11.6u2/lib"
β "/opt/software/nvidia/cuda-11.6u2/lib64"
β "/opt/software/nvidia/cuda-11.6u2/libx64"
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:119
β Debug: Found libcudart.so at /opt/software/nvidia/cuda-11.6u2/lib64
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:128
β Debug: Looking for library cublasLt, no specific version, in /opt/software/nvidia/cuda-11.6u2
β all_names =
β 1-element Vector{String}:
β "libcublasLt.so"
β all_locations =
β 4-element Vector{String}:
β "/opt/software/nvidia/cuda-11.6u2"
β "/opt/software/nvidia/cuda-11.6u2/lib"
β "/opt/software/nvidia/cuda-11.6u2/lib64"
β "/opt/software/nvidia/cuda-11.6u2/libx64"
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:119
β Debug: Found libcublasLt.so at /opt/software/nvidia/cuda-11.6u2/lib64
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:128
β Debug: Looking for library cublas, no specific version, in /opt/software/nvidia/cuda-11.6u2
β all_names =
β 1-element Vector{String}:
β "libcublas.so"
β all_locations =
β 4-element Vector{String}:
β "/opt/software/nvidia/cuda-11.6u2"
β "/opt/software/nvidia/cuda-11.6u2/lib"
β "/opt/software/nvidia/cuda-11.6u2/lib64"
β "/opt/software/nvidia/cuda-11.6u2/libx64"
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:119
β Debug: Found libcublas.so at /opt/software/nvidia/cuda-11.6u2/lib64
β @ CUDA.Deps ~/.julia/packages/CUDA/DfvRa/deps/discovery.jl:128
β Debug: cuBLAS (v11.5) function cublasStatus_t cublasGetProperty(libraryPropertyType, int*) called:
β type: type=SOME TYPE; val=0
β value: type=int; val=POINTER (IN HEX:0x0x2b688951add0)
β Time: 2022-12-26T14:44:46 elapsed from start 0.016667 minutes or 1.000000 seconds
β Process=150087; Thread=47727540146624; GPU=0; Handle=POINTER (IN HEX:0x(nil))
β COMPILED WITH: GNU GCC/G++ / 6.3.1 20170216 (Red Hat 6.3.1-3)
β @ CUDA.CUBLAS ~/.julia/packages/CUDA/DfvRa/lib/cublas/CUBLAS.jl:222
β Debug: cuBLAS (v11.5) function cublasStatus_t cublasGetProperty(libraryPropertyType, int*) called:
β type: type=SOME TYPE; val=1
β value: type=int; val=POINTER (IN HEX:0x0x2b6889c20c10)
β Time: 2022-12-26T14:44:46 elapsed from start 0.016667 minutes or 1.000000 seconds
β Process=150087; Thread=47727540146624; GPU=0; Handle=POINTER (IN HEX:0x(nil))
β COMPILED WITH: GNU GCC/G++ / 6.3.1 20170216 (Red Hat 6.3.1-3)
β @ CUDA.CUBLAS ~/.julia/packages/CUDA/DfvRa/lib/cublas/CUBLAS.jl:222
β Debug: cuBLAS (v11.5) function cublasStatus_t cublasGetProperty(libraryPropertyType, int*) called:
β type: type=SOME TYPE; val=2
β value: type=int; val=POINTER (IN HEX:0x0x2b6889c20c20)
β Time: 2022-12-26T14:44:46 elapsed from start 0.016667 minutes or 1.000000 seconds
β Process=150087; Thread=47727540146624; GPU=0; Handle=POINTER (IN HEX:0x(nil))
β COMPILED WITH: GNU GCC/G++ / 6.3.1 20170216 (Red Hat 6.3.1-3)
β