Thanks @malead! Installing the master version of the CUDA packages worked on the login nodes out of the box without any patching [1]
Unfortunately, there comes up another problem, which is: Precompiling CUDAnative on login node (works on compute node) .
[1] Shell output of successful installation of the CUDA packages on the login node:
omlins@daint104:~> module use /scratch/snx3000/omlins/9_soft_install_julia/modules/all
omlins@daint104:~> module load julia/1.0.4
omlins@daint104:~> cd /scratch/snx3000/omlins/julia_cuda_test_install
omlins@daint104:/scratch/snx3000/omlins/julia_cuda_test_install> export JULIA_DEPOT_PATH=.
omlins@daint104:/scratch/snx3000/omlins/julia_cuda_test_install> julia
(v1.0.4-gpu) pkg> status
Status `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Project.toml`
(empty environment)
(v1.0.4-gpu) pkg> add CUDAdrv#master
Cloning default registries into /scratch/snx3000/omlins/julia_cuda_test_install/registries
Cloning registry General from "https://github.com/JuliaRegistries/General.git"
Cloning git-repo `https://github.com/JuliaGPU/CUDAdrv.jl.git`
Updating git-repo `https://github.com/JuliaGPU/CUDAdrv.jl.git`
Resolving package versions...
Installed CUDAapi โ v1.0.1
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Project.toml`
[c5f51814] + CUDAdrv v3.1.0 #master (https://github.com/JuliaGPU/CUDAdrv.jl.git)
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Manifest.toml`
[3895d2a7] + CUDAapi v1.0.1
[c5f51814] + CUDAdrv v3.1.0 #master (https://github.com/JuliaGPU/CUDAdrv.jl.git)
[8f399da3] + Libdl
[56ddb016] + Logging
[de0858da] + Printf
[4ec0a83e] + Unicode
(v1.0.4-gpu) pkg> add CUDAnative#master
Cloning git-repo `https://github.com/JuliaGPU/CUDAnative.jl.git`
Updating git-repo `https://github.com/JuliaGPU/CUDAnative.jl.git`
Resolving package versions...
Installed Adapt โโโโโโโโโโโโโโ v1.0.0
Installed CEnum โโโโโโโโโโโโโโ v0.2.0
Installed TimerOutputs โโโโโโโ v0.5.0
Installed Crayons โโโโโโโโโโโโ v4.0.0
Installed OrderedCollections โ v1.1.0
Installed DataStructures โโโโโ v0.17.0
Installed LLVM โโโโโโโโโโโโโโโ v1.2.0
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Project.toml`
[be33ccc6] + CUDAnative v2.2.1 #master (https://github.com/JuliaGPU/CUDAnative.jl.git)
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Manifest.toml`
[79e6a3ab] + Adapt v1.0.0
[fa961155] + CEnum v0.2.0
[be33ccc6] + CUDAnative v2.2.1 #master (https://github.com/JuliaGPU/CUDAnative.jl.git)
[a8cc5b0e] + Crayons v4.0.0
[864edb3b] + DataStructures v0.17.0
[929cbde3] + LLVM v1.2.0
[bac558e1] + OrderedCollections v1.1.0
[a759f4b9] + TimerOutputs v0.5.0
[2a0f44e3] + Base64
[8ba89e20] + Distributed
[b77e0a4c] + InteractiveUtils
[37e2e46d] + LinearAlgebra
[d6f4376e] + Markdown
[9a3f8284] + Random
[9e88b42a] + Serialization
[6462fe0b] + Sockets
[8dfed614] + Test
(v1.0.4-gpu) pkg> add CuArrays#master
Cloning git-repo `https://github.com/JuliaGPU/CuArrays.jl.git`
Updating git-repo `https://github.com/JuliaGPU/CuArrays.jl.git`
Resolving package versions...
Installed Reexport โโโโโโโ v0.2.0
Installed Requires โโโโโโโ v0.5.2
Installed VersionParsing โ v1.1.3
Installed NNlib โโโโโโโโโโ v0.6.0
Installed Tokenize โโโโโโโ v0.5.6
Installed FFTW โโโโโโโโโโโ v0.3.0
Installed GPUArrays โโโโโโ v1.0.0
Installed StaticArrays โโโ v0.11.0
Installed BinaryProvider โ v0.5.6
Installed AbstractFFTs โโโ v0.4.1
Installed Compat โโโโโโโโโ v2.1.0
Installed JSON โโโโโโโโโโโ v0.21.0
Installed FillArrays โโโโโ v0.6.4
Installed CSTParser โโโโโโ v0.6.2
Installed Conda โโโโโโโโโโ v1.3.0
Installed Parsers โโโโโโโโ v0.3.6
Installed MacroTools โโโโโ v0.5.1
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Project.toml`
[3a865a2d] + CuArrays v1.1.0 #master (https://github.com/JuliaGPU/CuArrays.jl.git)
Updating `/scratch/snx3000/omlins/julia_cuda_test_install/environments/v1.0.4-gpu/Manifest.toml`
[621f4979] + AbstractFFTs v0.4.1
[b99e7846] + BinaryProvider v0.5.6
[00ebfdb7] + CSTParser v0.6.2
[34da2185] + Compat v2.1.0
[8f4d0f93] + Conda v1.3.0
[3a865a2d] + CuArrays v1.1.0 #master (https://github.com/JuliaGPU/CuArrays.jl.git)
[7a1cc6ca] + FFTW v0.3.0
[1a297f60] + FillArrays v0.6.4
[0c68f7d7] + GPUArrays v1.0.0
[682c06a0] + JSON v0.21.0
[1914dd2f] + MacroTools v0.5.1
[872c559c] + NNlib v0.6.0
[69de0a69] + Parsers v0.3.6
[189a3867] + Reexport v0.2.0
[ae029012] + Requires v0.5.2
[90137ffa] + StaticArrays v0.11.0
[0796e94c] + Tokenize v0.5.6
[81def892] + VersionParsing v1.1.3
[ade2ca70] + Dates
[8bb1440f] + DelimitedFiles
[76f85450] + LibGit2
[a63ad114] + Mmap
[44cfe95a] + Pkg
[3fa0cd96] + REPL
[ea8e919c] + SHA
[1a1011a3] + SharedArrays
[2f01184e] + SparseArrays
[10745b16] + Statistics
[cf7118a7] + UUIDs
Building Conda โ `/scratch/snx3000/omlins/julia_cuda_test_install/packages/Conda/kLXeC/deps/build.log`
Building FFTW โโ `/scratch/snx3000/omlins/julia_cuda_test_install/packages/FFTW/2okGQ/deps/build.log`