Error while trying to test Flux

Hi everyone,

I am new to Julia programming language. I installed Julia 1.3.1 and was trying to install flux. The installation was complete. But, when I try to test it using the command:

Pkg.test(“Flux”)

This is my Test summary:

Test Summary:                         | Pass  Error  Total
Flux                                  |  279      1    280
  Throttle                            |   11            11
  Initialization                      |   16            16
  Params                              |    4             4
  Basic Stacking                      |    1             1
  Precision                           |    5             5
  Stacking                            |    3             3
  onecold                             |    4             4
  onehotbatch indexing                |    2             2
  Optimise                            |   12            12
  Optimiser                           |    3             3
  Training Loop                       |    2             2
  ExpDecay                            |    3             3
  basic                               |   29            29
  Dropout                             |   10            10
  BatchNorm                           |   12            12
  InstanceNorm                        |   13            13
  GroupNorm                           |   13            13
  losses                              |   24            24
  Pooling                             |    2             2
  CNN                                 |    1             1
  asymmetric padding                  |    7             7
  Depthwise Conv                      |    3             3
  ConvTranspose                       |    2             2
  CrossCor                            |    4             4
  Conv with non quadratic window #700 |    4             4
  CuArrays                            |   13      1     14
  onecold gpu                         |    2             2
  CUDNN BatchNorm                     |    8             8
  R = RNN                             |    1             1
  R = GRU                             |    1             1
  R = LSTM                            |    1             1
  RNN                                 |   50            50
  Doctests: Flux                      |    1             1
ERROR: LoadError: Some tests did not pass: 279 passed, 0 failed, 1 errored, 0 broken.

And I’m getting the following error:

ERROR: Package Flux errored during testing
Stacktrace:
 [1] pkgerror(::String, ::Vararg{String,N} where N) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/Types.jl:113
 [2] #test#131(::Bool, ::Nothing, ::Cmd, ::Cmd, ::typeof(Pkg.Operations.test), ::Pkg.Types.Context, ::Array{Pkg.Types.PackageSpec,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/Operations.jl:1372
 [3] #test at ./none:0 [inlined]
 [4] #test#62(::Bool, ::Nothing, ::Cmd, ::Cmd, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Pkg.API.test), ::Pkg.Types.Context, ::Array{Pkg.Types.PackageSpec,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:253
 [5] test(::Pkg.Types.Context, ::Array{Pkg.Types.PackageSpec,1}) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:239
 [6] #test#61 at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:233 [inlined]
 [7] test at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:233 [inlined]
 [8] #test#60 at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:232 [inlined]
 [9] test at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:232 [inlined]
 [10] #test#59(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Pkg.API.test), ::String) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:231
 [11] test(::String) at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Pkg/src/API.jl:231
 [12] top-level scope at REPL[8]:1

Can someone tell me what is the actual problem and how to solve this. Thanking you in advance.

This error relates to arrays stored on the GPU. Other GPU related functionality seems to work fine, so your drivers are probably OK. Could you provide more context (this was the only error message you got)? I have no access to a GPU right now, but if you are interested in resolving this, you could try running the tests included in test/cuda/cuda.jl individually, see which one fails, and maybe open an issue for Flux.

When I tried doing

pkg> add CUDA

It shows:

 Updating registry at `~/.julia/registries/General`
  Updating git-repo `https://github.com/JuliaRegistries/General.git`
ERROR: The following package names could not be resolved:
 * CUDA (not found in project, manifest or registry)
Please specify by known `name=uuid`.

Next, when I did

pkg> add CUDAdrv CUDAnative CuArrays

it shows:

 Resolving package versions...
  Updating `~/.julia/environments/v1.3/Project.toml`
  [c5f51814] + CUDAdrv v6.0.0
  Updating `~/.julia/environments/v1.3/Manifest.toml`
 [no changes]

I did testing using

pkg> test CUDAdrv CUDAnative CuArrays

It shows the following:

Testing CUDAdrv
 Resolving package versions...
    Status `/tmp/jl_FNgZkh/Manifest.toml`
  [fa961155] CEnum v0.2.0
  [3895d2a7] CUDAapi v3.1.0
  [c5f51814] CUDAdrv v6.0.0
  [2a0f44e3] Base64  [`@stdlib/Base64`]
  [8ba89e20] Distributed  [`@stdlib/Distributed`]
  [b77e0a4c] InteractiveUtils  [`@stdlib/InteractiveUtils`]
  [8f399da3] Libdl  [`@stdlib/Libdl`]
  [56ddb016] Logging  [`@stdlib/Logging`]
  [d6f4376e] Markdown  [`@stdlib/Markdown`]
  [de0858da] Printf  [`@stdlib/Printf`]
  [9a3f8284] Random  [`@stdlib/Random`]
  [9e88b42a] Serialization  [`@stdlib/Serialization`]
  [6462fe0b] Sockets  [`@stdlib/Sockets`]
  [8dfed614] Test  [`@stdlib/Test`]
  [4ec0a83e] Unicode  [`@stdlib/Unicode`]
[ Info: Testing using device Quadro K2100M (compute capability 3.0.0, 1.376 GiB available memory) on CUDA driver 10.2.0
limits: Error During Test at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/context.jl:103
  Got exception outside of a @test
  CUDA error: limit is not supported on this architecture (code 215, ERROR_UNSUPPORTED_LIMIT)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuCtxGetLimit at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:166 [inlined]
   [4] limit(::CUDAdrv.CUlimit_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/context.jl:199
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/context.jl:105
   [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/context.jl:105
   [8] include at ./boot.jl:328 [inlined]
   [9] include_relative(::Module, ::String) at ./loading.jl:1105
   [10] include(::Module, ::String) at ./Base.jl:31
   [11] include(::String) at ./client.jl:424
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/runtests.jl:41
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/runtests.jl:38
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/runtests.jl:7
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at none:6
   [22] eval(::Module, ::Any) at ./boot.jl:330
   [23] exec_options(::Base.JLOptions) at ./client.jl:263
   [24] _start() at ./client.jl:460
  
┌ Warning: Calling CUDAdrv.@profile only informs an external profiler to start.
│ The user is responsible for launching Julia under a CUDA profiler like `nvprof`.
│ 
│ For improved usability, launch Julia under the Nsight Systems profiler:
│ $ nsys launch -t cuda,cublas,cudnn,nvtx julia
└ @ CUDAdrv.Profile ~/.julia/packages/CUDAdrv/b1mvw/src/profile.jl:42
Test Summary:       | Pass  Error  Total
CUDAdrv             |  133      1    134
  pointer           |    7             7
  API wrappers      |  118      1    119
    errors          |    5             5
    version         |    3             3
    devices         |    1             1
    context         |   11            11
    primary context |   20            20
    cache config    |    1             1
    shmem config    |    1             1
    limits          |           1      1
    module          |   12            12
    memory          |   39            39
    stream          |    3             3
    execution       |   15            15
    events          |    6             6
    profile         |              No tests
    occupancy       |    1             1
  examples          |    1             1
ERROR: LoadError: Some tests did not pass: 133 passed, 0 failed, 1 errored, 0 broken.
in expression starting at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/test/runtests.jl:5
   Testing CUDAnative
 Resolving package versions...
    Status `/tmp/jl_AydM7s/Manifest.toml`
  [621f4979] AbstractFFTs v0.5.0
  [79e6a3ab] Adapt v1.0.1
  [b99e7846] BinaryProvider v0.5.8
  [fa961155] CEnum v0.2.0
  [3895d2a7] CUDAapi v3.1.0
  [c5f51814] CUDAdrv v6.0.0
  [be33ccc6] CUDAnative v2.10.2
  [3a865a2d] CuArrays v1.7.2
  [864edb3b] DataStructures v0.17.9
  [0c68f7d7] GPUArrays v2.0.1
  [929cbde3] LLVM v1.3.3
  [1914dd2f] MacroTools v0.5.4
  [872c559c] NNlib v0.6.4
  [bac558e1] OrderedCollections v1.1.0
  [ae029012] Requires v1.0.1
  [a759f4b9] TimerOutputs v0.5.3
  [2a0f44e3] Base64  [`@stdlib/Base64`]
  [8ba89e20] Distributed  [`@stdlib/Distributed`]
  [b77e0a4c] InteractiveUtils  [`@stdlib/InteractiveUtils`]
  [8f399da3] Libdl  [`@stdlib/Libdl`]
  [37e2e46d] LinearAlgebra  [`@stdlib/LinearAlgebra`]
  [56ddb016] Logging  [`@stdlib/Logging`]
  [d6f4376e] Markdown  [`@stdlib/Markdown`]
  [de0858da] Printf  [`@stdlib/Printf`]
  [9a3f8284] Random  [`@stdlib/Random`]
  [ea8e919c] SHA  [`@stdlib/SHA`]
  [9e88b42a] Serialization  [`@stdlib/Serialization`]
  [6462fe0b] Sockets  [`@stdlib/Sockets`]
  [2f01184e] SparseArrays  [`@stdlib/SparseArrays`]
  [10745b16] Statistics  [`@stdlib/Statistics`]
  [8dfed614] Test  [`@stdlib/Test`]
  [cf7118a7] UUIDs  [`@stdlib/UUIDs`]
  [4ec0a83e] Unicode  [`@stdlib/Unicode`]
┌ Warning: You are using CUDNN 7.6.5 for CUDA 10.2.0 with CUDA toolkit 10.1.243; these might be incompatible.
└ @ CuArrays ~/.julia/packages/CuArrays/HE8G6/src/CuArrays.jl:127
[ Info: Testing using device Quadro K2100M (compute capability 3.0.0, 1.358 GiB available memory) on CUDA driver 10.2.0 and toolkit 10.1.243
basic reflection: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:109
  Test threw exception
  Expression: CUDAnative.code_sass(devnull, valid_kernel, Tuple{}) == nothing
  CUPTIError: user doesn't have sufficient privileges which are required to start the profiling session (code 35, CUPTI_ERROR_INSUFFICIENT_PRIVILEGES)
  Stacktrace:
   [1] throw_api_error(::CUDAnative.CUPTI.CUptiResult) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:117
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:130 [inlined]
   [3] cuptiSubscribe at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/libcupti.jl:197 [inlined]
   [4] #code_sass#215(::Bool, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::CUDAnative.CompilerJob) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:138
   [5] #code_sass at ./none:0 [inlined]
   [6] #code_sass#214(::VersionNumber, ::Bool, ::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:123
   [7] code_sass(::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:121
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:109
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:106
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:105
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:5
  
function name mangling: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:113
  Got exception outside of a @test
  CUPTIError: user doesn't have sufficient privileges which are required to start the profiling session (code 35, CUPTI_ERROR_INSUFFICIENT_PRIVILEGES)
  Stacktrace:
   [1] throw_api_error(::CUDAnative.CUPTI.CUptiResult) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:117
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:130 [inlined]
   [3] cuptiSubscribe at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/libcupti.jl:197 [inlined]
   [4] #code_sass#215(::Bool, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::CUDAnative.CompilerJob) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:138
   [5] #code_sass at ./none:0 [inlined]
   [6] #code_sass#214(::VersionNumber, ::Bool, ::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:123
   [7] code_sass(::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:121
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:121
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:114
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:105
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/codegen.jl:5
   [15] include at ./boot.jl:328 [inlined]
   [16] include_relative(::Module, ::String) at ./loading.jl:1105
   [17] include(::Module, ::String) at ./Base.jl:31
   [18] include(::String) at ./client.jl:424
   [19] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:92
   [20] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [22] include at ./boot.jl:328 [inlined]
   [23] include_relative(::Module, ::String) at ./loading.jl:1105
   [24] include(::Module, ::String) at ./Base.jl:31
   [25] include(::String) at ./client.jl:424
   [26] top-level scope at none:6
   [27] eval(::Module, ::Any) at ./boot.jl:330
   [28] exec_options(::Base.JLOptions) at ./client.jl:263
   [29] _start() at ./client.jl:460
  
reflection: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:51
  Got exception outside of a @test
  CUPTIError: CUPTI is unable to initialize its connection to the CUDA driver (code 15, CUPTI_ERROR_NOT_INITIALIZED)
  Stacktrace:
   [1] throw_api_error(::CUDAnative.CUPTI.CUptiResult) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:117
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/error.jl:130 [inlined]
   [3] cuptiSubscribe at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/cupti/libcupti.jl:197 [inlined]
   [4] #code_sass#215(::Bool, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::CUDAnative.CompilerJob) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:138
   [5] #code_sass at ./none:0 [inlined]
   [6] #code_sass#214(::VersionNumber, ::Bool, ::Bool, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(CUDAnative.code_sass), ::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:123
   [7] code_sass(::Base.DevNull, ::Any, ::Any) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:121
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:57
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:52
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:9
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [15] include at ./boot.jl:328 [inlined]
   [16] include_relative(::Module, ::String) at ./loading.jl:1105
   [17] include(::Module, ::String) at ./Base.jl:31
   [18] include(::String) at ./client.jl:424
   [19] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [20] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [22] include at ./boot.jl:328 [inlined]
   [23] include_relative(::Module, ::String) at ./loading.jl:1105
   [24] include(::Module, ::String) at ./Base.jl:31
   [25] include(::String) at ./client.jl:424
   [26] top-level scope at none:6
   [27] eval(::Module, ::Any) at ./boot.jl:330
   [28] exec_options(::Base.JLOptions) at ./client.jl:263
   [29] _start() at ./client.jl:460
  
stack traces at different debug levels: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:548
  Expression: occursin("ERROR: a exception was thrown during kernel execution", out)
   Evaluated: occursin("ERROR: a exception was thrown during kernel execution", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:548
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:526
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
stack traces at different debug levels: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:549
  Expression: occursin("Run Julia on debug level 2 for device stack traces", out)
   Evaluated: occursin("Run Julia on debug level 2 for device stack traces", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:549
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:526
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
stack traces at different debug levels: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:555
  Expression: occursin("ERROR: a exception was thrown during kernel execution", out)
   Evaluated: occursin("ERROR: a exception was thrown during kernel execution", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:555
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:526
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
stack traces at different debug levels: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:559
  Expression: occursin("[1] Int64 at float.jl", out)
   Evaluated: occursin("[1] Int64 at float.jl", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:559
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:526
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
stack traces at different debug levels: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:561
  Expression: occursin("[2] kernel at none:2", out)
   Evaluated: occursin("[2] kernel at none:2", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:561
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:526
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
#329: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:582
  Expression: occursin("ERROR: a exception was thrown during kernel execution", out)
   Evaluated: occursin("ERROR: a exception was thrown during kernel execution", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:582
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:568
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
#329: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:583
  Expression: occursin("foo at none:1", out)
   Evaluated: occursin("foo at none:1", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:583
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:568
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
#329: Test Failed at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:584
  Expression: occursin("bar at none:2", out)
   Evaluated: occursin("bar at none:2", "")
Stacktrace:
 [1] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:584
 [2] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [3] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:568
 [4] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:524
 [6] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
 [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
basic usage: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#hello#333", ::Type{Tuple{}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:847 [inlined]
   [14] (::var"#233#336"{var"#hello#333"})() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#233#336"{var"#hello#333"}, ::IOStream) at ./stream.jl:1152
   [16] #232 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#232#335"{var"#hello#333"}, ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #231 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#231#334"{var"#hello#333"}, ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:846
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:835
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  
`


Since there is limit in words, I will post the results in multiple replies.
anonymous functions: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:853
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#1355#hello#337", ::Type{Tuple{}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:862 [inlined]
   [14] (::var"#237#341"{var"#1355#hello#337"})() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#237#341"{var"#1355#hello#337"}, ::IOStream) at ./stream.jl:1152
   [16] #236 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#236#340"{var"#1355#hello#337"}, ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #235 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#235#339"{var"#1355#hello#337"}, ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:861
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:854
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  
closures: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:869
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#1357#hello#342", ::Type{Tuple{}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:879 [inlined]
   [14] (::var"#241#346"{var"#1357#hello#342"})() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#241#346"{var"#1357#hello#342"}, ::IOStream) at ./stream.jl:1152
   [16] #240 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#240#345"{var"#1357#hello#342"}, ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #239 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#239#344"{var"#1357#hello#342"}, ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:878
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:870
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  
argument passing: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:886
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#1359#kernel#347", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [11] top-level scope at gcutils.jl:91
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [13] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:918
   [14] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [15] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:889
   [16] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [17] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [18] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [19] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [20] include at ./boot.jl:328 [inlined]
   [21] include_relative(::Module, ::String) at ./loading.jl:1105
   [22] include(::Module, ::String) at ./Base.jl:31
   [23] include(::String) at ./client.jl:424
   [24] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [25] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [26] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [27] include at ./boot.jl:328 [inlined]
   [28] include_relative(::Module, ::String) at ./loading.jl:1105
   [29] include(::Module, ::String) at ./Base.jl:31
   [30] include(::String) at ./client.jl:424
   [31] top-level scope at none:6
   [32] eval(::Module, ::Any) at ./boot.jl:330
   [33] exec_options(::Base.JLOptions) at ./client.jl:263
   [34] _start() at ./client.jl:460
  
self-recursion: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:923
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::typeof(kernel), ::Type{Tuple{Bool}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:935 [inlined]
   [14] (::var"#247#354")() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#247#354", ::IOStream) at ./stream.jl:1152
   [16] #246 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#246#353", ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #245 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#245#352", ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:934
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:924
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  

deep recursion: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:941
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::typeof(kernel_a), ::Type{Tuple{Bool}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:966 [inlined]
   [14] (::var"#250#357")() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#250#357", ::IOStream) at ./stream.jl:1152
   [16] #249 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#249#356", ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #248 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#248#355", ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:965
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:942
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  
streams: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:972
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:144 [inlined]
   [3] cuLinkAddData_v2(::CuLink, ::CUDAdrv.CUjitInputType_enum, ::Ptr{Int8}, ::Int64, ::String, ::Int64, ::Ptr{Nothing}, ::Ptr{Nothing}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/libcuda.jl:281
   [4] add_data!(::CuLink, ::String, ::String) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:67
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:417 [inlined]
   [6] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [8] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#1366#hello#358", ::Type{Tuple{}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [9] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [10] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [11] macro expansion at ./gcutils.jl:91 [inlined]
   [12] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [13] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:987 [inlined]
   [14] (::var"#253#361"{var"#1366#hello#358"})() at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:41
   [15] redirect_stdout(::var"#253#361"{var"#1366#hello#358"}, ::IOStream) at ./stream.jl:1152
   [16] #252 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:40 [inlined]
   [17] #open#271(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(open), ::var"#252#360"{var"#1366#hello#358"}, ::String, ::Vararg{String,N} where N) at ./io.jl:298
   [18] open at ./io.jl:296 [inlined]
   [19] #251 at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:39 [inlined]
   [20] mktemp(::var"#251#359"{var"#1366#hello#358"}, ::String) at ./file.jl:611
   [21] mktemp(::Function) at ./file.jl:609
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/util.jl:37
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:986
   [24] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [25] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:973
   [26] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [27] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:834
   [28] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [29] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/execution.jl:5
   [30] include at ./boot.jl:328 [inlined]
   [31] include_relative(::Module, ::String) at ./loading.jl:1105
   [32] include(::Module, ::String) at ./Base.jl:31
   [33] include(::String) at ./client.jl:424
   [34] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:93
   [35] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [36] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [37] include at ./boot.jl:328 [inlined]
   [38] include_relative(::Module, ::String) at ./loading.jl:1105
   [39] include(::Module, ::String) at ./Base.jl:31
   [40] include(::String) at ./client.jl:424
   [41] top-level scope at none:6
   [42] eval(::Module, ::Any) at ./boot.jl:330
   [43] exec_options(::Base.JLOptions) at ./client.jl:263
   [44] _start() at ./client.jl:460
  
indexing: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/pointer.jl:33
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 30; error   : Instruction 'ld.nc' requires .target sm_32 or higher
  ptxas application ptx input, line 35; error   : Instruction 'ld.nc' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#kernel#366", ::Type{Tuple{CUDAnative.DevicePtr{Complex{Int8},CUDAnative.AS.Global},CUDAnative.DevicePtr{Complex{Int8},CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/pointer.jl:44
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/pointer.jl:34
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/pointer.jl:5
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/pointer.jl:3
   [15] include at ./boot.jl:328 [inlined]
   [16] include_relative(::Module, ::String) at ./loading.jl:1105
   [17] include(::Module, ::String) at ./Base.jl:31
   [18] include(::String) at ./client.jl:424
   [19] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:94
   [20] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [22] include at ./boot.jl:328 [inlined]
   [23] include_relative(::Module, ::String) at ./loading.jl:1105
   [24] include(::Module, ::String) at ./Base.jl:31
   [25] include(::String) at ./client.jl:424
   [26] top-level scope at none:6
   [27] eval(::Module, ::Any) at ./boot.jl:330
   [28] exec_options(::Base.JLOptions) at ./client.jl:263
   [29] _start() at ./client.jl:460
  
ldg: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/array.jl:144
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 129; error   : Instruction 'ld.nc' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#2459#kernel#374", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global},Int64}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/reflection.jl:196
   [9] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/array.jl:154
   [10] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [11] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/array.jl:145
   [12] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [13] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/array.jl:3
   [14] include at ./boot.jl:328 [inlined]
   [15] include_relative(::Module, ::String) at ./loading.jl:1105
   [16] include(::Module, ::String) at ./Base.jl:31
   [17] include(::String) at ./client.jl:424
   [18] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:95
   [19] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [20] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [21] include at ./boot.jl:328 [inlined]
   [22] include_relative(::Module, ::String) at ./loading.jl:1105
   [23] include(::Module, ::String) at ./Base.jl:31
   [24] include(::String) at ./client.jl:424
   [25] top-level scope at none:6
   [26] eval(::Module, ::Any) at ./boot.jl:330
   [27] exec_options(::Base.JLOptions) at ./client.jl:263
   [28] _start() at ./client.jl:460
  
libcudadevrt: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:907
  Got exception outside of a @test
  CUDA error: unknown error (code 999, ERROR_UNKNOWN)
  Stacktrace:
   [1] throw_api_error(::CUDAdrv.cudaError_enum) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/error.jl:131
   [2] complete(::CuLink) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module/linker.jl:119
   [3] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:418 [inlined]
   [4] macro expansion at /home/g2-test/.julia/packages/TimerOutputs/7Id5J/src/TimerOutput.jl:214 [inlined]
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:414 [inlined]
   [6] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3370#kernel#607", ::Type{Tuple{}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [7] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [9] top-level scope at gcutils.jl:91
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [11] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:909
   [12] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [13] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:908
   [14] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [15] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [16] include at ./boot.jl:328 [inlined]
   [17] include_relative(::Module, ::String) at ./loading.jl:1105
   [18] include(::Module, ::String) at ./Base.jl:31
   [19] include(::String) at ./client.jl:424
   [20] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [21] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [22] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [23] include at ./boot.jl:328 [inlined]
   [24] include_relative(::Module, ::String) at ./loading.jl:1105
   [25] include(::Module, ::String) at ./Base.jl:31
   [26] include(::String) at ./client.jl:424
   [27] top-level scope at none:6
   [28] eval(::Module, ::Any) at ./boot.jl:330
   [29] exec_options(::Base.JLOptions) at ./client.jl:263
   [30] _start() at ./client.jl:460

T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 78; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3389#kernel#613", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},Type{Int64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1007
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 176; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3389#kernel#613", ::Type{Tuple{CuDeviceArray{UInt64,1,CUDAnative.AS.Global},Type{UInt64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1007
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:993
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 134; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3393#kernel#614", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},Type{Int64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1027
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 133; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3393#kernel#614", ::Type{Tuple{CuDeviceArray{UInt64,1,CUDAnative.AS.Global},Type{UInt64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1027
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1013
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 134; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3397#kernel#615", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},Type{Int64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1047
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
 
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 133; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3397#kernel#615", ::Type{Tuple{CuDeviceArray{UInt64,1,CUDAnative.AS.Global},Type{UInt64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1047
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1033
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  

T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1074
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 32; error   : Instruction 'atom.max.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3405#kernel#617", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},Type{Int64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1083
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1074
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1071
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1074
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 33; error   : Instruction 'atom.max.u64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3405#kernel#617", ::Type{Tuple{CuDeviceArray{UInt64,1,CUDAnative.AS.Global},Type{UInt64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1083
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1074
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1071
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1092
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 32; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3409#kernel#618", ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},Type{Int64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1101
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1092
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1089
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1092
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 33; error   : Instruction 'atom.min.u64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3409#kernel#618", ::Type{Tuple{CuDeviceArray{UInt64,1,CUDAnative.AS.Global},Type{UInt64}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1101
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1092
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1089
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:916
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 47; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 51; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3425#kernel#623", ::Type{Tuple{Type{Int64},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1187
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 47; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 51; error   : Instruction 'atom.and.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3425#kernel#623", ::Type{Tuple{Type{UInt64},CuDeviceArray{UInt64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1187
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1176
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 38; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 42; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3428#kernel#624", ::Type{Tuple{Type{Int64},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1205
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 38; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 42; error   : Instruction 'atom.or.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3428#kernel#624", ::Type{Tuple{Type{UInt64},CuDeviceArray{UInt64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1205
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1194
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  

T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 36; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 40; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3431#kernel#625", ::Type{Tuple{Type{Int64},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1224
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 36; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas application ptx input, line 40; error   : Instruction 'atom.xor.b64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3431#kernel#625", ::Type{Tuple{Type{UInt64},CuDeviceArray{UInt64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1224
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1212
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 32; error   : Instruction 'atom.max.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3434#kernel#626", ::Type{Tuple{Type{Int64},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1240
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 33; error   : Instruction 'atom.max.u64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3434#kernel#626", ::Type{Tuple{Type{UInt64},CuDeviceArray{UInt64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1240
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1231
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = Int64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 32; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3437#kernel#627", ::Type{Tuple{Type{Int64},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1255
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
T = UInt64: Error During Test at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
  Got exception outside of a @test
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 33; error   : Instruction 'atom.min.u64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::var"#3437#kernel#627", ::Type{Tuple{Type{UInt64},CuDeviceArray{UInt64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at gcutils.jl:91
   [7] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176
   [8] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1255
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1180
   [10] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
   [11] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [12] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1246
   [13] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [14] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:1142
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/device/cuda.jl:5
   [17] include at ./boot.jl:328 [inlined]
   [18] include_relative(::Module, ::String) at ./loading.jl:1105
   [19] include(::Module, ::String) at ./Base.jl:31
   [20] include(::String) at ./client.jl:424
   [21] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:96
   [22] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [23] top-level scope at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:10
   [24] include at ./boot.jl:328 [inlined]
   [25] include_relative(::Module, ::String) at ./loading.jl:1105
   [26] include(::Module, ::String) at ./Base.jl:31
   [27] include(::String) at ./client.jl:424
   [28] top-level scope at none:6
   [29] eval(::Module, ::Any) at ./boot.jl:330
   [30] exec_options(::Base.JLOptions) at ./client.jl:263
   [31] _start() at ./client.jl:460
  
Test Summary:                                  | Pass  Fail  Error  Total
CUDAnative                                     |  452     8     33    493
  base interface                               |                    No tests
  pointer                                      |   20                  20
  code generation                              |   92                  92
  code generation (relying on a device)        |    7            2      9
    LLVM                                       |    5                   5
    SASS                                       |    2            2      4
      basic reflection                         |    1            1      2
      function name mangling                   |    1            1      2
  execution                                    |   51     8      8     67
    @cuda                                      |    3            1      4
      low-level interface                      |                    No tests
      launch configuration                     |                    No tests
      compilation params                       |    1                   1
      reflection                               |                 1      1
      shared memory                            |                    No tests
      streams                                  |                    No tests
      external kernels                         |                    No tests
      calling device function                  |                    No tests
    argument passing                           |   28                  28
    exceptions                                 |    9     8            17
      stack traces at different debug levels   |    7     5            12
      #329                                     |    2     3             5
    shmem divergence bug                       |    7                   7
    dynamic parallelism                        |    4            7     11
      basic usage                              |                 1      1
      anonymous functions                      |                 1      1
      closures                                 |                 1      1
      argument passing                         |    4            1      5
      self-recursion                           |                 1      1
      deep recursion                           |                 1      1
      streams                                  |                 1      1
  pointer                                      |   40            1     41
    unsafe_load & unsafe_store!                |   40            1     41
      T = Int8, cached = false                 |    2                   2
      T = Int8, cached = true                  |    2                   2
      T = UInt16, cached = false               |    2                   2
      T = UInt16, cached = true                |    2                   2
      T = Int32, cached = false                |    2                   2
      T = Int32, cached = true                 |    2                   2
      T = UInt32, cached = false               |    2                   2
      T = UInt32, cached = true                |    2                   2
      T = Int64, cached = false                |    2                   2
      T = Int64, cached = true                 |    2                   2
      T = UInt64, cached = false               |    2                   2
      T = UInt64, cached = true                |    2                   2
      T = Int128, cached = false               |    2                   2
      T = Int128, cached = true                |    2                   2
      T = Float32, cached = false              |    2                   2
      T = Float32, cached = true               |    2                   2
      T = Float64, cached = false              |    2                   2
      T = Float64, cached = true               |    2                   2
      T = LoadableStruct, cached = false       |    2                   2
      T = LoadableStruct, cached = true        |    2                   2
      indexing                                 |                 1      1
  device arrays                                |   18            1     19
    constructors                               |   14                  14
    basics                                     |    1                   1
    iteration                                  |    1                   1
    bounds checking                            |    1                   1
    views                                      |    1                   1
    non-Int index to unsafe_load               |                    No tests
    ldg                                        |                 1      1
  CUDA functionality                           |  209           21    230
    indexing                                   |    1                   1
    math                                       |   71                  71
    formatted output                           |    6                   6
    @cuprint                                   |   25                  25
    assertion                                  |                    No tests
    shared memory                              |   14                  14
    data movement and conversion               |    6                   6
    clock and nanosleep                        |                    No tests
    parallel synchronization and communication |   16                  16
    libcudadevrt                               |                 1      1
    atomics (low-level)                        |   25           10     35
      atomic_add                               |    5                   5
      atomic_sub                               |    4                   4
      atomic_inc                               |    1                   1
      atomic_dec                               |    1                   1
      atomic_xchg                              |    4                   4
      atomic_and                               |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      atomic_or                                |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      atomic_xor                               |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      atomic_max                               |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      atomic_min                               |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
    atomics (high-level)                       |   45           10     55
      add                                      |    5                   5
      sub                                      |    4                   4
      and                                      |    4            2      6
        T = Int32                              |    2                   2
        T = Int64                              |                 1      1
        T = UInt32                             |    2                   2
        T = UInt64                             |                 1      1
      or                                       |    4            2      6
        T = Int32                              |    2                   2
        T = Int64                              |                 1      1
        T = UInt32                             |    2                   2
        T = UInt64                             |                 1      1
      xor                                      |    4            2      6
        T = Int32                              |    2                   2
        T = Int64                              |                 1      1
        T = UInt32                             |    2                   2
        T = UInt64                             |                 1      1
      max                                      |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      min                                      |    2            2      4
        T = Int32                              |    1                   1
        T = Int64                              |                 1      1
        T = UInt32                             |    1                   1
        T = UInt64                             |                 1      1
      macro                                    |   20                  20
  examples                                     |    8                   8
ERROR: LoadError: Some tests did not pass: 452 passed, 8 failed, 33 errored, 0 broken.
in expression starting at /home/g2-test/.julia/packages/CUDAnative/hfulr/test/runtests.jl:8
   Testing CuArrays
 Resolving package versions...
    Status `/tmp/jl_BiORdL/Manifest.toml`
  [621f4979] AbstractFFTs v0.5.0
  [79e6a3ab] Adapt v1.0.1
  [b99e7846] BinaryProvider v0.5.8
  [fa961155] CEnum v0.2.0
  [3895d2a7] CUDAapi v3.1.0
  [c5f51814] CUDAdrv v6.0.0
  [be33ccc6] CUDAnative v2.10.2
  [bbf7d656] CommonSubexpressions v0.2.0
  [3a865a2d] CuArrays v1.7.2
  [864edb3b] DataStructures v0.17.9
  [163ba53b] DiffResults v1.0.2
  [b552c78f] DiffRules v1.0.1
  [7a1cc6ca] FFTW v1.2.0
  [f5851436] FFTW_jll v3.3.9+3
  [1a297f60] FillArrays v0.8.4
  [f6369f11] ForwardDiff v0.10.9
  [0c68f7d7] GPUArrays v2.0.1
  [1d5cc7b8] IntelOpenMP_jll v2018.0.3+0
  [929cbde3] LLVM v1.3.3
  [856f044c] MKL_jll v2019.0.117+2
  [1914dd2f] MacroTools v0.5.4
  [872c559c] NNlib v0.6.4
  [77ba4419] NaNMath v0.3.3
  [efe28fd5] OpenSpecFun_jll v0.5.3+1
  [bac558e1] OrderedCollections v1.1.0
  [189a3867] Reexport v0.2.0
  [ae029012] Requires v1.0.1
  [276daf66] SpecialFunctions v0.10.0
  [90137ffa] StaticArrays v0.12.1
  [a759f4b9] TimerOutputs v0.5.3
  [2a0f44e3] Base64  [`@stdlib/Base64`]
  [ade2ca70] Dates  [`@stdlib/Dates`]
  [8ba89e20] Distributed  [`@stdlib/Distributed`]
  [b77e0a4c] InteractiveUtils  [`@stdlib/InteractiveUtils`]
  [76f85450] LibGit2  [`@stdlib/LibGit2`]
  [8f399da3] Libdl  [`@stdlib/Libdl`]
  [37e2e46d] LinearAlgebra  [`@stdlib/LinearAlgebra`]
  [56ddb016] Logging  [`@stdlib/Logging`]
  [d6f4376e] Markdown  [`@stdlib/Markdown`]
  [44cfe95a] Pkg  [`@stdlib/Pkg`]
  [de0858da] Printf  [`@stdlib/Printf`]
  [3fa0cd96] REPL  [`@stdlib/REPL`]
  [9a3f8284] Random  [`@stdlib/Random`]
  [ea8e919c] SHA  [`@stdlib/SHA`]
  [9e88b42a] Serialization  [`@stdlib/Serialization`]
  [6462fe0b] Sockets  [`@stdlib/Sockets`]
  [2f01184e] SparseArrays  [`@stdlib/SparseArrays`]
  [10745b16] Statistics  [`@stdlib/Statistics`]
  [8dfed614] Test  [`@stdlib/Test`]
  [cf7118a7] UUIDs  [`@stdlib/UUIDs`]
  [4ec0a83e] Unicode  [`@stdlib/Unicode`]
'''
 Warning: You are using CUDNN 7.6.5 for CUDA 10.2.0 with CUDA toolkit 10.1.243; these might be incompatible.
└ @ CuArrays ~/.julia/packages/CuArrays/HE8G6/src/CuArrays.jl:127
[ Info: Testing using device Quadro K2100M (compute capability 3.0.0, 1.358 GiB available memory) on CUDA driver 10.2.0 and toolkit 10.1.243
┌ Warning: Performing scalar operations on GPU arrays: This is very slow, consider disallowing these operations with `allowscalar(false)`
└ @ GPUArrays ~/.julia/packages/GPUArrays/1wgPO/src/indexing.jl:16
findfirst: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
  Test threw exception
  Expression: testf((x->begin
            findfirst(x)
        end), rand(Bool, 100))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 153; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{typeof(identity)}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Bool,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [6] macro expansion at ./gcutils.jl:91 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [8] findfirst(::typeof(identity), ::CuArray{Bool,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:135
   [9] findfirst at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:141 [inlined]
   [10] (::var"#194#199")(::CuArray{Bool,1,Nothing}) at ./none:0
   [11] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Bool,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [12] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [13] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [14] testf(::Function, ::Array{Bool,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [15] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
   [16] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [17] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
  
findfirst: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:406
  Test threw exception
  Expression: testf((x->begin
            findfirst((y->begin
                        y > 0.5
                    end), x)
        end), rand(100))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 156; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{var"#196#201"}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [6] macro expansion at ./gcutils.jl:91 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [8] findfirst(::var"#196#201", ::CuArray{Float64,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:135
   [9] (::var"#195#200")(::CuArray{Float64,1,Nothing}) at ./none:0
   [10] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Float64,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [11] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [12] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [13] testf(::Function, ::Array{Float64,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [14] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:406
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
  
findfirst: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:410
  Test threw exception
  Expression: findfirst(x) == findfirst(CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 157; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{typeof(identity)}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Bool,2,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::typeof(identity), ::CuArray{Bool,2,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] findfirst(::CuArray{Bool,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:141
   [7] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:410
   [8] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [9] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
  
findfirst: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:413
  Test threw exception
  Expression: findfirst((y->begin
                y > 0.5
            end), x) == findfirst((y->begin
                y > 0.5
            end), CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 160; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{var"#198#203"}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float64,2,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::var"#198#203", ::CuArray{Float64,2,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:413
   [7] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:405
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:419
  Test threw exception
  Expression: findmax(x) == findmax(CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 156; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#48#49"{Float32}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float32,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#48#49"{Float32}, ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmax#47(::Function, ::typeof(findmax), ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:240
   [7] findmax(::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:238
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:419
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:420
  Test threw exception
  Expression: findmax(x; dims=1) == Array.(findmax(CuArray(x); dims=1))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 651; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#41", ::Type{Tuple{CuDeviceArray{Float32,1,CUDAnative.AS.Global},CuDeviceArray{Float32,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{0,Tuple{}},CartesianIndices{0,Tuple{}},CartesianIndices{2,Tuple{Base.OneTo{Int64},Base.OneTo{Int64}}}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] findfirst(::CuArray{Float32,1,Nothing}, ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmax#47 at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:244 [inlined]
   [7] (::Base.var"#kw##findmax")(::NamedTuple{(:dims,),Tuple{Int64}}, ::typeof(findmax), ::CuArray{Float32,1,Nothing}) at ./none:0
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:420
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:423
  Test threw exception
  Expression: findmax(x) == findmax(CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 160; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#48#49"{Float32}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float32,2,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#48#49"{Float32}, ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmax#47(::Function, ::typeof(findmax), ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:240
   [7] findmax(::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:238
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:423
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:424
  Test threw exception
  Expression: findmax(x; dims=1) == Array.(findmax(CuArray(x); dims=1))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 739; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#41", ::Type{Tuple{CuDeviceArray{Float32,2,CUDAnative.AS.Global},CuDeviceArray{Float32,2,CUDAnative.AS.Global},CuDeviceArray{Int64,2,CUDAnative.AS.Global},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{0,Tuple{}},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{2,Tuple{Base.OneTo{Int64},Base.OneTo{Int64}}}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [6] macro expansion at ./gcutils.jl:91 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [8] findfirst(::CuArray{Float32,2,Nothing}, ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:202
   [9] #findmax#47 at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:244 [inlined]
   [10] (::Base.var"#kw##findmax")(::NamedTuple{(:dims,),Tuple{Int64}}, ::typeof(findmax), ::CuArray{Float32,2,Nothing}) at ./none:0
   [11] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:424
   [12] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [13] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:428
  Test threw exception
  Expression: findmin(x) == findmin(CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 156; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#45#46"{Float32}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float32,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#45#46"{Float32}, ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmin#44(::Function, ::typeof(findmin), ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:228
   [7] findmin(::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:226
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:428
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:429
  Test threw exception
  Expression: findmin(x; dims=1) == Array.(findmin(CuArray(x); dims=1))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 651; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#41", ::Type{Tuple{CuDeviceArray{Float32,1,CUDAnative.AS.Global},CuDeviceArray{Float32,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{0,Tuple{}},CartesianIndices{0,Tuple{}},CartesianIndices{2,Tuple{Base.OneTo{Int64},Base.OneTo{Int64}}}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] findfirst(::CuArray{Float32,1,Nothing}, ::CuArray{Float32,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmin#44 at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:232 [inlined]
   [7] (::Base.var"#kw##findmin")(::NamedTuple{(:dims,),Tuple{Int64}}, ::typeof(findmin), ::CuArray{Float32,1,Nothing}) at ./none:0
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:429
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  


findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:432
  Test threw exception
  Expression: findmin(x) == findmin(CuArray(x))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 160; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#45#46"{Float32}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Float32,2,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#45#46"{Float32}, ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmin#44(::Function, ::typeof(findmin), ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:228
   [7] findmin(::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:226
   [8] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:432
   [9] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [10] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
findmax & findmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:433
  Test threw exception
  Expression: findmin(x; dims=1) == Array.(findmin(CuArray(x); dims=1))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 739; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::Nothing, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#41", ::Type{Tuple{CuDeviceArray{Float32,2,CUDAnative.AS.Global},CuDeviceArray{Float32,2,CUDAnative.AS.Global},CuDeviceArray{Int64,2,CUDAnative.AS.Global},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{0,Tuple{}},CartesianIndices{1,Tuple{Base.OneTo{Int64}}},CartesianIndices{2,Tuple{Base.OneTo{Int64},Base.OneTo{Int64}}}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] cufunction(::Function, ::Type) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [5] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179 [inlined]
   [6] macro expansion at ./gcutils.jl:91 [inlined]
   [7] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:176 [inlined]
   [8] findfirst(::CuArray{Float32,2,Nothing}, ::CuArray{Float32,2,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:202
   [9] #findmin#44 at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:232 [inlined]
   [10] (::Base.var"#kw##findmin")(::NamedTuple{(:dims,),Tuple{Int64}}, ::typeof(findmin), ::CuArray{Float32,2,Nothing}) at ./none:0
   [11] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:433
   [12] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [13] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:418
  
argmax & argmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
  Test threw exception
  Expression: testf(argmax, rand(Int, 10))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 155; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#48#49"{Int64}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#48#49"{Int64}, ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmax#47(::Function, ::typeof(findmax), ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:240
   [7] #findmax at ./none:0 [inlined]
   [8] #argmax#616 at ./reducedim.jl:882 [inlined]
   [9] argmax(::CuArray{Int64,1,Nothing}) at ./reducedim.jl:882
   [10] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Int64,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [11] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [12] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [13] testf(::Function, ::Array{Int64,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [14] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
  
argmax & argmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:439
  Test threw exception
  Expression: testf(argmax, -(rand(Int, 10)))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 155; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#48#49"{Int64}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#48#49"{Int64}, ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmax#47(::Function, ::typeof(findmax), ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:240
   [7] #findmax at ./none:0 [inlined]
   [8] #argmax#616 at ./reducedim.jl:882 [inlined]
   [9] argmax(::CuArray{Int64,1,Nothing}) at ./reducedim.jl:882
   [10] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Int64,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [11] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [12] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [13] testf(::Function, ::Array{Int64,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [14] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:439
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
  
argmax & argmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:441
  Test threw exception
  Expression: testf(argmin, rand(Int, 10))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 155; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#45#46"{Int64}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#45#46"{Int64}, ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmin#44(::Function, ::typeof(findmin), ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:228
   [7] #findmin at ./none:0 [inlined]
   [8] #argmin#615 at ./reducedim.jl:857 [inlined]
   [9] argmin(::CuArray{Int64,1,Nothing}) at ./reducedim.jl:857
   [10] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Int64,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [11] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [12] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [13] testf(::Function, ::Array{Int64,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [14] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:441
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
  
argmax & argmin: Error During Test at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:442
  Test threw exception
  Expression: testf(argmin, -(rand(Int, 10)))
  CUDA error: a PTX JIT compilation failed (code 218, ERROR_INVALID_PTX)
  ptxas application ptx input, line 155; error   : Instruction 'atom.min.s64 with generic state space' requires .target sm_32 or higher
  ptxas fatal   : Ptx assembly aborted due to errors
  Stacktrace:
   [1] CuModule(::String, ::Dict{CUDAdrv.CUjit_option_enum,Any}) at /home/g2-test/.julia/packages/CUDAdrv/b1mvw/src/module.jl:40
   [2] macro expansion at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:423 [inlined]
   [3] #cufunction#200(::String, ::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(cufunction), ::CuArrays.var"#kernel#34"{CuArrays.var"#45#46"{Int64}}, ::Type{Tuple{CuDeviceArray{Int64,1,CUDAnative.AS.Global},CuDeviceArray{Int64,1,CUDAnative.AS.Global}}}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:360
   [4] (::CUDAnative.var"#kw##cufunction")(::NamedTuple{(:name,),Tuple{String}}, ::typeof(cufunction), ::Function, ::Type) at ./none:0
   [5] findfirst(::CuArrays.var"#45#46"{Int64}, ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CUDAnative/hfulr/src/execution.jl:179
   [6] #findmin#44(::Function, ::typeof(findmin), ::CuArray{Int64,1,Nothing}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/src/indexing.jl:228
   [7] #findmin at ./none:0 [inlined]
   [8] #argmin#615 at ./reducedim.jl:857 [inlined]
   [9] argmin(::CuArray{Int64,1,Nothing}) at ./reducedim.jl:857
   [10] #compare#1(::Base.Iterators.Pairs{Union{},Union{},Tuple{},NamedTuple{(),Tuple{}}}, ::typeof(Main.TestSuite.compare), ::Function, ::Type{CuArray}, ::Array{Int64,1}) at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:23
   [11] compare at /home/g2-test/.julia/packages/GPUArrays/1wgPO/test/testsuite.jl:20 [inlined]
   [12] #testf#3 at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18 [inlined]
   [13] testf(::Function, ::Array{Int64,1}) at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:18
   [14] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:442
   [15] top-level scope at /buildworker/worker/package_linux64/build/usr/share/julia/stdlib/v1.3/Test/src/Test.jl:1107
   [16] top-level scope at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/base.jl:438
  
[ Info: Testing CUDNN 7.6.5
┌ Warning: Not testing CUTENSOR
└ @ Main ~/.julia/packages/CuArrays/HE8G6/test/tensor.jl:7
[ Info: Testing ForwardDiff integration
Effective GPU memory usage: 42.22% (844.375 MiB/1.953 GiB)
CuArrays GPU memory usage: 10.676 MiB
BinnedPool usage: 10.676 MiB (10.443 MiB allocated, 239.251 KiB cached)
BinnedPool efficiency: 70.76% (7.555 MiB requested, 10.676 MiB allocated)
 ────────────────────────────────────────────────────────
                                           Time          
                                   ──────────────────────
         Tot / % measured:               476s / 0.77%    

 Section                   ncalls     time   %tot     avg
 ────────────────────────────────────────────────────────
 background task               20    875ms  23.8%  43.8ms
   pooled free                177    287μs  0.01%  1.62μs
   reclaim                     20   13.6ms  0.37%   682μs
   scan                        20   31.4μs  0.00%  1.57μs
 pooled alloc               10.2k    2.74s  74.5%   269μs
   1. try alloc             3.44k    109ms  2.96%  31.6μs
   2. gc (incremental)          6   90.7ms  2.46%  15.1ms
     pooled free               12   13.5μs  0.00%  1.12μs
   3. reclaim unused            6    364μs  0.01%  60.7μs
     reclaim                    6    344μs  0.01%  57.4μs
     scan                       6   10.9μs  0.00%  1.81μs
   4. try alloc                 6   6.55ms  0.18%  1.09ms
   5. gc (full)                 6    1.90s  51.6%   317ms
     pooled free               40   42.3μs  0.00%  1.06μs
   6. reclaim unused            6    437μs  0.01%  72.9μs
     reclaim                    6    411μs  0.01%  68.4μs
     scan                       6   14.5μs  0.00%  2.41μs
   7. try alloc                 6   10.0ms  0.27%  1.67ms
   8. reclaim everything        6   18.4μs  0.00%  3.06μs
     reclaim                    6   4.77μs  0.00%   796ns
     scan                       6   5.78μs  0.00%   963ns
   9. try alloc                 6   3.64ms  0.10%   607μs
 pooled free                7.87k   61.6ms  1.67%  7.83μs
 reclaim                        6    195μs  0.01%  32.5μs
 scan                           6   14.0μs  0.00%  2.33μs
 ────────────────────────────────────────────────────────
 ────────────────────────────────────────
                           Time          
                   ──────────────────────
 Tot / % measured:       476s / 0.03%    

 Section   ncalls     time   %tot     avg
 ────────────────────────────────────────
 alloc      3.46k    124ms  89.9%  35.9μs
 free       1.28k   14.0ms  10.1%  10.9μs
 ────────────────────────────────────────
Test Summary:            | Pass  Error  Total
CuArrays                 | 4882     16   4898
  GPUArrays test suite   | 1104          1104
  essential utilities    |    3             3
  Memory                 |    5             5
  Array                  |   25            25
  Adapt                  |    2             2
  Broadcast              |   18            18
  Cufunc                 |    8             8
  Ref Broadcast          |    1             1
  Broadcast Fix          |    3             3
  Reduce                 |   19            19
  0D                     |    2             2
  SubArray               |   22            22
  reshape                |    1             1
  triu! with diagonal -2 |    1             1
  triu! with diagonal -1 |    1             1
  triu! with diagonal 0  |    1             1
  triu! with diagonal 1  |    1             1
  triu! with diagonal 2  |    1             1
  tril! with diagonal -2 |    1             1
  tril! with diagonal -1 |    1             1
  tril! with diagonal 0  |    1             1
  tril! with diagonal 1  |    1             1
  tril! with diagonal 2  |    1             1
  Utilities              |    2             2
  accumulate             |   17            17
  logical indexing       |   15            15
  generic fallbacks      |   11            11
  reverse                |   26            26
  permutedims            |    2             2
  findall                |    4             4
  findfirst              |           4      4
  findmax & findmin      |           8      8
  argmax & argmin        |           4      4
  issue #543             |    2             2
  resizing               |    9             9
  memory allocator       |    2             2
  CUBLAS                 | 1502          1502
  CURAND                 |   99            99
  CUFFT                  |  150           150
  CUSPARSE               | 1235          1235
  CUSOLVER               |  305           305
  CUSPARSE + CUSOLVER    |   84            84
  CUDNN                  |   86            86
  CUTENSOR               |              No tests
  ForwardDiff            |  107           107
ERROR: LoadError: Some tests did not pass: 4882 passed, 0 failed, 16 errored, 0 broken.
in expression starting at /home/g2-test/.julia/packages/CuArrays/HE8G6/test/runtests.jl:41
ERROR: Packages CUDAdrv, CUDAnative, CuArrays errored during testing

Thank you, this thing above is the key here. SM stands for streaming multiprocessor, and 32 is the version of the required ‘compute capability’, 3.2 in this case. You can check this page to see if your graphics card is in fact lower than this required version. If you have access to a machine with a newer GPU, I suggest you try it there.

edit: I see it now, it’s a mobile Quadro K2100M. Unfortunately, this probably won’t work with that.

1 Like

Is there any other way I could learn Flux? Something of the sort online editor?

You don’t explicitly need to use a GPU to do things with Flux, you can put together moderately large models and train them on the CPU as well. I think for your first steps not using a GPU won’t limit you at all.