I get “too many resources requested for launch” in CUDA.jl kernel when I try to either
set value to the array set in global memory like
OR print anything using
I suspect that It is becouse the amount of registers use is too high so
- I suppose that in this case it may be related to caching reasults fetched from global memory - can i swith it off?
- can i run GC on variables in register memory - so I would manually clear it before problem arise (CUDA.unsafe_free!() seems to work on arrays only - am I wrong? )
- How can I increase maxrregcount in CUDA.jl ?