Are there plans to add layered texture support to CUDA.jl? I am currently using a 3D texture in my application, but I suspect I am getting sub-optimal performance because my data is really a stack of independent 2D textures.
I believe this would require two modifications:
- The underlying CuTextureArray should be allocated with the CUDA_ARRAY3D_LAYERED flag set. I tested this myself by hacking the alloc() call, and my texture fetch performance improved very slightly.
- The device-side NVVM calls for tex2DLayered() need to be implemented. This seems less straightforward because, as far as I can tell, the layered texture calls are not covered in the main NVVM documentation (NVVM IR :: CUDA Toolkit Documentation).
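For reference, here is roughly what those two pieces look like in plain CUDA C++ with the runtime API. This is only a sketch of the behavior I would like to reproduce, not CUDA.jl code: the kernel name `sample_layers` and the sizes are made up for illustration, `cudaArrayLayered` is the runtime-API counterpart of the driver flag `CUDA_ARRAY3D_LAYERED`, and error checking is left out for brevity.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

// Hypothetical kernel: fetch one texel from every layer at a fixed (x, y).
__global__ void sample_layers(cudaTextureObject_t tex, float *out,
                              int num_layers, float x, float y)
{
    int layer = blockIdx.x * blockDim.x + threadIdx.x;
    if (layer < num_layers) {
        // The layer is selected by an integer index; filtering only
        // happens within a layer, never across layers.
        out[layer] = tex2DLayered<float>(tex, x, y, layer);
    }
}

int main()
{
    const int width = 4, height = 4, layers = 3;

    // Host data: every texel in layer l holds the value l.
    std::vector<float> host(width * height * layers);
    for (int l = 0; l < layers; ++l)
        for (int i = 0; i < width * height; ++i)
            host[l * width * height + i] = static_cast<float>(l);

    // Modification 1: allocate the array as layered. The depth of the
    // extent is interpreted as the number of 2D layers.
    cudaChannelFormatDesc desc = cudaCreateChannelDesc<float>();
    cudaExtent extent = make_cudaExtent(width, height, layers);
    cudaArray_t array;
    cudaMalloc3DArray(&array, &desc, extent, cudaArrayLayered);

    // Upload all layers in one 3D copy.
    cudaMemcpy3DParms copy = {};
    copy.srcPtr = make_cudaPitchedPtr(host.data(), width * sizeof(float),
                                      width, height);
    copy.dstArray = array;
    copy.extent = extent;
    copy.kind = cudaMemcpyHostToDevice;
    cudaMemcpy3D(&copy);

    // Bind the layered array to a texture object.
    cudaResourceDesc res = {};
    res.resType = cudaResourceTypeArray;
    res.res.array.array = array;

    cudaTextureDesc tex_desc = {};
    tex_desc.addressMode[0] = cudaAddressModeClamp;
    tex_desc.addressMode[1] = cudaAddressModeClamp;
    tex_desc.filterMode = cudaFilterModePoint;
    tex_desc.readMode = cudaReadModeElementType;
    tex_desc.normalizedCoords = 0;

    cudaTextureObject_t tex = 0;
    cudaCreateTextureObject(&tex, &res, &tex_desc, nullptr);

    // Modification 2: the device-side layered fetch.
    float *d_out = nullptr;
    cudaMalloc(&d_out, layers * sizeof(float));
    sample_layers<<<1, 32>>>(tex, d_out, layers, 1.5f, 1.5f);

    float out[layers];
    cudaMemcpy(out, d_out, sizeof(out), cudaMemcpyDeviceToHost);
    for (int l = 0; l < layers; ++l)
        printf("layer %d -> %f\n", l, out[l]);  // each value should match its layer index

    cudaFree(d_out);
    cudaDestroyTextureObject(tex);
    cudaFreeArray(array);
    return 0;
}
```

Compiling something like this with `nvcc -ptx` and looking at what `tex2DLayered` lowers to might also give a lead on the NVVM side, though I have not checked how that maps onto what CUDA.jl emits.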
Any thoughts on whether this is feasible, or leads on where to find documentation for the layered calls?