Julia Programming Language
Sequence of warp and how to avoid divergence when folding shared memory in a reduction kernel
Lian_Yunlong
July 18, 2018, 4:18pm
I just need some keywords and/or some links