Packages to write a blog post on “Optimizing an X matmul kernel” in Julia

If you want to be GPU-vendor agnostic, then yes, KernelAbstractions.jl or AcceleratedKernels.jl are probably the most relevant options.
I see there are also WebGPU bindings, the JuliaWGPU organization on GitHub and cshenton/WebGPU.jl (Julia bindings and a native wrapper for the gfx WebGPU implementation), though they don’t have any documentation.
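
For a sense of what the starting point of such a post could look like, here is a minimal sketch of a naive matmul kernel in KernelAbstractions.jl. The kernel name `naive_matmul!` and the matrix sizes are my own placeholders, and it runs on the CPU backend with plain `Array`s; I'm assuming that swapping in a GPU array type (e.g. from CUDA.jl or AMDGPU.jl) is all it takes to pick another backend, but I haven't benchmarked any of this.

```julia
using KernelAbstractions

# Naive kernel: one work-item computes one element of C.
# `naive_matmul!` is an illustrative name, not something from a package.
@kernel function naive_matmul!(C, A, B)
    i, j = @index(Global, NTuple)
    acc = zero(eltype(C))
    for k in 1:size(A, 2)
        acc += A[i, k] * B[k, j]
    end
    C[i, j] = acc
end

A = rand(Float32, 64, 32)
B = rand(Float32, 32, 48)
C = zeros(Float32, 64, 48)

# The backend is inferred from the array type (CPU() for plain Arrays).
backend = get_backend(C)
kernel! = naive_matmul!(backend)
kernel!(C, A, B; ndrange = size(C))
KernelAbstractions.synchronize(backend)
```

From there the usual optimization steps (tiling, local memory, and so on) would be layered on top of this baseline.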
