Array addition of oneAPI.jl slower

Why is array addition with oneAPI.jl slow?

julia> using BenchmarkTools, oneAPI

julia> c = rand(100,100);
julia> @btime $c.+1
  4.950 μs (3 allocations: 78.21 KiB)

julia> a = oneArray(rand(100,100));
julia> @btime $a.+1
  423.117 μs (526 allocations: 63.27 KiB)
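One benchmarking caveat worth noting: GPU operations run asynchronously, so a plain `@btime` may measure only the time to enqueue the broadcast rather than to finish it. A minimal sketch, assuming oneAPI.jl provides an `oneAPI.@sync` macro analogous to CUDA.jl's `CUDA.@sync` (check the package for the exact name):

```julia
using BenchmarkTools, oneAPI

a = oneArray(rand(Float32, 100, 100))

# oneAPI.@sync (assumed here to mirror CUDA.jl's CUDA.@sync)
# blocks until the GPU has actually finished the operation,
# so the timing covers the full computation, not just the launch.
@btime oneAPI.@sync $a .+ 1
```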

Three (possible) reasons:

  • Your array is small. You are likely measuring what it costs to launch the broadcast.
  • If you want the best performance, use a kernel.
  • oneAPI.jl is by far not yet as optimized as CUDA.jl.

However, once you get beyond the overhead the raw compute performance in Julia on Intel GPUs is pretty good.
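To illustrate the overhead point: the launch cost is roughly constant per operation, so it dominates for a 100×100 array but is amortized over a much larger one. A hypothetical comparison (the `oneAPI.@sync` macro is assumed to exist, mirroring CUDA.jl; timings will depend on your hardware):

```julia
using BenchmarkTools, oneAPI

small = oneArray(rand(Float32, 100, 100))
big   = oneArray(rand(Float32, 4096, 4096))

# For the small array the fixed launch overhead dominates;
# for the big one, the time approaches raw memory bandwidth.
@btime oneAPI.@sync $small .+ 1
@btime oneAPI.@sync $big .+ 1
```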


I have an Intel GPU, so I can't use CUDA.jl. Please improve the documentation of oneAPI.jl — I see that this oneAPI.jl documentation link is broken. How do I use a kernel?

I would suggest starting by writing kernels with KernelAbstractions.jl. I assume that in the long run you don't want to support only Intel GPUs anyway. With KernelAbstractions.jl you can target CUDA, oneAPI, ROCm, and Metal together.
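To give a concrete idea of what that looks like, here is a minimal sketch of a KernelAbstractions.jl kernel that adds a scalar to each element. The kernel name `add_scalar!` is made up for illustration; the `@kernel`, `@index`, and `get_backend` APIs are from KernelAbstractions.jl:

```julia
using KernelAbstractions

# A backend-agnostic kernel: the same code runs on CPU,
# CUDA, oneAPI, ROCm, and Metal arrays.
@kernel function add_scalar!(out, @Const(a), b)
    i = @index(Global, Linear)
    @inbounds out[i] = a[i] + b
end

# Launching on an Intel GPU via oneAPI.jl:
using oneAPI
a = oneArray(rand(Float32, 100, 100))
out = similar(a)
backend = get_backend(a)               # picks the oneAPI backend from the array type
add_scalar!(backend)(out, a, 1f0; ndrange = length(a))
KernelAbstractions.synchronize(backend)
```

Because the backend is derived from the array type, the same launch code works unchanged if `a` is a `CuArray`, `ROCArray`, or plain CPU `Array`.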


Yes, but KernelAbstractions.jl doesn't have documentation to start with. Will KernelAbstractions.jl be slower than oneAPI.jl due to the overhead of converting code, etc.?

https://juliagpu.github.io/KernelAbstractions.jl/stable/


Thanks. I think it would be better to link it on the GitHub - JuliaGPU/KernelAbstractions.jl: Heterogeneous programming in Julia page — I was looking in the About section on the right. Sorry.

As with nearly every Julia package, it is linked through the blue "docs: stable" badge on the README.


But you're right: I would also like to have the link in the About section at the top.


Where did you find it? From the navigation menu, if I go to "Backends" and then "oneAPI", I arrive at Intel oneAPI ⋅ JuliaGPU.

I found it in the About section of GitHub - JuliaGPU/oneAPI.jl: Julia support for the oneAPI programming toolkit.