Efficient calculation of many complex Lorentzians on a large array

LaurentPlagne · February 10, 2022, 10:33am

Reading the Julia source for inv (https://github.com/JuliaLang/julia/blob/master/base/complex.jl),
it appears that the inverse of a Complex{Float32} is computed by converting (widening) the argument to Complex{Float64} and calling the corresponding specialized implementation. That explains at least part of the overhead.
I do not know enough to tell whether this implementation choice is optimal.

inv(z::Complex{<:Union{Float16,Float32}}) =
    oftype(z, inv(widen(z)))

/(z::Complex{T}, w::Complex{T}) where {T<:Union{Float16,Float32}} =
    oftype(z, widen(z)*inv(widen(w)))
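
To make the widening visible, here is a minimal sketch comparing Base's inv with a naive inverse that stays in the input precision. The helper naive_inv is hypothetical (it is not part of Base), and it uses the textbook formula, so it may overflow or underflow for very large or very small magnitudes. Whether it is actually faster would need to be benchmarked.

```julia
# Hypothetical helper (not part of Base): naive inverse that never widens.
# Uses 1/(c + i*d) = (c - i*d) / (c^2 + d^2); not robust to over/underflow.
@inline function naive_inv(z::Complex{T}) where {T<:AbstractFloat}
    c, d = reim(z)
    s = inv(c*c + d*d)              # computed in the input precision T
    return Complex{T}(c*s, -d*s)
end

z = ComplexF32(3, 4)

widened = widen(z)                  # what Base's method does first: ComplexF32 -> ComplexF64
base_result = inv(z)                # widen, invert in Float64, convert back with oftype
naive_result = naive_inv(z)         # all arithmetic done in Float32
```

Both results are ComplexF32 and agree to within Float32 rounding for well-scaled inputs like this one; the difference is only in the intermediate precision.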

ComplexF64 specialization:

# robust complex division for double precision
# variables are scaled & unscaled to avoid over/underflow, if necessary
# based on arxiv.1210.4539
#             a + i*b
#  p + i*q = ---------
#             c + i*d
function /(z::ComplexF64, w::ComplexF64)
    a, b = reim(z); c, d = reim(w)
    absa = abs(a); absb = abs(b);  ab = absa >= absb ? absa : absb # equiv. to max(abs(a),abs(b)) but without NaN-handling (faster)
    absc = abs(c); absd = abs(d);  cd = absc >= absd ? absc : absd

    halfov = 0.5*floatmax(Float64)              # overflow threshold
    twounϵ = floatmin(Float64)*2.0/eps(Float64) # underflow threshold

    # actual division operations
    if  ab>=halfov || ab<=twounϵ || cd>=halfov || cd<=twounϵ # over/underflow case
        p,q = scaling_cdiv(a,b,c,d,ab,cd) # scales a,b,c,d before division (unscales after)
    else
        p,q = cdiv(a,b,c,d)
    end

    return ComplexF64(p,q)
end
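
As a rough illustration of what the extra care in the ComplexF64 method is for, the sketch below compares it with the textbook formula on large inputs. The helper naive_div is hypothetical and written only for this comparison; the input values are arbitrary choices that make the intermediate products overflow.

```julia
# Hypothetical helper for illustration: textbook complex division, no scaling.
function naive_div(z::ComplexF64, w::ComplexF64)
    a, b = reim(z); c, d = reim(w)
    den = c*c + d*d                 # overflows to Inf when |c| or |d| is large
    return ComplexF64((a*c + b*d)/den, (b*c - a*d)/den)
end

z = complex(1e300, 1e300)
w = complex(1e300, 1e300)

naive = naive_div(z, w)             # intermediate products overflow, result is not finite
robust = z / w                      # Base method stays finite, close to 1
```

The robust version pays for this with extra comparisons and a branch per division, which is part of the cost seen when Float32 inputs are widened and routed through it.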

