A_mul_B! deviates from C = A*B with complex numbers

Jan · April 4, 2018, 10:56pm

TL;DR:
LinAlg.A_mul_B!(result, A, α*v) deviates from result = α*A*v.

Hey everybody!

So I am new to the game here. I was drawn to implement the code of my project in Julia because it had all the features I was looking for built in: Sparse Matrices, native Complex Numbers, fast runtime.

Part of my project includes solving iteratively the ODE of the Schrödinger equation:

\partial_t {\psi} = -i H {\psi}

In my project a state vector \psi is represented by a 2^N-component vector; H is represented by a Sparse 2^N \times 2^N Matrix.

I opted for the 4-step (classical) Runge-Kutta solver to preserve the L²-Norm of the vector.

While debugging I was puzzled by the following (mis-)behaviour of the Low-Level BLAS function LinAlg.A_mul_B!(), which I wanted to employ because of the very high iterations in the iterative solving.

Consider the following code snippet:

N = 2^16

v = randn(N)
v[:] = v/norm(v)

H = sprandn(N, N, 0.002)

result1 = Array{Complex{Float64}}(v)
result2 = Array{Complex{Float64}}(v)

Δt = 0.0005

result1[:] = -im*Δt*H*v
LinAlg.A_mul_B!(result2, H, -im*Δt*v)

println("Do the two results agree?\n$(prod( result1 .== result2 ))")

Δv = norm( result1 - result2 )
println("What is the norm of the difference vector Δv?\n$(Δv)")

println("Δv/Δt = $(Δv/Δt)")

which yields after running the following output:

$ julia snippet.jl

Do the two results agree?
false
What is the norm of the difference vector Δv?
1.390578911436828e-18
Δv/Δt = 2.781157822873656e-15

\mathcal{O}(10^{-15}) might not seem much of a difference, yet I am puzzled there is one at all!
The difference of those two results is accumulating over the four step Runge-Kutta iteration and ends up being \mathcal{O}(2\% \Delta t). This is rather unacceptable for one simple iteration step…

The previous statement can be observed with, for example, this code:

function rungekutta!(input::Array{Complex{Float64}}, H::SparseMatrixCSC{Float64,Int64}, deltaT::Float64, k1::Array{Complex{Float64}}, k2::Array{Complex{Float64}}, k3::Array{Complex{Float64}}, k4::Array{Complex{Float64}})
    LinAlg.A_mul_B!(k1, H, -im*deltaT*input )
    LinAlg.A_mul_B!(k2, H, -im*deltaT*(input + LinAlg.scale!( 1./2., k1) ) )
    LinAlg.A_mul_B!(k3, H, -im*deltaT*(input + LinAlg.scale!( 1./2., k2) ) )
    LinAlg.A_mul_B!(k4, H, -im*deltaT*(input + k3 )  )
    input[:] += (LinAlg.scale!(2.0, k2) + LinAlg.scale!(2.0, k3) + k4)/6.
    return nothing
end

function rungekutta2!(input::Array{Complex{Float64}}, H::SparseMatrixCSC{Float64,Int64}, deltaT::Float64, k1::Array{Complex{Float64}}, k2::Array{Complex{Float64}}, k3::Array{Complex{Float64}}, k4::Array{Complex{Float64}})
    k1[:] = -im*deltaT*(H*(input))
    k2[:] = -im*deltaT*(H*(input+k1/2.0))
    k3[:] = -im*deltaT*(H*(input+k2/2.0))
    k4[:] = -im*deltaT*(H*(input+k3))
    input[:] += (k1+2.*k2+2.*k3+k4)/6.0
    return nothing
end


N = 2^16

v = randn(N)
v[:] = v/norm(v)

H = sprandn(N, N, 0.002)

k1 = zeros(Complex{Float64}, N)
k2 = zeros(Complex{Float64}, N)
k3 = zeros(Complex{Float64}, N)
k4 = zeros(Complex{Float64}, N)

result1 = Array{Complex{Float64}}(v)
result2 = Array{Complex{Float64}}(v)


Δt = 0.0005

result1[:] = -im*Δt*H*v
LinAlg.A_mul_B!(result2, H, -im*Δt*v)


rungekutta!(result1, H, Δt, k1, k2, k3, k4)
rungekutta2!(result2, H, Δt, k1, k2, k3, k4)

println("Do the two results agree?\n$(prod( result1 .== result2 ))")

Δv = norm( result1 - result2 )
println("What is the norm of the difference vector Δv?\n$(Δv)")

println("Δv/Δt = $(Δv/Δt)")

which yields after running the following output:

$ julia snippet2.jl

Do the two results agree?
false
What is the norm of the difference vector Δv?
1.096038473328683e-5
Δv/Δt = 0.021920769466573658

I would be very happy to find out why these two operations doe not agree with each other and even more happy, if someone finds a mistake in my code!!

Cheers
Jan

tkoolen · April 5, 2018, 12:24am

The order of operations matters in floating point world.

julia> result3 = H * (-im*Δt*v);

julia> norm(result3 - result2)
0.0

Edit: as an extreme example:

julia> 1e16 + 1.0 - 1e16
0.0

julia> 1e16 - 1e16 + 1.0
1.0

and see PSA: floating-point arithmetic.

By the way, you may want to consider using OrdinaryDiffEq.jl (part of DifferentialEquations.jl) if you’re solving ODEs, for a vast array of integrators that are highly optimized and tested for accuracy.

ChrisRackauckas · April 5, 2018, 5:13am

RK4 is not a conservative (geometric or symplectic) integrator. It’s also an integrator with large error terms and low order. It’s really not a good choice if you’re looking for accuracy or conservation properties.

That’s missing k1.

antoine-levitt · April 5, 2018, 6:28am

I second the comments here. Floating point results depending on the order of operations is simply a fact of life (one that is not that important once you learn to stop worrying about it) Depending on your accuracy/timing requirements, you can get away with just calling DiffEq with an explicit solver. If unitarity is important to you, try Crank-Nicolson, or an exponential integrator (using https://github.com/marcusps/ExpmV.jl for instance). There’s mostly no point in recoding a diff eq solver (except for learning purposes): using an existing one gets you things like timestep adaptivity for free. I don’t think there is a good solution for time-dependent Schrödinger (i dt psi = H(t) psi) in DiffEq yet, though.

ChrisRackauckas · April 5, 2018, 1:22pm

Well there’s tooling at least in the sense of Runge-Kutta methods, but yes for time-dependent Schrondinger we want Krylov exponential integrators with String splitting:

github.com/SciML/OrdinaryDiffEq.jl

Strang splitting

opened 01:40PM - 06 Aug 17 UTC

closed 08:14PM - 12 Mar 20 UTC

jagot

new-algorithm

For composed linear operators of the type `F = A + B + C + ...`, there should, i…n addition to the normal methods (implicit Euler, Crank–Nicolson, etc), which operate using `F*u` and `F\u`, an operator splitting method which approximate the propagator as `...exp(C/2)exp(B/2)exp(A)exp(B/2)exp(C/2)...` with automatic symmetric placement of the exponentials. My suggestion is that the first operator should be placed inner-most.

A GSoC potential has already gotten the Krylov exponential integrators up and running, along with a draft for reformulating the operator spec in a way that’s time-dependent splitting compatible:

github.com/SciML/DiffEqOperators.jl

New DiffEqOperator Interface for Array and Composite Operators

opened 08:28PM - 03 Apr 18 UTC

closed 11:19AM - 28 Mar 21 UTC

MSeeker1340

The current interface of `AbstractDiffEqOperator` is a bit messy and could use s…ome rework. Some of the major problems are: - The definitions are not centralized, with some of the type hierarchy defined in DiffEqBase instead of DiffEqOperators. - `AbstractDiffEqOperator` is defined as a subtype of `AbstractLinearMap` from LinearMaps.jl. This is problematic because the combination and composition of operators will be of type `LinearCombination` and `CompositeMap`, which goes beyond the `AbstractDiffEqOperator` type hierarchy. Also the implementation in LinearMaps.jl for operator combinations and compositions are not optimal for out standard (e.g. multiplication requires allocation of additional memory). - Time-dependent coefficients. Currently they are implemented as `DiffEqScalar` and included within array operators. Presumably we need to do the same for all operators, which can be cumbersome. Below is my draft for a new `DiffEqOperator` interface that gets rid of the LinearMaps dependence and build up the type hierarchy from scratch. The overall structure is similar to LinearMaps while at the same time tailored to the needs of JuliaDiffEq. Feedbacks are welcomed :P [New DiffEqArrayOperator Interface Draft](https://nbviewer.jupyter.org/github/MSeeker1340/DiffEqOperators.jl/blob/new_operator_interface/docs/New%20DiffEqOperator%20Interface%20Draft.ipynb)

So while this project would be based around semilinear exponential integrators, it will build all of the infrastructure to make these methods easy to implement, and then we just need to update the file of Strang splitting methods for that new interface (unreleased: we did this as an interface test)

github.com

SciML/OrdinaryDiffEq.jl/blob/master/src/perform_step/linear_perform_step.jl

function initialize!(integrator, cache::MagnusMidpointCache)
    integrator.kshortsize = 2
    integrator.fsalfirst = cache.fsalfirst
    integrator.fsallast = cache.k
    resize!(integrator.k, integrator.kshortsize)
    integrator.k[1] = integrator.fsalfirst
    integrator.k[2] = integrator.fsallast
    integrator.f(integrator.fsalfirst, integrator.uprev, integrator.p, integrator.t) # For the interpolation, needs k at the updated point
    integrator.stats.nf += 1
end

function perform_step!(integrator, cache::MagnusMidpointCache, repeat_step = false)
    @unpack t, dt, uprev, u, p = integrator
    alg = unwrap_alg(integrator, nothing)
    @unpack W, k, tmp, exp_cache = cache
    mass_matrix = integrator.f.mass_matrix
    exp_method = ExpMethodGeneric()

    L = integrator.f.f
    update_coefficients!(L, u, p, t + dt / 2)

This file has been truncated. show original

tl;dr: I would guess we will have all of the tools to make methods specifically for u' = A(t)u well-optimized by the end of the summer. If you want them just bump this thread sometime midsummer to remind me.

antoine-levitt · April 5, 2018, 2:01pm

That’s great, and I will be interested in testing it. Cases of interest would be i psi’ = A(t) psi / eps, and i psi’ = A(t) psi + eps B(t) psi (adiabatic and linear response respectively); I’m not sure if efficient (= robust to eps->0) methods are known for these.

ChrisRackauckas · April 5, 2018, 2:12pm

Yeah, I don’t know of any methods specifically for this too, but we’ll have a good setup to develop one .

Topic		Replies	Views
Precision issue: (A.+R.-A.-R) do not vanish ?! Performance question	33	1849	July 29, 2018
Vector - Matrix - Vector multiplication Performance	19	4442	March 14, 2021
Performance of a key step in Strang Splitting method Performance pde	10	718	March 17, 2021
Linear solver \(A, B) performance vs Matlab A\b General Usage	32	7836	May 21, 2017
Multiplication times for complex matrices: julia vs Matlab New to Julia question	18	2962	March 21, 2020

A_mul_B! deviates from C = A*B with complex numbers

Related topics