Setting up implicit solvers to beat the performance of explicit solvers

JordiBolibar · August 2, 2023, 11:57am

We are trying to benchmark implicit solvers for an ice flow diffusivity PDE. So far we’ve stuck to explicit ones, but now we would like to cover all of them. Since many implicit solvers from DifferentialEquations.jl require AD, we’re currently having the following issue:

First call to automatic differentiation for the Jacobian
failed. This means that the user `f` function is not compatible
with automatic differentiation. Methods to fix this include:

1. Turn off automatic differentiation (e.g. Rosenbrock23() becomes
   Rosenbrock23(autodiff=false)). More details can befound at
   https://docs.sciml.ai/DiffEqDocs/stable/features/performance_overloads/
2. Improving the compatibility of `f` with ForwardDiff.jl automatic 
   differentiation (using tools like PreallocationTools.jl). More details
   can be found at https://docs.sciml.ai/DiffEqDocs/stable/basics/faq/#Autodifferentiation-and-Dual-Numbers
3. Defining analytical Jacobians. More details can be
   found at https://docs.sciml.ai/DiffEqDocs/stable/types/ode_types/#SciMLBase.ODEFunction

Since our f function is really optimized to avoid almost all memory allocations, I guess the problem comes from the usual fact that AD doesn’t like mutation. We have tried deactivating AD, but it’s terribly slow.

So my question is: what should one do in this case? The documentation is really scarce, and we haven’t found any clear examples on how to work around this issue. The code can be found here, and here’s the repository with all the benchmarks (with explicit solvers so far).

Thanks in advance!

Oscar_Smith · August 2, 2023, 12:05pm

the easy answer is to pass autodiff=false to the solver. it will then use finite different

JordiBolibar · August 2, 2023, 12:12pm

Yes, as I mentioned, we tried this and it’s horribly slow. We’re looking into implicit solvers to beat the performance of our best explicit solver.

baggepinnen · August 2, 2023, 12:28pm

It looks like the error message suggests improving compatiblity with ForwardDiff.jl. The problem then is likely that you allocate float arrays instead of generically typed arrays. How do your allocations look? Can you manually compute the Jacobian using ForwardDiff?

github.com

ODINN-SciML/iceflow_sandbox/blob/ad205e17e6141520c1d66c27f8228566f2bca182/scripts/1D_SIA.jl#L106


      
          end
          
          function stop_condition_tstops(u,t,integrator, tstops) 
              t in tstops
          end
          
          function iceflow!(dH, H, p, t)
              # Retrieve model parameters
             dx::Float64, width::Vector{Float64}, bed_hs::Vector{Float64},surface_gradient::Vector{Float64}, surface_gradient_s::Vector{Float64},
              diffusivity::Vector{Float64},diffusivity_s::Vector{Float64}, grad_x_diff::Vector{Float64}, Γ::Vector{Float64}, surface::Vector{Float64} ,
              flux_div::Vector{Float64} = p
          
              surface .= bed_hs .+ H 
          
              # Clip negative ice thickness values
              @views H[H.<0.0] .= 0.0
              @assert H[end-2] .== 0.0 "Glacier exceeding boundaries! at time $(t/sec_in_year)"
          
              # Surface gradient
              diff2!(surface_gradient, surface, 2.0*dx)

It does indeed look like you hard code types everywhere, don’t do that

Oscar_Smith · August 2, 2023, 12:38pm

You can use the implicit solvers without AD. For example pass FBDF(autodiff=false) as the solver.

ChrisRackauckas · August 2, 2023, 1:20pm

Note that finite difference of the solver does not make a huge impact on performance unless it’s a Rodas method. It’s like a 2x-4x performance thing because it’s forward mode AD and has nothing to do with the adjoints of the back solve which is what really matters.

That said, for PDEs the issue is that Rosenbrock23 is a bad idea. As the docs mention, it’s not a method that scales to larger systems well. Did you try FBDF or KenCarp47? Those are more sensible algorithms for large equations. And then when optimizing that, you should look into the tutorial on handling large systems:

Setting up sparse Jacobians and iLU or multigrid preconditioning is such a huge boost that you should always do it for PDEs.

LucilleGimenes · August 17, 2023, 9:15am

I am working with @JordiBolibar on this ice flow diffusivity PDE issue. Actually, the latest version of the f function that we use can be found here and the latest benchmark is available here.

We tried using FBDF and KenCarp47, and it is still at least 10 times slower than when using explicit solvers.

Also, when trying to set up sparse Jacobians with
u0 = iceflow_model.H
du0 = zeros(Float64,iceflow_model.nx)
jac_sparsity = Symbolics.jacobian_sparsity((du, u) -> SIA1D!(du, u, iceflow_model, 0.0), du0,u0)
we get the following error :

 MethodError: no method matching SIA1D!(::Vector{Num}, ::Vector{Num}, ::SIA1Dmodel{Float64, Int64}, ::Float64)
Closest candidates are:
  SIA1D!(!Matched::Vector{Float64}, !Matched::Vector{Float64}, ::SIA1Dmodel{Float64, Int64}, ::Float64) at ~/oggm/oggm/core/SIA1D_utils.jl:14
Stacktrace:
 [1] (::var"#10#13"{SIA1Dmodel{Float64, Int64}})(du::Vector{Num}, u::Vector{Num})
   @ Main ~/oggm/oggm/core/SIA1D_utils.jl:174
 [2] jacobian_sparsity(::var"#10#13"{SIA1Dmodel{Float64, Int64}}, ::Vector{Float64}, ::Vector{Float64}; kwargs::Base.Pairs{Symbol, Union{}, Tuple{}, NamedTuple{(), Tuple{}}})
   @ Symbolics ~/.julia/packages/Symbolics/BQlmn/src/diff.jl:584
 [3] jacobian_sparsity(::Function, ::Vector{Float64}, ::Vector{Float64})
   @ Symbolics ~/.julia/packages/Symbolics/BQlmn/src/diff.jl:579

ChrisRackauckas · August 17, 2023, 9:27am

Your caches force Float64, so that’s of course going to fail on AD and sparsity detection. That’s what PreallocationTools.jl is for:

To see if this is a direction you should go, did you try using GMRES without a preconditioner and see how that does?

LucilleGimenes · August 24, 2023, 2:54pm

Thanks for your message; by using PreallocationTools.jl we were able to use the AD of implicit solvers such as KenCarp47 or FBDF.

However, they still perform at least 3 times slower that the explicit solver we were using previously (RDPK3Sp35), even when setting up sparse jacobians and using GMRES (i.e adding linsolve = KrylovJL_GMRES() as a solver agument if that’s what you meant).

rveltz · August 24, 2023, 3:52pm

You need to tune KrylovJL_GMRES(), pass in verbose mode and see how fast it converges

ChrisRackauckas · August 24, 2023, 4:16pm

Are you sure the equation is stiff? What makes you think so?

If it’s on the edge, did you try ROCK2() or ROCK4()? There are some PDE cases which have “not tiny but not large real valued eigenvalues” (i.e. a Laplacian) where this is the most efficient solver.

What preconditioner? Did you ilu and tune the cutoff?

Topic		Replies	Views
DifferentialEquationsjl with ForwardDiff.jl New to Julia	4	1052	January 24, 2022
DifferentialEquations.jl Mass Matrix DAE Explicit Modelling & Simulations question	3	56	May 29, 2025
Passing the next proposed time step as a parameter into your ODEFunction where du depends on dt General Usage	7	81	July 10, 2025
ANN: Differentiable implicit functions in Julia (optimisation, nonlinear solves and fixed point iterations) Package Announcements optimization , nonlinear , autodiff	3	975	February 8, 2022
Implicit differentiation of rootfinding problem (w/ numerical issues) Numerics question , ad , implicit-equation	13	225	February 26, 2025

Setting up implicit solvers to beat the performance of explicit solvers

Related topics