Multiple shooting and multiple options

JordiBolibar · April 30, 2024, 3:32pm

I am trying to add multiple shooting to a neural differential equation. After browsing multiple options, I see I have two possibilities:

Use DiffEqFlux.jl’s multiple_shoot
Use DiffEqParamEstim.jl’s multiple_shooting_objective.

According to DiffEqFlux’s docs, this option “is a specialized form for implicit layer deep learning (known as data shooting) which assumes full observability of the underlying dynamics and lack of noise”.

I am unsure what is exactly meant by this. Right now, in our problem we are running an ODE forward as a ground truth and we are adding Fisher noise on top of it. Then, we are trying to learn the underlying dynamics from that “ground truth” using a neural ODE. Would DiffEqFlux be the way to go there, or it would only work for cases where the dynamics are perfectly clean and known? So far I have implemented this and it keeps going without even making it to the first epoch.

Would DiffEqParamEstim be a better alternative? I like the interface of multiple_shoot better, since it allows for more control.

If necessary, I can provide more details on the exact ODE and problem we are trying to solve. Thanks in advance!

Vaibhavdixit02 · April 30, 2024, 3:35pm

Prefer the DiffEqFlux one, that should be more recent

ChrisRackauckas · May 3, 2024, 12:26pm

DiffEqFlux’s is a special form that I call data shooting. It’s not the more general form. I think someone should generalize it a bit before I’d recommend it more.

JordiBolibar · May 3, 2024, 12:45pm

I see. What do you mean exactly by "it is a specialized form for implicit layer deep learning (known as data shooting) which assumes full observability of the underlying dynamics and lack of noise”? Do you need to have a dense continuous solution without noise for it to work?

We are trying it in our SphereUDE project with @facusapienza and it doesn’t seem to converge for a controlled experiment with noise.

ChrisRackauckas · May 3, 2024, 12:47pm

No I mean it takes the data points directly as samples for initial conditions, rather than having free floating initial conditions that are learned. This thus assumes the data points are “exactly correct” in some sense.

Not surprising.

JordiBolibar · May 3, 2024, 12:48pm

OK, so what is the right tool for this? It’s really not clear in the docs.

ChrisRackauckas · May 6, 2024, 11:00am

I don’t have an implementation I’d prefer right now.

facusapienza · May 8, 2024, 10:48pm

@ChrisRackauckas I am thinking for this problem just to make an implementation of multiple shooting where the initial conditions for each batch can be learned. I understand the example in multiple_shoot is just for the noiseless case, so no initial condition is trained. I am planning to implement something like this in here SphereUDE.jl by increasing the parameter to be optimized, in the same way we did this for the initial condition:

    # Set component vector for Optimization
    if params.train_initial_condition
        β = ComponentVector{Float64}(θ=θ, u0=params.u0)
    else
        β = ComponentVector{Float64}(θ=θ)
    end

I can try to work on an example that I can then PR to DiffEqFlux.jl if this is an useful contribution. I am happy to help here with an implementation for the case with noise, I just need some guidance of where to contribute and being sure this is a required feature.

Topic		Replies	Views
Questions about the DiffEqFlux multiple shooting example Machine Learning diffeqflux	4	100	June 2, 2025
DiffEqParamEstim.jl and multiple_shooting_objective General Usage diffeq , optim	6	1159	May 17, 2021
No method matching loss_multiple_shooting General Usage	1	549	May 3, 2021
Parameter optimization using Multiple shooting Machine Learning	1	316	September 21, 2023
Multiple shooting with multiple trajectories: how to properly implement this? Optimization (Mathematical) question , diffeq	2	158	September 3, 2024

Multiple shooting and multiple options

Related topics