Neural ODE works for small networks, but throws error for larger networks (getindex() method)?

jvkoch · August 3, 2022, 6:47pm

Hello all:

I’m running into strange behavior with neural ODEs defined with composite functions and FastChain definitions. In short, the below code works w/o issue for small neural networks, but fails for larger networks because of a getindex() issue. See this toy minimal (non)working example:

using Random, DiffEqFlux, DifferentialEquations, Flux, Optim
Random.seed!(0)

# Width <= 15 works fine:
width = 15
# Width >= 16 fails because of getindex method failure:
#width = 16

NN = FastChain(FastDense(1,width,swish), FastDense(width,1))
pNN = initial_params(NN)

p = [pNN;1.0]

function neural_ode(u, p, t)
    pNN = p[1:end-1]
    m = p[end]

    dudt = NN(u,pNN)[] - m
    return dudt
end

u0 = rand(1)[1]
tspan = (0.0,10.0)
t = Array(range(0,10,100))
prob_neuralode = ODEProblem(neural_ode, u0, tspan, p)

function loss_neuralode(p)
    trial = Array(solve(prob_neuralode,AutoTsit5(Rosenbrock23()),u0=u0,p=p,saveat=t,abstol = 1e-6,reltol = 1e-6))
    loss = sum(abs2, trial)
    return loss, trial
end

callback = function (p, l, pred; doplot = true)
    display(l)
    return false
end

result_neuralode = DiffEqFlux.sciml_train(loss_neuralode,
                                            p,
                                            ADAM(0.1),
                                            cb = callback,
                                            maxiters = 10)

And the beginning of the stacktrace:

ERROR: MethodError: no method matching getindex(::Float64, ::UnitRange{Int64})
Closest candidates are:
getindex(::Number) at /Applications/Julia-1.7.app/Contents/Resources/julia/share/julia/base/number.jl:95
getindex(::Union{AbstractChar, Number}, ::CartesianIndex{0}) at /Applications/Julia-1.7.app/Contents/Resources/julia/share/julia/base/multidimensional.jl:831
getindex(::Number, ::Integer) at /Applications/Julia-1.7.app/Contents/Resources/julia/share/julia/base/number.jl:96
…

I’m running 1.7 for this example.

Has anyone experienced behavior? Any idea what I might be doing wrong?

ChrisRackauckas · August 3, 2022, 10:46pm

Did you get a warning thrown during the solve that the solver diverged? I have a guess that is the case, and of course then you’d have a getindex error because it didn’t solve all of the way.

jvkoch · August 4, 2022, 12:44am

No solver divergence warning. To simplify the MWE even further, I’ve specified the RHS of the neural ODE as something trivial that is stable by construction (function dudt() always returns zero, regardless of the network architecture):

using Random, DiffEqFlux, DifferentialEquations, Flux, Optim
Random.seed!(0)

# Width <= 15 works fine:
width = 15
# Width >= 16 fails because of getindex method failure:
width = 16

NN = FastChain(FastDense(1,width,swish), FastDense(width,1))
pNN = initial_params(NN)

p = [zeros(length(pNN));1.0]

function neural_ode(u, p, t)
    pNN = p[1:end-1]
    m = p[end]

    dudt = zeros(length(NN(u,pNN)))[]
    return dudt
end

u0 = rand(1)[1]
tspan = (0.0,10.0)
t = Array(range(0,.10,100))
prob_neuralode = ODEProblem(neural_ode, u0, tspan, p)

function loss_neuralode(p)
    trial = Array(solve(prob_neuralode,AutoTsit5(Rosenbrock23()),u0=u0,p=p,saveat=t,abstol = 1e-6,reltol = 1e-6))
    loss = sum(abs2, trial.-zeros(length(t)))
    return loss, trial
end

callback = function (p, l, pred; doplot = true)
    display(l)
    return false
end

result_neuralode = DiffEqFlux.sciml_train(loss_neuralode,
                                            p,
                                            ADAM(0.1),
                                            cb = callback,
                                            maxiters = 10)

This example has the same behavior as that of my original post.

ChrisRackauckas · August 4, 2022, 2:43pm

ForwardDiffSensitivity wasn’t compatible with u0 as a scalar (we always used arrays!). Fixed that here:

Other adjoints will throw a nice error, but instead of fixing the error message here, this dispatch was easy to just solve.

jvkoch · August 4, 2022, 3:06pm

Ah! Thanks, Chris. Code is working now!

Topic		Replies	Views
ODE solver trying to access index [0] when running Graph NODEs Modelling & Simulations	6	139	July 5, 2024
Neural ODE in DiffEqFlux that is not a time series Machine Learning diffeq	6	1403	March 30, 2019
DiffEqFlux NaN Bug with Neural ODE: "BoundsError: attempt to access 1-element Vector{Float64} at index [2]" Machine Learning neural-network	3	516	October 30, 2021
A simplified example for DiffEqFlux New to Julia	1	415	July 27, 2021
Using ODE solution in NEURAL ODE General Usage sciml	6	275	July 20, 2023

Neural ODE works for small networks, but throws error for larger networks (getindex() method)?

Related topics