Hi Julia Speed Gurus,
I’m trying to demonstrate Julia’s cool capabilities for embedding shallow neural networks to select key parameters for an ensemble of differential equations. I can use the ensemble problem fine, and when I solve with threads I get a speedup vs. serial evaluation. But it seems like I could get much MORE speedup.
Let me describe the problem. Unfortunately I don’t have a minimal working example, so I understand if this gets no support; really I’m asking whether I’m approaching the concept correctly, or whether there’s a better way hidden deep down in the documentation.
This is what I have:
- A DiffEq-based simulation with states and parameters
- Several of the parameters (5-7 or so) need to be optimized based on the initial conditions of the states
- A Flux network (because I haven’t tried Lux yet and need to find a “how-to-convert-to-Lux” document) takes key initial conditions and chooses the best 5-7 parameters as its output; a cost function evaluates the end state of the differential equation, accumulated over the entire ensemble
- I create a random problem set to feed an ensemble of Monte Carlo runs so that the network sees a variety of start conditions
- I calculate a loss/cost for each instance of the ensemble and sum it over all training vectors
- This is all wrapped in an Optimization.jl problem solve. I use AutoForwardDiff for the gradients, as I haven’t been able to figure out all of the changes to my diffeq needed to make Zygote work.
- This all works (a rough toy sketch of the structure follows this list).
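To make the structure concrete without my real model, here is a minimal toy sketch of the shape of the setup. The ODE, network sizes, target value, and names (`rhs!`, `u0s`, `nn`, etc.) are all placeholders, not my actual code:

```julia
using OrdinaryDiffEq, Flux, Optimization, OptimizationOptimisers
using Random

# Toy 2-state ODE; p holds the handful of parameters the network must pick.
function rhs!(du, u, p, t)
    du[1] =  p[1] * u[1] - p[2] * u[1] * u[2]
    du[2] = -p[3] * u[2] + p[4] * u[1] * u[2]
end

# Shallow network: 2 initial conditions in, 4 parameters out (placeholder sizes).
nn = f64(Chain(Dense(2 => 16, tanh), Dense(16 => 4, softplus)))
θ0, re = Flux.destructure(nn)          # flat parameter vector + rebuilder

# Random bank of initial conditions so the network sees varied start conditions.
Random.seed!(1)
u0s = [rand(2) .+ 0.5 for _ in 1:50]

base_prob = ODEProblem(rhs!, u0s[1], (0.0, 10.0), ones(4))

function loss(θ, _)
    net = re(θ)
    prob_func = (prob, i, repeat) -> remake(prob; u0 = u0s[i], p = net(u0s[i]))
    ens = EnsembleProblem(base_prob; prob_func = prob_func)
    sols = solve(ens, Tsit5(), EnsembleThreads();
                 trajectories = length(u0s), save_everystep = false)
    # Cost on the end state, summed over the whole ensemble (toy target of 1.0).
    sum(sum(abs2, sol.u[end] .- 1.0) for sol in sols)
end

optf    = OptimizationFunction(loss, Optimization.AutoForwardDiff())
optprob = OptimizationProblem(optf, θ0)
res     = solve(optprob, OptimizationOptimisers.Adam(1e-2); maxiters = 100)
```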
Problem: the threading only seems to happen across the ensemble of diffeq solves. So if I have, say, 24 cores and 50 ensemble seeds, each call out to the thread pool isn’t very efficient. But my network has on the order of 600 parameters, so depending on the ForwardDiff chunk size, I run those 50 training vectors (threaded) about 600/chunksize times. Submitting 50 seeds at a time to 24 cores is much less efficient than submitting all 50*600/chunksize evaluations at once.
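A back-of-envelope on those numbers (the chunk-size cap of 12 is my assumption about ForwardDiff’s default; my actual chunk size may differ):

```julia
nparams, chunk, nseeds, ncores = 600, 12, 50, 24

nsweeps = cld(nparams, chunk)    # serial ForwardDiff passes per gradient ≈ 50
waves   = cld(nseeds, ncores)    # thread "waves" per pass = 3 (last one only 2/24 busy)
current_waves = nsweeps * waves               # ≈ 150 partially filled waves per gradient
pooled_waves  = cld(nsweeps * nseeds, ncores) # ≈ 105 nearly full waves if all solves were pooled
@show nsweeps waves current_waves pooled_waves
```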
How do I get the thread submission to include the gradient calculations as well? Is that even mathematically possible (he lazily asks)? Because if I could go distributed, I actually have 960 cores at my disposal right now on an evaluation cluster, so I could really get this moving with the distributed ensemble… at least in my dreams. At the very least, the single-node 24-core job would be more thread-overhead efficient.
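For reference, the distributed variant I have in mind would (I assume) just swap the ensemble algorithm, something like this untested sketch reusing the names from the toy example above; everything the `prob_func` closes over would need `@everywhere` definitions on the workers:

```julia
using Distributed
addprocs(24)                       # or a ClusterManager for the 960-core machine

@everywhere using OrdinaryDiffEq   # plus @everywhere versions of rhs!, u0s, the network, ...

sols = solve(ens, Tsit5(), EnsembleDistributed();
             trajectories = length(u0s), save_everystep = false)
```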
Thoughts on how I could enable more throughput?
Are there good examples out there?
Thanks for the great support. Love the Julia community!
Best Regards,
Allan