Optimising an imperfect, high-dimensional simulation with experimental data

duncan · October 25, 2022, 6:31am

I have used data from CFD simulations to create a Gaussian process surrogate of a mixing tank. The GP can estimate the reaction rate of the tank given a subset of the operating conditions, e.g. reagent concentration and impeller speed. However, my problem is that the underlying process is complex, involving the physicochemical interaction of three stages of matter. Therefore, the CFD simulations only approximate the actual behaviour, meaning the simulated dataset is somewhat idealistic.

I also have access to experimental reaction rate data of a similarly designed tank, although the data is likely very noisy. So my question is, what is the recommended approach for combining this experimental data with an imperfect simulator? I am currently considering pretraining a neural network with data from my simulator and finetuning the weights with the experimental data, but I would love any suggestions.

dlakelan · October 25, 2022, 1:51pm

Have you considered a Bayesian model with the existing neural network as predictor and some kind of simple model as a corrector, then learn the parameters of the corrector and you’d also get measures of uncertainty which might be useful rather than just a point estimate from the neural net.

duncan · October 25, 2022, 2:30pm

Thanks for the response! Uncertainty quantification would be nice to have, especially considering my GP simulator already quantifies its uncertainty.

So is the corrector essentially a Bayesian model of the residuals between my simulator and the experimental data? If that’s the case, would my corrected model be the sum of the GP simulator and the Bayesian corrector?

stevengj · October 25, 2022, 3:29pm

There is a wide literature on scientific machine learning trying to grapple with the best way to combine models and data. The answer depends on the circumstance, and even then it’s often still an open question what the best solution is.

However, in your case I would suggest looking into something like neural ODEs (or neural PDEs in your case). The goal is to keep as much of your trusted CFD model as possible, and only put a neural network into a small piece, e.g. for the unknown physicochemical interaction (presumably some local equation of state) — then you train the NN by minimizing error against experimental data end-to-end against the output of your NN-augmented CFD model.

@ChrisRackauckas does quite a bit of this sort of thing, and you might look into some of his papers on the topic as well as his DiffEqFlux.jl package.

dlakelan · October 25, 2022, 3:50pm

Yes that’s what I was thinking of though it’s hard to make specific suggestions without knowing more details in particular I was thinking that the space in which you are fitting is the space of not some cfd output but some summary such as the overall reaction rate.

duncan · October 25, 2022, 11:49pm

Thanks for the suggestion! Neural PDEs sound like an excellent option for my use case. Although, I guess the challenge will be embedding the NN into my current CFD code. I’m using OpenFOAM for my simulation, and I don’t know how this integrates with the DiffEqFlux.jl package.

For now, I’ll try @dlakelan’s suggestion since it is relatively straightforward to implement; however, I’m very keen to explore the development of an NN-augmented CFD model.

I appreciate the help!

stevengj · October 26, 2022, 12:00am

Ouch, backpropagating sensitivities through someone else’s huge C++ CFD code is not for the faint of heart.

Topic		Replies	Views
Parameter tuning in neural ODE Machine Learning diffeq , flux	15	2042	July 21, 2019
Fun with Flux and Differential equations? General Usage diffeq	12	1190	May 28, 2019
Uncertainties in DiffEqFlux.jl Modelling & Simulations question , diffeq	2	619	February 11, 2019
DiffEqFlux: neural_ode stops prematurely General Usage diffeq	4	571	January 24, 2019
GPU simulation time of neural ODE program Machine Learning diffeq , flux	8	897	August 16, 2019

Optimising an imperfect, high-dimensional simulation with experimental data

Related topics