Hobby project: SequentialFit.jl

gustaphe · February 21, 2021, 1:49pm

I don’t know mathematical optimization, and this might be a tremendously dumb idea, but I couldn’t get it out of my head after working on a project where I was fitting a surrogate model to the outputs from a very expensive simulation. So I made this thing for fun:

Not going to register it or anything, but I’m happy for feedback/discussions/links to related papers.

Basically, if fitting, taking the gradient of your model with respect to its parameters and optimizing on that gradient are all orders-of-magnitude cheaper operations than sampling, it might make sense to use your current model to inform the choice of sample points. Big drawback: It requires you to already be fairly confident in the general quality of your model, so it probably doesn’t hold up for science.

gustaphe · February 21, 2021, 2:52pm

And a picture because those drive engagement.

example

odow · February 21, 2021, 9:22pm

You might already be aware, but this is pretty similar to Bayesian Optimization.

gustaphe · February 22, 2021, 5:40am

I’ve encountered it, and it’s similar, but the big difference is that I impose a specific model function, and bayesian optimization (from what I understand) uses GPs, which are essentially polynomials.

In my use case, the purpose of the fit is to find specific, physically meaningful parameters, like linewidth, height and position of a gaussian. Extracting them from a GP seems difficult (but I’m not convinced I understand bayesian optimization enough to say for sure that this is not just a poor implementation of it).

odow · February 22, 2021, 8:14am

Your code seems different to pure BO, but the idea of fitting a surrogate and optimizing a metric of that pops up in different areas.

You could also look up “Bobyqa”.

mohamed82008 · February 22, 2021, 8:55am

There is also https://github.com/SciML/Surrogates.jl.

aplavin · February 22, 2021, 9:28am

The idea of this approach looks very interesting for fitting analytical physical models to expensive simulations, or to experimental data when one can actively control the next point to measure. It seems a more direct approach than going through some surrogate model and fitting an analytical function after that.
Could you please clarify what do you mean by:

Big drawback: It requires you to already be fairly confident in the general quality of your model, so it probably doesn’t hold up for science.

?

gustaphe · February 22, 2021, 9:48am

Sure. Like I said, I haven’t investigated this thoroughly, but unlike Bayesian optimization it doesn’t actually act on uncertainty, but rather on “If my model is (close to) correct, where should I sample to get maximum information?”. If your initial guess of parameters is a bit off, there is no proper method to explore other regions (we have exploitation but not exploration if my opti-lingo is up to date).

Maybe there’s some good change I could make to improve the algorithm in this respect – I know for certain that adding the prod(s.X .- x)^2 factor was necessary to stop it from just sampling the same spot on what it thought were the slopes over and over.

In the example above we can see that it does a decent job, but I’m not certain I could trust it without checking against the ground truth. And I have no idea what kind of functions this would work for vs not.

aplavin · February 22, 2021, 10:51am

Hm, I see… It relies on the general behaviour of the model function with the current parameter values, and if the reality is vastly different the point selection will be far from optimal.
Are you aware of any papers/studies of such methods? I tried googling some keywords but no success.

gustaphe · February 22, 2021, 11:12am

Nope, I was kind of hoping some would turn up here

ctkelley · February 22, 2021, 12:18pm

This paper seems to propose a similar idea. The authors implemented it in the old Boeing Design Explorer software.

@INPROCEEDINGS{jdaiaa2,
author = “{Audet, C.} and {J.E. Dennis} and {D. W. Moore} and
{A. J. Booker} and {P. D. Frank}”,
title = “A Surrogate-Model-Based Method for Constrained Optimization,
AIAA-2000-4891”,
year = “2000”,
booktitle= “Eighth AIAA/USAF/NASA/ISSMO Symposium on Multidisciplinary
Analysis and
Optimization”
}

rafael.guerra · February 22, 2021, 3:14pm

@mohamed82008, on the surrogates topic, the following slides: Introduction to Gaussian Process Surrogate Models, by Nicolas Durrande and Rodolphe Le Riche (2017), provide a really good overview.

mohamed82008 · February 22, 2021, 3:22pm

This is wrong btw. GPs aren’t essentially polynomials. They are very different beasts. GPs are semi-parametric models in the sense that you need all the data around to do prediction with them, not just the parameters. GPs are naturally probabilistic and Bayesian which makes them an awkward choice to “fit” deterministic functions but it will work anyways if you ignore the variance. But that property of GPs also makes them seamlessly extend to stochastic optimisation where the objective or constraints can be random in nature.

gustaphe · February 22, 2021, 9:44pm

This does not surprise me. I haven’t found a definition that relates them to anything I understand. I think there’s a couple of steps of reading I would have to do in between.

Topic		Replies	Views
[ANN] SurrogateModelOptim.jl Package Announcements optimization	3	1343	January 13, 2020
Parallel Optimization with Surrogates.jl Optimization (Mathematical) question	6	590	June 14, 2023
Finding MAP estimate in Turing? Statistics bayesian-inference	1	976	June 20, 2020
Optimizing a very expensive function evaluated in C++ Optimization (Mathematical)	15	1452	June 22, 2020
Bayesian optimization Optimization (Mathematical) question	8	680	February 5, 2023

Hobby project: SequentialFit.jl

Related topics