@Tamas_Papp I am not sure we need a common convention, as conversions should be trivial. Also
I would rather have the user convert between log and linear values when applicable. TBH, for MCMC a non-log PDF just does not make sense numerically, and it's not something one can recover from with a simple `log`, as most of the precision is already lost to under/overflow.
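To illustrate what I mean by losing precision, here's a minimal sketch in plain Julia (no package specifics assumed):

```julia
# A log-density of -1000 is perfectly representable ...
logp = -1000.0

# ... but its linear-scale counterpart underflows to zero in Float64,
p = exp(logp)   # 0.0

# so taking a "simple log" afterwards cannot recover the original value:
log(p)          # -Inf
```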
It’s not so much about the auto-conversion, but about defending against the user passing the wrong kind of value. For example, in BAT.jl I have a `PosteriorDensity(likelihood, prior)`. It’s meant to accept pretty much anything as a likelihood (a function, a distribution, etc.), but I do want to defend against (possibly less experienced) users passing a function that produces a linear value. By returning some kind of wrapper type (kind of like a unit), the user makes an explicit statement: “this is a log value”. Also, this way a likelihood can just be called that. It’s just weird having `PosteriorDensity(likelihood, prior)` when the likelihood is an object like a density or distribution, but having to write `PosteriorDensity(log_likelihood, prior)` just to make it clear that it returns a log value if it’s a function.
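A rough sketch of the wrapper-type idea; the names `LogVal` and `check_likelihood` are illustrative here, not the actual BAT.jl API:

```julia
# Hypothetical wrapper marking a value as being on the log scale:
struct LogVal{T<:Real}
    value::T
end

# The user makes an explicit statement "this is a log value":
likelihood(params) = LogVal(-0.5 * sum(abs2, params))

# The constructor can then reject functions returning plain (linear) numbers:
function check_likelihood(f, params)
    r = f(params)
    r isa LogVal || throw(ArgumentError("likelihood must return a LogVal"))
    return r.value
end

check_likelihood(likelihood, [0.5, -1.2])   # ≈ -0.845
```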
As for auxiliary/incidental computations, it is kind of tricky to decide what should be done with them, or whether they are needed at all. Keeping e.g. 1024 of them around for a NUTS tree of depth 10 may be excessive — I would rather allow them to be calculated on demand with a closure, similar to how Zygote handles backpropagation by returning a pullback closure. See the sketch below.
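Something like this, as a rough sketch (the names are hypothetical): the evaluation returns the log-density together with a closure, and auxiliary quantities are only computed for points that actually end up in the sample:

```julia
# Return the log-density plus a closure producing auxiliary data on demand,
# analogous to Zygote returning (value, pullback):
function logdensity_and_aux(params)
    logd = -0.5 * sum(abs2, params)
    aux = () -> (sumsq = sum(abs2, params),)   # computed only when called
    return logd, aux
end

logd, aux = logdensity_and_aux(randn(5))
extras = aux()   # evaluated only for points that are actually kept
```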
Sure, values that come with extra computational effort should only be calculated on demand. What I mean, though, is data that is generated inherently as part of the (likelihood or whatever) calculation and that the user wants to preserve; in large-scale applications that's not an uncommon requirement.
Of course you’re right, one will probably want to filter/thin it a bit in the output, but that could be controlled by config parameters for the algorithm or similar.
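For example (hypothetical config fields, just to sketch the idea):

```julia
# A sampler config could control whether and how often inherently
# generated auxiliary data is preserved alongside the samples:
Base.@kwdef struct AuxConfig
    keep_aux::Bool = false   # preserve auxiliary data at all?
    aux_thinning::Int = 10   # if so, keep it for every n-th sample only
end

function maybe_store_aux!(storage, step::Int, aux, cfg::AuxConfig)
    if cfg.keep_aux && step % cfg.aux_thinning == 0
        push!(storage, aux())   # aux is the on-demand closure from above
    end
end
```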