T Distribution - Automatic Differentiation on quantile and cdf function

mrVeng · October 28, 2022, 9:01am

Hi there,

I need to use the cdf (and quantile) function of the T distribution inside a function that I want to use Automatic Differentiation on. This works fine with almost any univariate distribution from the Distributions.jl package, but unfortunately not with the T distribution. Here is an MWE:

using ForwardDiff, ReverseDiff
using Distributions
function mytargetfunction(data::AbstractVector)
    function obtaingradient(ν::AbstractVector{R}) where {R<:Real}
        dist = Distributions.TDist(ν[1])
        data_uniform = cdf.(dist, data)
        return sum( logpdf(dist, data_uniform[i]) for i in eachindex(data_uniform) )
    end
end

#working
ν = [3.0]
data = randn(1000)
target = mytargetfunction(data)
target(ν)
#not working
ForwardDiff.gradient(target, ν) #MethodError: no method matching Float64(::ForwardDiff.Dual{ForwardDiff.Tag
ReverseDiff.gradient(target, ν) #ArgumentError: Converting an instance of ReverseDiff.TrackedReal{Float64, Float64, ReverseDiff.TrackedArray{Float64, Float64, 1, Vector{Float64}, Vector{Float64}}} to Float64 is not defined. Please use `ReverseDiff.value` instead.

If I swap the T distribution in the closure with, e.g., a Normal, everything works fine. Are there any alternative packages for Julia that allow me to use a AutoDiff friendly T distribution? Or does anyone know a better trick to solve that problem? I can write down the cdf/quantile function analytically but only for a few selected cases unfortunately.

goerch · October 28, 2022, 10:57am

Open problem?

mrVeng · October 31, 2022, 8:57am

Anyone having an idea of what I could do? I am afraid the Github link is out of my league.

stevengj · October 31, 2022, 11:36am

Finite differences are always available as workaround?

cgeoga · October 31, 2022, 1:20pm

Perhaps using HypergeometricFunctions.jl would be acceptable? I doubt this is as fast or robust as some other options, but in easy tests it seems to be fine:

using StatsFuns, SpecialFunctions, HypergeometricFunctions, ForwardDiff, FiniteDifferences

# not thoughtfully written at all
function tcdf(v, x)
  1/2 + x*gamma((v+1)/2)*_₂F₁(1/2, (v+1)/2, 3/2, -x*x/v)/(sqrt(pi*v)*gamma(v/2))
end

# CDF check:
@assert tcdf(1.1, 3.1) ≈ StatsFuns.tdistcdf(1.1, 3.1)

# Forward auto check:
@assert isapprox(ForwardDiff.derivative(_v->tcdf(_v, 3.1), 1.1),
                 central_fdm(10,1)(_v->tcdf(_v, 3.1), 1.1))

Obviously it’s preferable to have rules than to hope that ForwardDiff will pass through every branch of a function like that and give you the correct output—I had this fight with besselk a while ago, and it was an enormous pain to sort out…so I would guess the issue is even worse for a much more complicated function like 2F1.

For the quantile function, you could use a root finder on this CDF and ImplicitDifferentiation.jl to get the derivatives.

Again, though, it’s not obvious at all that a code implementation of 2F1 will give correct auto-derivatives of 2F1, so you should probably write a lot of tests that try to cover your uses cases and domain region and stuff.

Hope this helps!

tbeason · October 31, 2022, 3:00pm

Don’t these packages address this problem?

mrVeng · November 7, 2022, 3:24pm

Thanks a lot for the suggestions and your work!

I checked the manual implementation of the cdf with HypergeometricFunctions and it does work for my example! Unfortunately, it seems to segfault for quite a wide range of parameter values, making it difficult to work with.

I adjusted my expectations a bit and will try to define a custom Chainrules rule for my case to work.

mrVeng · November 7, 2022, 3:26pm

Thanks for your reply!

DistributionsAD does indeed do that, but there does not seem to be a method defined for the cdf of the Tdistribution.

lrnv · November 7, 2022, 3:37pm

Or also there add tdist cdfs and quantiles by oscardssmith · Pull Request #147 · JuliaStats/StatsFuns.jl · GitHub there is a PR fixing this issue properly (implementing the right functions into StatsFuns.jl). It might just need some help to comply to the reviews and fix it properly, but this is a good start.

mrVeng · November 7, 2022, 3:42pm

Thats great news, thanks for the link!

Topic		Replies	Views
SpecialFunctions + Turing, AD support for quantile function of Gamma distribution gamma_inv_cdf General Usage turing , distributions , specialfunctions	0	121	April 30, 2024
ReverseDiff not using custom derivative from ChainRules General Usage reversediff , autodiff , chainrulescore	3	377	November 7, 2022
(approximate) PDFs/CDFs suitable for automatic differentiation? Statistics	6	1416	April 3, 2017
Use FiniteDiff.jl in Turing.jl Probabilistic Programming turing	20	480	May 9, 2024
Differentiation of incomplete Beta function Numerics question , differentiation , forwarddiff	3	960	June 9, 2021

T Distribution - Automatic Differentiation on quantile and cdf function

Related topics