Computing value and first & second derivatives of a scalar function with scalar input, using ForwardDiff

I cannot find, either in the ForwardDiff documentation or on the web more widely, any information on how to compute the second derivative of a scalar function with a scalar input. How can I do this, given a function f that takes x<:Real as input?

Is there a way to compute the function output and the first and second derivatives simultaneously, like calling hessian! with a DiffResult? Note that that approach only works with array-like inputs.

I think just differentiating twice would work, although you probably want to take a look at GitHub - JuliaDiff/TaylorDiff.jl: Taylor-mode automatic differentiation for higher-order derivatives too.
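A minimal sketch of the TaylorDiff route, assuming its exported derivative(f, x, order) method (the function h and the point 0.5 are just illustrative; check the package README for the exact signature):

using TaylorDiff

h(x) = sin(x)
# second derivative of h at x = 0.5 from a single Taylor-mode evaluation
TaylorDiff.derivative(h, 0.5, 2)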


My current solution is as follows:

using ForwardDiff, DiffResults, StaticArrays

function computehessian(f, x::AbstractArray)
    result = DiffResults.HessianResult(x)
    result = ForwardDiff.hessian!(result, f, x)
    return DiffResults.value(result), DiffResults.gradient(result), DiffResults.hessian(result)
end

# Scalar method: wrap x in a length-1 SVector, then unwrap each piece of the result with first
computehessian(f, x::T) where T <: Number = map(first, computehessian(f ∘ first, StaticArrays.SVector{1, T}(x)))
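For example (assuming the definitions above; the value and both derivatives of exp at zero are all exactly 1):

computehessian(exp, 0.0)   # (1.0, 1.0, 1.0)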

You can differentiate twice:

df(x)  = ForwardDiff.derivative(f, x)    # first derivative
d2f(x) = ForwardDiff.derivative(df, x)   # second derivative via a nested dual

sought_result = (f(x), df(x), d2f(x))
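For a quick sanity check, with f(x) = x^3 at x = 2.0 this gives (8.0, 12.0, 12.0), matching f = x^3, f' = 3x^2, f'' = 6x:

f(x) = x^3
df(x)  = ForwardDiff.derivative(f, x)
d2f(x) = ForwardDiff.derivative(df, x)
(f(2.0), df(2.0), d2f(2.0))   # (8.0, 12.0, 12.0)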

Thanks. The following code:

using ForwardDiff, DiffResults, StaticArrays, BenchmarkTools

function computehessian(f, x::AbstractArray)
    result = DiffResults.HessianResult(x)
    result = ForwardDiff.hessian!(result, f, x)
    return DiffResults.value(result), DiffResults.gradient(result), DiffResults.hessian(result)
end
computehessian(f, x::T) where T <: Number = map(first, computehessian(f ∘ first, SVector{1, T}(x)))

function computehessian2(f, x)
    df(x)  = ForwardDiff.derivative(f, x)
    d2f(x) = ForwardDiff.derivative(df, x)
    return (f(x), df(x), d2f(x))
end

x = rand()
@btime computehessian(exp, $x)
@btime computehessian2(exp, $x)

produces

  50.279 ns (1 allocation: 64 bytes)
  27.513 ns (0 allocations: 0 bytes)

so it seems your approach is better. :smiley:


However, with a more complicated f it isn’t so clear-cut:

x = rand()
w = rand()
s1 = rand()
s2 = rand()
f(w, s1, s2, x) = log(w * s1 * exp(-0.5 * x * s1 ^ 2) + (1 - w) * s2 * exp(-0.5 * x * s2 ^ 2))
# Fix all arguments except the last, so that g(x) == f(w, s1, s2, x)
expandfunc(args, v) = args[1](args[2:end]..., v)
fixallbutlast(func, args...) = Base.Fix1(expandfunc, (func, args...))
g = fixallbutlast(f, w, s1, s2)
@btime computehessian($g, $x)
@btime computehessian2($g, $x)

outputs:

  69.701 ns (1 allocation: 64 bytes)
  83.289 ns (0 allocations: 0 bytes)

Given this slowdown (caused by evaluating f three times, once each for the value, the first derivative, and the second derivative, instead of once), I think it strange that the hessian! functionality doesn’t exist for scalar x, so I opened an issue about it.
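In the meantime, one partial workaround is to recover the value and the first derivative from a single evaluation of f by passing a scalar DiffResult to ForwardDiff.derivative!, so only the second derivative needs an extra nested call. This is just a sketch (the computehessian3 name is made up):

using ForwardDiff, DiffResults

function computehessian3(f, x::Real)
    # value and first derivative from one dual evaluation of f
    r = DiffResults.DiffResult(zero(x), zero(x))
    r = ForwardDiff.derivative!(r, f, x)    # immutable result, so rebind the return value
    # the second derivative still needs one nested-dual evaluation of f
    d2 = ForwardDiff.derivative(y -> ForwardDiff.derivative(f, y), x)
    return DiffResults.value(r), DiffResults.derivative(r), d2
end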