Compile FastDifferentiation derivatives one time only

Is it, though? The type FunctionWithPreppedGradient is owned by the user, so I don't think it's type piracy in the strict sense. Depending on the dispatch rules inside DifferentiationInterface (of which you are the expert :slight_smile: ), it could cause method-ambiguity problems, and perhaps the signature needs to be more specific.

In this case it is convenient to define this additional method: the resulting code would then work regardless of whether the gradient has been prepared or not (if I understand the documentation correctly, value_and_gradient does not require the prepared object).
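For reference, here is a minimal sketch of the two call forms being discussed, with and without preparation. The test function and backend choice are arbitrary examples, not from the thread:

```julia
using DifferentiationInterface
import ForwardDiff  # the backend package must be loaded for AutoForwardDiff to work

f(x) = sum(abs2, x)
backend = AutoForwardDiff()
x = [1.0, 2.0, 3.0]

# Without preparation: works directly, but setup cost is paid on every call.
y1, g1 = value_and_gradient(f, backend, x)

# With preparation: setup cost is paid once and amortized over repeated calls.
prep = prepare_gradient(f, backend, x)
y2, g2 = value_and_gradient(f, prep, backend, x)
```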

Thinking this over while typing, I agree that it would probably be better to define a custom function and just forward to DifferentiationInterface.jl.
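A sketch of what "define a custom function and forward to DifferentiationInterface.jl" could look like. The wrapper type, its fields, and the operator name are all hypothetical; the point is that no method is added to DifferentiationInterface itself, so no ambiguities can arise:

```julia
using DifferentiationInterface
import ForwardDiff  # example backend; FastDifferentiation would work the same way

# Hypothetical user-owned wrapper bundling a function with its preparation.
struct GradientWrapper{F,B,P}
    f::F
    backend::B
    prep::P
end

function GradientWrapper(f, backend, x)
    prep = prepare_gradient(f, backend, x)
    return GradientWrapper(f, backend, prep)
end

# Custom operator that forwards to DI; DI's own methods are untouched.
my_value_and_gradient(g::GradientWrapper, x) =
    value_and_gradient(g.f, g.prep, g.backend, x)

g = GradientWrapper(x -> sum(abs2, x), AutoForwardDiff(), zeros(3))
y, grad = my_value_and_gradient(g, [1.0, 2.0, 3.0])
```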


Sorry, yes: not piracy, but it will definitely cause ambiguities, because DI specializes on the backend rather than the function, whereas this wrapper goes the other way.


I think you could design FunctionWithPreppedGradient so that it lazily saves the prepared gradient/Hessian/etc. on the first call. Then it would not be very cumbersome, I think. You'd just have to wrap each of the functions in that struct, and the rest of the code would look like the Tensorial.jl code, no? I don't think you could get any closer, at least not easily.
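A sketch of the lazy variant under the same hypothetical naming. The preparation is computed on the first call and cached; note that the untyped cache field introduces a type instability, and the struct is not thread-safe as written:

```julia
using DifferentiationInterface
import ForwardDiff  # example backend

# Hypothetical wrapper that prepares lazily on first use.
mutable struct LazyGradient{F,B}
    f::F
    backend::B
    prep::Any  # untyped cache: `nothing` until the first call (type-unstable)
end
LazyGradient(f, backend) = LazyGradient(f, backend, nothing)

# Make the wrapper callable: prepare once, then always forward with `prep`.
function (g::LazyGradient)(x)
    if g.prep === nothing
        g.prep = prepare_gradient(g.f, g.backend, x)
    end
    return value_and_gradient(g.f, g.prep, g.backend, x)
end

g = LazyGradient(x -> sum(abs2, x), AutoForwardDiff())
y, grad = g([1.0, 2.0, 3.0])  # first call triggers preparation
```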

But if your version with Tensorial's custom differentiation operators works well, maybe there's no need for more.

Sorry for the late reply. If I understand correctly, I would still need to pass a FunctionWithPreppedGradient object instead of prep to each function. In that case, I don't think it's a fundamental solution: if I want to call func4 from within func3, for example, I would need to modify all the other functions as well. That structure doesn't seem good to me.

With a static array (or tensor) you can deduce the size from the type, so maybe a macro inside a generated function could be enough? I don't know whether that's possible; just grasping at straws here.

Thank you as always for your thoughtful suggestions. However, to build the framework I want, it seems essential to evaluate the given function f at compile time inside derivative(f, x), for example, and that appears to be infeasible. Alternatively, the preparation step would have to return a bitstype, as ForwardDiff.jl effectively does in the SArray case.