It’s going really well! Enzyme is looking to become the general AD, IMO, given that it has such a wide surface of support. That said, whether Enzyme is right for you (or for machine learning) is really a binary thing: it depends entirely on what your code does. Right now it doesn’t fully support code that hits the GC or dynamic dispatch. Part of this delay was because Valentin (one of the biggest contributors) was just off… adding native precompilation caching to Julia (https://github.com/JuliaLang/julia/pull/47184). So can’t be mad about that. But if your code does hit the GC or dynamic dispatch, you can’t count on it working, since parts of that are not quite supported yet, which basically means “there be dragons” for now, and I would only suggest using it on non-allocating, fully inferred code.
That being said, it’s the default AD used inside SciMLSensitivity.jl these days; it’s extremely fast, supports mutation, and is robust within the confines of those two caveats above. Its rules system is mostly worked out: it’s just a question of making it less tedious in the context of activity analysis.
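To make the “non-allocating, fully inferred” sweet spot concrete, here’s a minimal sketch of the kind of code Enzyme handles well today (`square!` is just a made-up kernel; double-check the exact `autodiff` signature against the Enzyme.jl docs for your version):

```julia
using Enzyme

# In-place kernel: no allocations, fully type-inferred
function square!(y, x)
    for i in eachindex(x, y)
        y[i] = x[i]^2
    end
    return nothing
end

x  = [1.0, 2.0, 3.0]
dx = zeros(3)   # shadow of x: accumulates dL/dx
y  = zeros(3)
dy = ones(3)    # seed: dL/dy for L = sum(y)

# Reverse-mode AD straight through the mutating function
Enzyme.autodiff(Reverse, square!, Const, Duplicated(y, dy), Duplicated(x, dx))

dx  # ≈ 2 .* x
```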
Enzyme core is growing in contributors, and there’s an Enzyme conference coming up.
They received the Best Student Paper award at Supercomputing 2022:
https://www.csail.mit.edu/news/mit-csail-phd-students-receive-best-student-paper-supercomputing-2022
So with that kind of momentum, a growing contributor base (shared, at the LLVM level, with contributors from the Rust community), and a solid foundation that supports mutation from the get-go, it’s really on the right path to be a full language-wide AD system. It’s not quite there yet, but it has the momentum to become the new foundation.
In the meantime, using Zygote and defining adjoint rules on your mutating operations that just call Enzyme isn’t a bad option.
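That pattern looks roughly like the sketch below. It’s a hypothetical example (`mul2!` and `double` are made-up names, and the exact Enzyme activity annotations may need tweaking for your version), but the idea is: give the non-mutating wrapper a ChainRules `rrule`, and have its pullback call Enzyme on the mutating kernel:

```julia
using Zygote, Enzyme, ChainRulesCore

# A mutating kernel that Zygote can't trace directly
function mul2!(y, x)
    @. y = 2 * x
    return nothing
end

# Non-mutating wrapper exposed to Zygote
double(x) = (y = similar(x); mul2!(y, x); y)

# Custom adjoint: the pullback hands the mutating kernel to Enzyme
function ChainRulesCore.rrule(::typeof(double), x)
    y = double(x)
    function double_pullback(ȳ)
        dx = zero(x)
        dy = collect(ȳ)   # seed; Enzyme consumes it in place
        Enzyme.autodiff(Reverse, mul2!, Const,
                        Duplicated(copy(y), dy), Duplicated(x, dx))
        return NoTangent(), dx
    end
    return y, double_pullback
end

Zygote.gradient(x -> sum(double(x)), [1.0, 2.0, 3.0])  # ([2.0, 2.0, 2.0],)
```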
That said…
The most fun is the new AD work:
StochasticAD.jl is based on a new form of automatic differentiation that extends AD to discrete stochastic programs.
https://arxiv.org/abs/2210.08572
This allows things like agent-based models and particle filters to be differentiated automatically.
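As a rough sketch of what that enables (using the package’s documented `derivative_estimate` entry point; treat the exact call as an assumption): you can get unbiased derivative estimates of the expectation of a program whose individual samples are discrete:

```julia
using StochasticAD, Distributions, Statistics

# Each sample is an integer, but E[f(p)] = 10p is smooth in p,
# so d/dp E[f(p)] = 10.
f(p) = rand(Binomial(10, p))

derivative_estimate(f, 0.5)  # one unbiased single-sample estimate of d/dp E[f(p)]

# Average many estimates to cut the variance
mean(derivative_estimate(f, 0.5) for _ in 1:10_000)  # ≈ 10
```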
Additionally, there’s a new ForwardDiff-like AD being developed for higher-order AD.
It also adds some vector-based rules that ForwardDiff doesn’t have, which lets it handle neural networks and linear algebra well.
It’s still under heavy development, but it avoids the compiler-level machinery that generally makes AD harder to build, so it should get up to speed relatively quickly.
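For context on what “ForwardDiff-like for higher-order AD” is improving on: with ForwardDiff today you get higher derivatives by nesting dual numbers, which works but gets expensive as the order grows; a purpose-built higher-order forward mode with its own vector rules avoids that nesting. The nested version looks like this (standard ForwardDiff, not the new package):

```julia
using ForwardDiff

f(x) = sin(x) * exp(x)

# Higher-order derivatives via nested duals (dual-of-dual numbers)
d1(x) = ForwardDiff.derivative(f, x)   # f'(x)
d2(x) = ForwardDiff.derivative(d1, x)  # f''(x)

d2(1.0)  # ≈ 2 * cos(1) * exp(1)
```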