Zygote custom adjoint has surprising performance effects

SEA · April 3, 2020, 3:26pm

I am experimenting with custom adjoints in Zygote. In the code below, I define an adjoint for the function gauss. Surprisingly, benchmarking gradient on gauss and on a “wrapper” function f around gauss shows that the gradient of f is 3-4 times faster than that of gauss. Before I defined the custom adjoint, the two gradients ran at the same speed. With the adjoint defined, f’s gradient is 50% faster than before, while gauss’s gradient is 2 times slower.

Why does this happen, and how can I get optimal performance with custom adjoints consistently?

using Zygote
using BenchmarkTools

function gauss(x, μ, σ)
    y = (x-μ)/σ
    exp(-y^2/2) / (sqrt(2π)*σ)
end

using Zygote: @adjoint
@adjoint function gauss(x, μ, σ) 
    y = (x-μ)/σ
    e = exp(-y^2/2) / (sqrt(2π)*σ)
    function back(Δ)
        ey = e*y/σ # could pool e*Δ too
        # ∂/∂x = -e*y/σ, ∂/∂μ = e*y/σ, ∂/∂σ = e*(y^2 - 1)/σ
        (-ey * Δ, ey * Δ, e*(y^2 - 1)/σ * Δ)
    end
    return e, back
end

x = randn()
μ = randn()
σ = exp(randn())
f(x, μ, σ) = gauss(x, μ, σ)
@btime gradient(f, $x, $μ, $σ)
@btime gradient(gauss, $x, $μ, $σ) # 3-4 times slower
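
Since a hand-written adjoint is only useful if it is correct, it may be worth checking the pullback numerically before benchmarking it. The following is a minimal sketch in plain Julia (no Zygote needed) that compares the pullback against central finite differences. The helper names gauss_with_pullback and central_diff are hypothetical, introduced only for this illustration, and the σ component is written as e*(y^2 - 1)/σ, which is what differentiating gauss by hand gives.

```julia
# Sanity check of a hand-written pullback against central finite differences.
# `gauss_with_pullback` and `central_diff` are hypothetical helper names.

gauss(x, μ, σ) = exp(-((x - μ) / σ)^2 / 2) / (sqrt(2π) * σ)

# Same structure as the @adjoint body: return the value and a pullback closure.
function gauss_with_pullback(x, μ, σ)
    y = (x - μ) / σ
    e = exp(-y^2 / 2) / (sqrt(2π) * σ)
    back(Δ) = (-e * y / σ * Δ, e * y / σ * Δ, e * (y^2 - 1) / σ * Δ)
    return e, back
end

# Central difference of f with respect to the i-th entry of the tuple `args`.
function central_diff(f, args, i; h = 1e-6)
    up = ntuple(j -> j == i ? args[j] + h : args[j], length(args))
    dn = ntuple(j -> j == i ? args[j] - h : args[j], length(args))
    return (f(up...) - f(dn...)) / (2h)
end

args = (0.3, -0.2, 1.7)            # arbitrary test point with σ > 0
val, back = gauss_with_pullback(args...)
analytic = back(1.0)               # gradient with respect to (x, μ, σ)
numeric = ntuple(i -> central_diff(gauss, args, i), 3)
max_err = maximum(abs.(analytic .- numeric))
println("max abs error: ", max_err)
```

With Zygote loaded, the same comparison can be made by checking gradient(gauss, x, μ, σ) against the finite-difference tuple; a large error in one component usually points to a sign or factor mistake in that entry of the pullback.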
