Zygote gradient

wormholestds9 · July 24, 2019, 3:27am

Zygote gradient seems to return an odd result for a constant function. That is,
f1 = (x) → 0.0
gradient(f1,0) = (nothing,)
but
f2 = (x) → x^2
gradient(f2, 0) = (0,)

Why does the first one return “nothing” instead of 0, which is what I expected.

mcabbott · July 24, 2019, 7:18am

Zygote uses nothing as a kind of strong zero, since the derivative of f1 is always zero, while that of f2 depends on x and is only zero for this particular value.

The reason to make this distinction is that if you give it f1(g(x,y)), it knows never to bother working out the gradient of g at all, as it will never be needed. But for f2(g(x,y)), the gradient function which gets compiled must allow for g(x,y) being nonzero.

wormholestds9 · July 24, 2019, 9:58am

Thank you.

Topic		Replies	Views
Strange behavior of differentiating constant functions in Zygote New to Julia	1	329	December 23, 2019
[SOLVED] Nothing returned in the Zygote derivative New to Julia zygote	1	442	January 2, 2021
Factor of two in Zygote complex gradient Optimization (Mathematical) zygote	5	962	March 8, 2022
How can I use Zygote to get the gradient New to Julia	6	676	September 25, 2020
Relatively complex function breaking with Zygote - what limitations exist for differential programming? General Usage question , zygote	5	525	September 1, 2021

Zygote gradient

Related topics