Different loss with Zygote when taking gradients vs. without

holylorenzo · December 20, 2019, 6:13pm

I am working on the newest Flux branch and noticed some weird behavior.

When I calculate the loss of my model’s outputs I get the following number:

> y_pred = model(x)
> loss(y_pred, y)
0.003791215f0

However, doing the same calculation while taking gradients gives a different result:

> ps = Params(params(model))
  gradient(ps) do
      y_pred = model(x)
      l = loss(y_pred, y)
      println(l)
      return l
  end

0.035433643

Maybe there is something obvious I’m missing, otherwise I’ll probably have to put together a minimum working example.
Thanks for any help!

Topic		Replies	Views
Flux Custom Loss Function Not Working Properly Machine Learning flux , zygote	20	2243	April 2, 2021
Why calculating gradients from Params is different than doing it directly? General Usage flux	5	716	April 28, 2020
Why doesn't the loss calculated by Flux `withgradient` match what I have calculated? Machine Learning question , flux	2	254	January 26, 2024
Flux/Zygote: Gradient with respect to inputs and implicit parameters (in 2021) Machine Learning question , flux , zygote	1	974	November 23, 2021
Gradient error in Flux model inputs Machine Learning question , flux , zygote	5	1324	January 13, 2021