NaN errors in Flux

Hi all,

I am trying to replicate a Likelihood Approximation Network (LAN) with Flux.jl. LANs are used to learn the likelihood function of computational models whose likelihood is intractable. As a proof of concept, I am applying the method to two simple models for which the likelihood function is known: a Gaussian model and a decision model called the Linear Ballistic Accumulator (LBA). I was able to develop a working LAN for the Gaussian model, but the LAN for the LBA produces NaN predictions. I tried various fixes from other threads, such as decreasing the learning rate and adding BatchNorm layers, but neither solved the problem. Switching the activation function to relu eliminated the NaNs, but it also interfered with the network's ability to learn the likelihood function.
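For context, my setup looks roughly like the sketch below. The layer sizes, optimizer settings, and loss function are simplified placeholders, not my exact script:

```julia
using Flux

# A small MLP with tanh activations, mapping model parameters (plus data)
# to an approximate log-likelihood. Layer sizes are placeholders.
model = Chain(
    Dense(5 => 100, tanh),
    Dense(100 => 100, tanh),
    Dense(100 => 1),          # linear output for the log-likelihood
)

# Reduced learning rate, one of the fixes I tried
opt_state = Flux.setup(Adam(1f-4), model)

# x: 5×N Float32 input matrix, y: 1×N matrix of target log-likelihoods
function train_step!(model, opt_state, x, y)
    loss, grads = Flux.withgradient(model) do m
        Flux.huber_loss(m(x), y)
    end
    Flux.update!(opt_state, model, grads[1])
    return loss
end
```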

Can I do anything to fix this problem? Please let me know if there are more details I can provide.

Just to reiterate an old point of discussion: signaling NaNs would help you discover the root of the problem…

Indeed, it would be helpful to know where the NaN originated. As far as I can tell, I replicated the procedure described in the paper, which makes me wonder whether the problem lies in the automatic differentiation (AD).
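In the meantime, I have been trying to narrow it down with a crude layer-by-layer check, something like this rough sketch (it assumes the model is a plain `Chain`; `find_first_nonfinite` is just a throwaway helper):

```julia
# Crude NaN localization: run the forward pass one layer at a time and
# report the first layer whose output contains a non-finite value.
function find_first_nonfinite(model, x)
    h = x
    for (i, layer) in enumerate(model.layers)
        h = layer(h)
        all(isfinite, h) || return i   # index of the offending layer
    end
    return nothing                     # forward pass is clean
end
```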

Could someone tell me whether I misspecified the NN model, or whether the NaNs are more likely due to a bug?

I tracked the problem down to a few large outliers in the training data, which produced a NaN when passed through tanh. So far, removing the outliers seems to have solved the problem.
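In case it is useful to anyone else, the filtering step looks roughly like the sketch below; the z-score cutoff `k = 5` is an arbitrary choice on my part, not a principled one:

```julia
using Statistics

# Drop training columns containing extreme feature values, i.e. any value
# more than k standard deviations from that feature's mean.
# x is a d×N feature matrix, y is the matching 1×N target matrix.
function drop_outliers(x, y; k = 5)
    μ = mean(x; dims = 2)
    σ = std(x; dims = 2)
    keep = vec(all(abs.(x .- μ) .< k .* σ; dims = 1))
    return x[:, keep], y[:, keep]
end
```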