Hi, I am writing a toy model to test the performance of Flux.jl. I generated some dummy data with the following code import numpy as np traindata=np.random.random((10000,50)) target=np.random.random(10000) np.savetxt("traindata.csv",traindata,delimiter=',') np.savetxt("target.csv",target,delimit…

It seems to work with a batch size of 32 (and still a relu activation function) using Base.Iterators: repeated using CSV,Random,Printf using Flux using Flux: glorot_uniform traindata=Matrix(CSV.read("traindata.csv"; header=false))' target=Matrix(CSV.read("target.csv"; header=false))' model=Chain(…

As there is no relationship between input and output, the best the NN can do is to return the mean i.e. 0.5. So the expected MSE is 1/12 = 0.083333 (the variance of a uniform standard distribution). So it seems that tensorflow gives the correct result. But flux seems to still give random numbers whi…

As a test I would try with a different activation function as rely has a zero gradient for negative values.

Thank you. As you said I tried a linear activation (in tensorflow it’s 'linear' and in Flux.jl it’s 'identity'), the trend remains the same. In Flux.jl the loss is loss(traindata, target) = 0.26388273830343645 (tracked) loss(traindata, target) = 0.254745100985269 (tracked) loss(traindata, target)…

Friends dont let friends use minibatches larger than 32

This is the reason. The official document of flux.jl seems not to mention how to set the batchsize. Perhaps I should open an issue to ask them to add the information. Thank you very much!

I agree, it is not so obvious to find such information.

Thanks you very much, I had the same problem and I saw your post.

I believe this code also needs using Base.Iterators:partition, otherwise, partition is not defined. Thanks for the nice minibatch example.

I am new in Julia and Flux world and I would like to test a simple neural network: My training data are training_X training_Y size(training_X) # 10000 times pre-calculated profile (lenght 80) for three parameters (10000,80) size(training_Y) (10000,3) My network is like this: using Flux, S…

The same network performs differently in Flux.jl and tensorflow

Specific Domains Machine Learning

Alexander-Barth September 4, 2019, 6:12pm 5

Could the batch size be an issue? It seems that keras defaults to 32 if unspecified (The Model class).

Why the result from Flux.jl is totally different from tf.Keras (with the same simple MLP)

Topic		Replies	Views
Flux results not similar to Tensorflow Machine Learning question	3	1866	March 11, 2019
Why the result from Flux.jl is totally different from tf.Keras (with the same simple MLP) Machine Learning question , package	6	1554	December 3, 2019
Slow LSTM on GPU in Flux Machine Learning gpu , flux , pytorch	21	2430	February 15, 2024
Different behaviour between Flux.jl and Pytorch Machine Learning machine-learning	17	2526	February 13, 2021
Flux ready for a beginner deep learning project? Machine Learning flux	31	8993	June 20, 2019

The same network performs differently in Flux.jl and tensorflow

Related topics